Software Engineer, Data Infrastructure & Acquisition - Richmond, VA, USA
Job description
The Software Engineer, Data Infrastructure & Acquisition is responsible for managing and enhancing the data collection processes that support AI model training. This role involves developing and maintaining large-scale, cost-effective datasets by integrating infrastructure, engineering, and research efforts. The position plays a critical role in advancing the organization's AI capabilities to improve next-generation consumer and enterprise products through data acquisition and pipeline optimization.
Responsibilities
- Identify and integrate new audio data sources into the ingestion pipeline
- Operate and enhance cloud infrastructure for data ingestion using GCP and Terraform
- Collaborate with AI scientists to optimize data quality, scale, and cost
- Contribute to the AI team's dataset roadmap in coordination with leadership
Requirements
- BS/MS/PhD in Computer Science or related field
- Minimum 5 years of software development experience
- Proficiency in bash/Python scripting within Linux environments
- Experience with Docker, Infrastructure-as-Code, and at least one major cloud provider (preferably GCP)
- Familiarity with web crawlers and large-scale data processing workflows is advantageous
- Ability to manage multiple priorities and adapt to changes
- Strong written and verbal communication skills
Benefits & conditions
- Pay range and compensation package: The United States base salary range for this full-time position is $140,000 to $200,000, plus bonus and equity, depending on experience.
- Benefits and perks: Not specified
Equal Opportunity Statement: Our client is an equal opportunity employer. They celebrate diversity and are committed to creating an inclusive environment for all employees. All qualified applicants will receive consideration for employment without regard to race, color, religion, gender, gender identity or expression, sexual orientation, or national origin.