Software Engineer, Data Infrastructure & Acquisition - Reston, VA, USA
Role details
Job location
Tech stack
Job description
The Software Engineer, Data Infrastructure & Acquisition is responsible for managing data collection to support AI model training. This role enables the organization to develop large-scale, high-quality datasets efficiently by integrating infrastructure, engineering, and research efforts. The position contributes directly to advancing data capabilities that power next-generation consumer and enterprise products, collaborating closely with AI scientists and leadership to shape the data roadmap.
- Responsibilities:
-
Identify and acquire new sources of audio data for ingestion
-
Operate and enhance cloud infrastructure for data pipelines on Google Cloud Platform managed with Terraform
-
Work with AI scientists to improve data quality, scale, and cost efficiency
-
Collaborate with the AI team and leadership to develop the dataset strategy for future products
Requirements
-
BS, MS, or PhD in Computer Science or related field
-
5+ years of professional software development experience
-
Proficiency in bash and Python scripting within Linux environments
-
Experience with Docker, Infrastructure-as-Code, and at least one major cloud provider (preferably GCP)
-
Familiarity with web crawlers and large-scale data processing workflows is a plus
-
Ability to manage multiple tasks and adapt to shifting priorities
-
Strong written and verbal communication skills
Benefits & conditions
- The United States base salary range for this full-time position is $140,000 to $200,000 plus bonus and equity, depending on experience