Software Engineer, Data Infrastructure & Acquisition - Los Angeles, CA, USA
Role details
Job location
Tech stack
Job description
The Software Engineer, Data Infrastructure & Acquisition is responsible for managing all aspects of data collection that support the organization's AI model training efforts. This role plays a key part in building and maintaining large-scale, high-quality datasets through close collaboration with scientists and engineers. The position contributes to advancing the cost efficiency, scale, and quality of data pipelines, directly impacting the development of next-generation consumer and enterprise AI products.
- Responsibilities:
-
Source new audio data and integrate it into the data ingestion pipeline
-
Operate and enhance cloud infrastructure for data ingestion, using GCP and Terraform
-
Collaborate with research scientists to improve data quality, throughput, and cost efficiency
-
Partner with AI team members and leadership to develop the dataset roadmap for future product development
Requirements
-
BS, MS, or PhD in Computer Science or a related field
-
5+ years of software development experience in industry
-
Proficiency with bash and Python scripting in Linux environments
-
Experience with Docker, Infrastructure-as-Code, and at least one major cloud provider (GCP preferred)
-
Experience with web crawlers and large-scale data workflows is a plus
-
Ability to manage multiple tasks and adapt to changing priorities
-
Strong written and verbal communication skills
Benefits & conditions
- The United States base salary range for this full-time position is $140,000 to $200,000, plus bonus and equity, depending on experience