Software Engineer, Data Infrastructure & Acquisition - Menlo Park, CA, USA
Job description
The Software Engineer, Data Infrastructure & Acquisition role focuses on supporting AI model training by managing and enhancing data collection processes. The position involves developing scalable, cost-effective data ingestion systems and collaborating with AI scientists and leadership to improve and expand datasets that drive next-generation consumer and enterprise products.
Responsibilities
- Identify and integrate new audio data sources into the ingestion pipeline
- Operate and develop cloud infrastructure for data ingestion using GCP and Terraform
- Collaborate with scientists to improve data quality, scale, and cost efficiency
- Work with the AI team and leadership to define the dataset roadmap for future products
Requirements
- BS/MS/PhD in Computer Science or related field
- Over five years of software development experience
- Proficiency with bash and Python scripting in Linux environments
- Experience with Docker, Infrastructure-as-Code, and major Cloud Providers (GCP preferred)
- Familiarity with web crawlers and large-scale data processing workflows is advantageous
- Ability to manage multiple tasks and adapt to changing priorities
- Strong written and verbal communication skills
Benefits & conditions
- The United States base salary range for this full-time position is $140,000-$200,000 plus bonus and equity, depending on experience