Software Engineer, Data Infrastructure & Acquisition - Tacoma, WA, USA
Role details
Job location
Tech stack
Job description
The Software Engineer, Data Infrastructure & Acquisition role supports the AI team by managing all aspects of data collection for model training. The position focuses on building and maintaining scalable and cost-efficient data ingestion pipelines to supply high-quality datasets at petabyte scale. This role collaborates closely with scientists and leadership to enhance data acquisition strategies that underpin the development of next-generation AI-powered products.
- Responsibilities:
-
Identify and integrate new audio data sources into the ingestion pipeline
-
Operate and extend cloud infrastructure for data ingestion, primarily on Google Cloud Platform using Terraform
-
Collaborate with scientists to optimize cost, throughput, and data quality
-
Participate in defining the AI team''s dataset roadmap for consumer and enterprise products
Requirements
-
BS, MS, or PhD in Computer Science or related field
-
Minimum 5 years of professional software development experience
-
Proficiency in bash/Python scripting within Linux environments
-
Experience with Docker, Infrastructure-as-Code, and at least one major cloud provider, preferably GCP
-
Familiarity with web crawlers and large-scale data processing workflows is a plus
-
Ability to manage multiple tasks and adapt to changing priorities
-
Strong written and verbal communication skills
Benefits & conditions
- The United States base salary range for this full-time position is $140,000-$200,000 plus bonus and equity, depending on experience