Software Engineer, Data Infrastructure & Acquisition - Austin, TX, USA
Job description
The Software Engineer, Data Infrastructure & Acquisition manages and expands the data collection systems that support AI model training. Working with infrastructure, engineering, and research teams, the engineer develops and maintains large-scale, cost-effective datasets. The role is central to improving the quality and scale of the data behind next-generation consumer and enterprise products, and it directly affects the organization's AI capabilities and product innovation.
Responsibilities
- Identify and source new audio data for ingestion pipelines
- Maintain and expand cloud infrastructure for data pipelines using GCP and Terraform
- Work with scientists to improve data cost efficiency, throughput, and quality
- Collaborate with the AI team and leadership to develop the dataset roadmap for AI projects
Requirements
- BS/MS/PhD in Computer Science or related field
- 5+ years of software development experience
- Proficiency with bash and Python scripting in Linux environments
- Experience with Docker, Infrastructure-as-Code, and major cloud providers (GCP preferred)
- Familiarity with web crawlers and large-scale data processing workflows is a plus
- Ability to manage multiple priorities and adapt to change
- Strong written and verbal communication skills
Benefits & conditions
- The United States base salary range for this full-time position is $140,000-$200,000 plus bonus and equity, depending on experience