Software Engineer, Data Infrastructure & Acquisition - Durham, NC, USA
Role details
Job location
Tech stack
Job description
The Software Engineer, Data Infrastructure & Acquisition is responsible for managing data collection processes that support model training operations on the AI team. This role is key to building high-quality, large-scale datasets by integrating infrastructure, engineering, and research efforts. The position focuses on sourcing and ingesting audio data, maintaining and expanding cloud infrastructure, and collaborating with scientists and leadership to develop the dataset roadmap, ultimately contributing to the development of next-generation AI models and products., * Identify and acquire new audio data sources for ingestion
Requirements
-
BS, MS, or PhD in Computer Science or related field
-
5+ years of software development experience in industry
-
Proficiency with bash and Python scripting in Linux environments
-
Experience with Docker, Infrastructure-as-Code, and major cloud providers (GCP preferred)
-
Familiarity with web crawlers and large-scale data processing workflows is a plus
-
Ability to manage multiple tasks and adapt to changing priorities
-
Strong written and verbal communication skills
Benefits & conditions
- The United States base salary range for this full-time position is $140,000-$200,000 plus bonus and equity, depending on experience