Software Engineer, Data Infrastructure & Acquisition - Birmingham, AL, USA
Role details
Job location
Tech stack
Job description
The Software Engineer, Data Infrastructure & Acquisition will focus on data collection to support AI model training. This role is crucial for building high-quality, large-scale datasets by integrating infrastructure, engineering, and research efforts. The position involves enhancing data ingestion pipelines, optimizing data quality and cost, and collaborating with AI scientists and leadership to develop the dataset roadmap that powers the organization's next-generation products.
- Responsibilities:
-
Identify and acquire new sources of audio data for ingestion
-
Operate and expand cloud infrastructure for data pipelines using GCP and Terraform
-
Collaborate with scientists to improve data quality, throughput, and cost efficiency
-
Work with AI team and leadership to plan and execute dataset development strategies
Requirements
-
BS/MS/PhD in Computer Science or related field
-
5+ years of software development experience
-
Proficiency with bash and Python scripting in Linux environments
-
Experience with Docker, Infrastructure-as-Code, and at least one major cloud provider (preferably GCP)
-
Experience with web crawlers and large-scale data processing workflows is a plus
-
Ability to manage multiple tasks and adapt to changing priorities
-
Strong written and verbal communication skills
Benefits & conditions
- The United States base salary range for this full-time position is $140,000-$200,000 plus bonus and equity depending on experience