Software Engineer, Data Infrastructure & Acquisition - Boise, ID, USA
REMOTE HAND
Boise, United States of America
yesterday
Role details
Contract type
Permanent contract Employment type
Part-time (≤ 32 hours) Working hours
Regular working hours Languages
English Experience level
Senior Compensation
$ 200KJob location
Remote
Boise, United States of America
Tech stack
Artificial Intelligence
Bash
Big Data
Cloud Computing
Data Infrastructure
Linux System Administration
Software Engineering
Web Crawlers
Scripting (Bash/Python/Go/Ruby)
Data Ingestion
Information Technology
Terraform
Docker
Job description
- About the Opportunity: The Software Engineer, Data Infrastructure & Acquisition role is focused on supporting the AI team''s data collection efforts for model training. This position is critical in developing and managing large-scale, cost-effective datasets that improve the quality and scale of AI models powering future consumer and enterprise products. The role collaborates closely with scientists and leadership to enhance data acquisition pipelines and infrastructure., * Source new audio data and integrate it into ingestion pipelines
- Maintain and expand cloud infrastructure for data ingestion on GCP using Terraform
- Work with scientists to optimize cost, throughput, and data quality
- Develop the dataset roadmap in collaboration with the AI team and leadership
Requirements
- BS/MS/PhD in Computer Science or related field
- Minimum 5 years of software development experience
- Proficiency in bash and Python scripting within Linux environments
- Experience with Docker, Infrastructure-as-Code, and at least one major cloud provider (GCP preferred)
- Knowledge of web crawlers and large-scale data processing workflows is advantageous
- Ability to manage multiple priorities and adapt to change
- Strong written and verbal communication skills
Benefits & conditions
- Pay Range and Compensation Package:
- United States base salary range: $140,000-$200,000 plus bonus and equity, depending on experience
About the company
1. About Our Client: The organization operates in the text-to-speech technology space, addressing the challenge of reading accessibility. Its products convert various written formats into audio, enabling users to read faster and retain more information. Serving over 50 million users worldwide, the company offers multiple applications across platforms including iOS, Android, Mac, Chrome Extension, and Web. The team is fully distributed, consisting of nearly 200 professionals with backgrounds from major technology companies and academic institutions.