Software Engineer, Data Infrastructure & Acquisition - Durham, NC, USA

REMOTE HAND
Durham, United States of America
3 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Senior
Compensation
$ 200K

Job location

Remote
Durham, United States of America

Tech stack

Artificial Intelligence
Bash
Big Data
Cloud Computing
Data Infrastructure
Linux System Administration
Software Engineering
Web Crawlers
Scripting (Bash/Python/Go/Ruby)
Information Technology
Docker

Job description

The Software Engineer, Data Infrastructure & Acquisition is responsible for managing data collection processes that support model training operations on the AI team. This role is key to building high-quality, large-scale datasets by integrating infrastructure, engineering, and research efforts. The position focuses on sourcing and ingesting audio data, maintaining and expanding cloud infrastructure, and collaborating with scientists and leadership to develop the dataset roadmap, ultimately contributing to the development of next-generation AI models and products., * Identify and acquire new audio data sources for ingestion

Requirements

  • BS, MS, or PhD in Computer Science or related field

  • 5+ years of software development experience in industry

  • Proficiency with bash and Python scripting in Linux environments

  • Experience with Docker, Infrastructure-as-Code, and major cloud providers (GCP preferred)

  • Familiarity with web crawlers and large-scale data processing workflows is a plus

  • Ability to manage multiple tasks and adapt to changing priorities

  • Strong written and verbal communication skills

Benefits & conditions

  • The United States base salary range for this full-time position is $140,000-$200,000 plus bonus and equity, depending on experience

About the company

The organization operates in the text-to-speech technology space, addressing the challenge of reading accessibility. Its products convert various reading materials such as PDFs, books, documents, and web content into audio, enabling users to read faster and retain more information. Serving over 50 million users, the organization offers multiple platforms including mobile apps, desktop applications, and browser extensions. It operates fully remotely with a global, distributed team comprising engineers, AI researchers, and professionals from major tech companies and top academic programs. The organization has received industry recognition for its innovation and inclusivity.

Apply for this position