Sr Software Engineer, Audio Intelligence
Job description
We are seeking a Sr Audio Intelligence Engineer to build conversational and audio AI experiences for the smart home. This role will develop real-time speech, audio understanding, and multimodal interaction systems across mobile, panel, camera, and future agentic experiences. In this role, you will:
- Design and build conversational and audio AI experiences using STT, TTS, LLMs, audio understanding, and multimodal context.
- Optimize latency, reliability, privacy, grounding, and user experience for real-time interactions.
- Build evaluation frameworks for conversational quality, task completion, audio understanding, customer satisfaction, and safety.
- Integrate audio intelligence with agentic tool use, memory, personalization, and smart-home context.
- Evaluate and benchmark emerging speech, audio, and multimodal AI technologies.
- Deploy and monitor production audio and conversational AI services at scale.
Requirements
- Bachelor's degree in Computer Science, Software Engineering, AI/ML, or a related technical field, and 5+ years of professional experience in software development, applied science, or ML engineering; or
- Master's degree in Computer Science, Software Engineering, AI/ML, or a related technical field, and 2+ years of professional experience in software development, applied science, or ML engineering
- Experience with speech-to-text, text-to-speech, audio understanding, or conversational AI systems
- Strong Python software engineering skills
- Experience integrating LLMs or multimodal models into production applications
- Familiarity with prompt engineering, evaluation, conversation design, latency optimization, and cloud AI services
- Experience with Git, CI/CD, production monitoring, and cross-functional product collaboration
Preferred Qualifications
- Experience with wake word, streaming audio, real-time inference, ASR/NLU frameworks, or ambient sound classification
- Experience building consumer-facing voice, assistant, smart-home, or conversational products
- Familiarity with edge AI, embedded devices, IoT, Linux-based systems, or hybrid edge/cloud architectures
- Experience with privacy-aware audio systems, grounding, safety, and customer-trust mechanisms
- Experience with Docker, Kubernetes, GCP/AWS, or scalable AI service deployment
Benefits & conditions
The base salary range for this position is $150K to $180K. This range represents the low and high end of the salary range for this position. Actual salaries will vary based on several factors, including but not limited to location, experience, and performance. The range listed is just one component of the total compensation package for employees. Other rewards may include annual bonus, short- and long-term incentives, and program-specific awards. In addition, the position may be eligible to participate in the benefits program, which includes, but is not limited to, medical, vision, dental, 401(k), and flexible spending accounts.