ML Engineering Manager Voice Models - ASR/STT
Role details
Job location
Tech stack
Job description
As an ML Engineering Manager, your mission will be to lead the team delivering the ASR backbone that powers our AI products, helping health professionals save time on documentation and focus more on patient care. You will be working in a feature team developing the speech recognition technology for Doctolib's AI-powered solutions including Consultation Assistant and Phone Assistant.
Working in the tech team at Doctolib involves building innovative products and features to improve the daily lives of care teams and patients. We work in feature teams in an agile environment, while collaborating with product, design, and business teams.
Your responsibilities include but are not limited to:
- Own the ASR roadmap end-to-end: model design, training, evaluation, and product integration for medical-grade speech recognition
- Lead and mentor a team of STT experts; foster a high bar for research rigor, code quality, and operational excellence
- Partner with MLOps to ensure training and inference pipelines are scalable, cost-efficient, and reliable in production
- Collaborate with product, design, and clinical teams to translate user needs into measurable technical objectives
- Drive continuous improvements to WER, medical term error rate, latency, diarization, domain adaptation, and multilingual performance
About our tech environment
- Our solutions are built on a single fully cloud-native platform that supports web and mobile app interfaces, multiple languages, and is adapted to the country and healthcare specialty requirements. To address these challenges, we are modularizing our platform run in a distributed architecture through reusable components.
- Our stack is composed of Rails, TypeScript, Java, Python, Kotlin, Swift, and React Native.
- We leverage AI ethically across our products to empower patients and health professionals. Discover our AI vision here and learn about our first AI hackathon here!
Requirements
Do you have experience in Swift?, Do you have a Master's degree?, * You have a Master's or Ph.D. degree in Computer Science, Data Science, or a related field
- You have at least 5 years of experience in ML with deep expertise in ASR/Speech-to-Text (end-to-end or hybrid), including streaming STT and real-time constraints
- You have hands-on experience with modern speech stacks: CTC/Transducer/Attention, Conformer/Whisper-style models, tokenizer/LM integration, diarization, and voice activity detection
- You have strong PyTorch skills and production ML experience: model serving, monitoring, A/B testing, rollback, and incident response in partnership with MLOps
- You are fluent in English
Now it would be fantastic if you have:
- Experience with multilingual ASR, on-device or low-latency inference, telephony audio, or medical domain adaptation
- Demonstrated leadership experience managing technical teams
- A passion for pushing the boundaries of speech recognition and AI in healthcare
Benefits & conditions
-
Free comprehensive health insurance for you and your children
-
25 days of paid leave and up to 14 RTT days per year
-
Parent Care Program: additional leave on top of the legal parental leave
-
Free mental health and coaching services through our partner Moka.care
-
Meal vouchers worth €8.5 per day, with €4.5 covered by Doctolib
-
For caregivers and workers with disabilities, a package including an adaptation of the remote policy, extra days off for medical reasons, and psychological support
-
Work from EU countries and the UK for up to 10 days per year, thanks to our flexibility days policy 50% reimbursement of public transportation subscription
-
A work Council subsidy to refund part of a sport club membership or a creative class
-
Lunch voucher with Swile card