Senior ML Engineer
Role details
Job location
Tech stack
Job description
We're hiring a Senior ML Engineer to own the productionization layer of Invoca's ML stack - model serving, inference optimization, fine-tuning, and the APIs and pipelines that tie it all together. You'll be a primary driver of the infrastructure powering our Context Engine and agentic AI workflows, working closely with Data Scientists, Data Engineers, and Applied AI Engineers.
Core Focus & Primary Ownership
- Lead End-to-End MLOps and Productionization: Architect, implement, and maintain CI/CD pipelines for ML artifacts - including model evaluation, versioning, and automated deployment. Serve as the primary SME for operational excellence across the Invoca ML stack.
- Design and Optimize SLM/LLM Deployment: Own the full inference infrastructure: model serving on Triton Inference Server, Baseten, and Kubernetes-based GPU infrastructure. Profile and tune for low latency and high throughput, and build robust, scalable APIs for internal and external model access.
Broader Contributions
- Fine-Tune Language Models: Apply parameter-efficient fine-tuning methods (LoRA, QLoRA, PEFT) to adapt transformer-based SLMs and LLMs for high-impact NLP applications in conversation intelligence.
- Evolve ML Infrastructure: Contribute to model training infrastructure, data pipelines, and data lake foundations to keep the systems powering our models reliable and scalable.
- Collaborate Across Teams: Partner closely with Data Scientists, Data Engineers, and Applied AI Engineers to build the foundational ML systems behind Invoca's agentic AI products.
- Deliver Customer Value: Work with product and engineering to understand customer needs and ship ML solutions that make a measurable difference.
Requirements
Do you have experience in Machine learning libraries?, Do you have a Bachelor's degree?, * 5+ years of ML Engineering experience with a strong production focus
- Advanced Python and deep learning proficiency (PyTorch, HuggingFace Transformers, spaCy)
- Demonstrated track record deploying and maintaining transformer-based NLP models in production
- Hands-on experience fine-tuning SLMs/LLMs (LoRA, QLoRA, PEFT) and optimizing models via quantization, batching, and throughput tuning
- Proficiency with inference infrastructure: Triton, Baseten, vLLM, TGI, SageMaker, Vertex AI, or similar
- Experience building production-grade APIs that expose ML models to downstream consumers
- Familiarity with MLOps tooling, model monitoring, and eval platforms (Braintrust, MLflow, or equivalent)
- B.S. in Computer Science, Engineering, Statistics, or equivalent; advanced degree a plus
- Familiarity with RLHF or preference training is a bonus, Candidates must be based within ~2 hour drive of these areas. Occasional business travel may be required.
Benefits & conditions
3.13.1 out of 5 stars San Francisco, CA Remote $152,000 - $228,000 a year, Pulled from the full job description
- Health insurance
- 401(k) matching
- Paid time off
- Vision insurance
- Dental insurance
- Family leave
- Stock options, At Invoca, all new hires in the U.S. receive benefits starting on day one of employment. Our benefits offerings include:
Please note that benefits for teammates outside the U.S. may vary in accordance with their country's laws and regulations.
- Flexible Time Off - We encourage a healthy work-life balance. Our flexible paid time off policy allows you to recharge and take time away as needed.
- Paid Holidays - Invoca provides 16 U.S. paid holidays, including a winter break, giving you ample opportunity to refresh and spend time with friends and family.
- Health Benefits - Our healthcare program includes medical, dental, and vision coverage, with multiple plan options so you can choose what works best for you and your family. Fertility assistance is also included.
- Retirement - Invoca offers a 401(k) plan through Fidelity with a company match of up to 4%.
- Stock Options - All employees are invited to share in Invoca's success through stock options.
- Mental Health Program- Well-being support on a broad range of issues is available through our SpringHealth program.
- Paid Family Leave - Up to 6 weeks of 100% paid leave is provided for baby bonding, adoption, and caring for family members.
- Paid Medical Leave - Up to 12 weeks of 100% paid leave is provided for childbirth and medical needs.
- InVacation - As a thank-you to our long-term team members, we offer a bonus after 7 years of service.
- Wellness Subsidy - We provide a subsidy that can be applied toward gym memberships, fitness classes, and more.
- Position Base Range - Salary Range $152,000 - $228,000 USD plus bonus + equity
#LI-Remote