Staff Software Engineer (Machine Learning Platform, Inference)
Snowflake Inc.
Menlo Park, United States of America
10 days ago
Role details
Contract type: Permanent contract
Employment type: Full-time (> 32 hours)
Working hours: Regular working hours
Languages: English
Experience level: Senior
Compensation: $339K
Job location: Remote; Menlo Park, United States of America
Tech stack
Artificial Intelligence
Data Infrastructure
Machine Learning
TensorFlow
Azure
PyTorch
Large Language Models
Snowflake
Scikit Learn
Information Technology
XGBoost
Machine Learning Operations
TensorRT
Job description
- The Snowflake Machine Learning Platform team's mission is to enable customers to bring their machine learning and deep learning workloads to Snowflake
- Our customers want to build powerful models with the ever-increasing data in Snowflake but face several challenges including infrastructure optimizations, orchestration, performance, and security
- The team aims to solve these challenges by building highly integrated platform solutions that are simple, secure, and enable end-to-end ML workflows
- We are early in our journey to build the most scalable machine learning and data platform without sacrificing the benefits of a single platform and governance
- We are looking for outstanding technical leaders who will join our ML Platform team to build the next-generation platform and play a pivotal role in this journey by understanding Snowflake's core platform architecture and evolving it to enable state-of-the-art machine learning and LLM workloads
- Join us to define strategies, set technical directions, design and execute, engage and deliver innovation, and unlock the power of AI for thousands of enterprise customers
- Help define and own the roadmap, working collaboratively and proactively with senior architects, PMs, and team leadership. The initiatives include platforms and tools that enable customers to do state-of-the-art machine learning on Snowflake natively
- Collaboratively build and execute a vision for incorporating new advances in machine learning in ways that best achieve the team's business objectives
- Ensure operational excellence of the services and meet the commitments to our customers regarding reliability, availability, and performance
- Collaborate across other ML partner teams to continuously improve ML development velocity and capabilities at Snowflake
- Support team members in delivering a high level of technical quality
Requirements
- Have 7+ years of industry experience designing, building, and supporting Internet serving infrastructure, machine learning platforms, machine learning services, and frameworks
- Experience serving fine-tuned LLMs (PEFT, DPO, RL)
- Experience serving LLMs using inference engines like vLLM, TensorRT-LLM, TEI, and SGLang, and an understanding of the tradeoffs between them
- BS/MS/PhD in Computer Science or related majors, or equivalent experience
- Have built a roadmap and vision around machine learning teams, and led technical decision making with the help of architects, PMs, and the team
- Experience with several of the following frameworks is a plus: scikit-learn, XGBoost, PyTorch, TensorFlow, MLflow
- Strong track record of working with machine learning systems and/or platforms
- Previous experience in building batch and real-time ML serving systems preferred
Benefits & conditions
- Comprehensive health insurance plans
- Health savings accounts
- Robust retirement plans
- Life and disability insurance
- Weekly online lunch and learns
- Virtual workout classes
- Ergonomic work-from-home equipment
- On-demand mental health and wellness programs
- Fertility benefits and family planning resources
- Generous time-off and various leave plans
- Onsite and Remote Work
- Employee discounts and pre-tax selections
- New hire equity + Employee Stock Purchase Plan (ESPP)
- Quarterly bonus or commission program