Staff Software Engineer (Machine Learning Platform, Inference)

Snowflake Inc.

Menlo Park, United States of America

1 month ago

Role details

Contract type

Permanent contract

Employment type

Full-time (> 32 hours)

Working hours

Regular working hours

Languages

English

Experience level

Senior

Compensation

$339K

Job location

Remote

Menlo Park, United States of America

Tech stack

Artificial Intelligence

Data Infrastructure

Machine Learning

TensorFlow

Azure

PyTorch

Large Language Models

Snowflake

Scikit Learn

Information Technology

XGBoost

Machine Learning Operations

TensorRT

Job description

The Snowflake Machine Learning Platform team's mission is to enable customers to bring their machine learning and deep learning workloads to Snowflake.
Our customers want to build powerful models with the ever-increasing data in Snowflake but face several challenges, including infrastructure optimization, orchestration, performance, and security.
The team aims to solve these challenges by building highly integrated platform solutions that are simple, secure, and enable end-to-end ML workflows.
We are early in our journey to build the most scalable machine learning and data platform without sacrificing the benefits of a single platform and governance.
We are looking for outstanding technical leaders to join our ML Platform team, build the next-generation platform, and play a pivotal role in this journey by understanding Snowflake's core platform architecture and evolving it to enable state-of-the-art machine learning and LLM workloads.
Join us to define strategy, set technical direction, design and execute, deliver innovation, and unlock the power of AI for thousands of enterprise customers.
Help define and own the roadmap, working collaboratively and proactively with senior architects, PMs, and team leadership. The initiatives include platforms and tools that enable customers to do state-of-the-art machine learning natively on Snowflake.
Collaboratively build and execute a vision for incorporating new advances in machine learning in ways that best achieve the team's business objectives
Ensure operational excellence of the services and meet the commitments to our customers regarding reliability, availability, and performance
Collaborate across other ML partner teams to continuously improve ML development velocity and capabilities at Snowflake
Support team members in delivering a high level of technical quality

Requirements

Have 7+ years of industry experience designing, building, and supporting Internet serving infrastructure, machine learning platforms, machine learning services, and frameworks
Experience serving fine-tuned LLMs (PEFT, DPO, RL)
Experience in serving LLMs using inference engines like vLLM, TensorRT-LLM, TEI, SGLang, and knowing tradeoffs between them
BS/MS/PhD in Computer Science or related majors, or equivalent experience
Have built a roadmap and vision for machine learning teams, and led technical decision-making with the help of architects, PMs, and the team
Experience with several of the following frameworks is a plus: SKLearn, XGBoost, PyTorch, TensorFlow, MLflow
Strong track record of working with machine learning systems and/or platforms
Previous experience in building batch and real-time ML serving systems preferred

Benefits & conditions

Comprehensive health insurance plans
Health savings accounts
Robust retirement plans
Life and disability insurance
Weekly online lunch and learns
Virtual workout classes
Ergonomic work-from-home equipment
On-demand mental health and wellness programs
Fertility benefits and family planning resources
Generous time-off and various leave plans
Onsite and Remote Work
Employee discounts and pre-tax selections
New hire equity + Employee Stock Purchase Plan (ESPP)
Quarterly bonus or commission program
