Senior ML Infrastructure Engineer - Training Algorithms, SIML

Apple Inc.

Seattle, United States of America

6 days ago

Role details

Contract type

Permanent contract

Employment type

Full-time (> 32 hours)

Working hours

Regular working hours

Languages

English

Experience level

Senior

Job location

Seattle, United States of America

Tech stack

Algorithm Design

Computer Programming

Computer Engineering

Distributed Computing Environment

Machine Learning

Software Systems

Software Organization

PyTorch

Large Language Models

Generative AI

Infrastructure Automation Frameworks

Information Technology

Machine Learning Operations

Job description

Are you passionate about Generative AI? Are you interested in working on groundbreaking generative modeling technologies to enrich billions of people? We are the Intelligence System Experience (ISE) team within Apple's software organization. The team operates at the intersection of multimodal machine learning and system experiences. Our multidisciplinary ML teams focus on a broad spectrum of areas, including Visual Generative Foundation Models, Multimodal Understanding, Visual Understanding of People, Text, Handwriting, and Scenes, Personalization, Knowledge Extraction, Conversation Analysis, Behavioral Modeling for Proactive Suggestions, and Privacy-Preserving Learning. These innovations form the foundation of the seamless, intelligent experiences our users enjoy every day.

We are seeking engineers experienced in building infrastructure for training, adapting and deploying large-scale generative models. In this role, you will be working with closely with a cross functional team of algorithm design and infrastructure engineers to benchmark, prototype and steer algorithmic choices to best fit our training & deployment infrastructure., In this role you will be technically hands on, with deep subject matter expertise in ML infrastructure.

Responsibilities Include:

Training optimizations & profiling targeting vision/language pre-training
Researching training recipes for effective scheduling of multimodal training workloads
Experimentation & tooling for post-training ablations including reward modeling, distillation and prompt optimization
Coordinating with post-training algorithm owners for analyzing quality / performance tradeoffs of downstream capabilities
Ablations involving optimization aware fine-tuning

Requirements

Bachelors, Masters, or PhD in Electrical Engineering/Computer Science or a related field (mathematics, physics or computer engineering), with a focus on machine learning; or comparable professional experience
Experienced in training / adapting LLM and Diffusion models
Advanced Fluency in PyTorch
Excellent programming skills and experience contributing software to large projects
Experience with distributed training of large models

Preferred Qualifications

Strong ML Fundamentals
Experience working with large cross-functional and diverse teams.

About the company

Apple

Role details

Job location

Tech stack

Job description

Requirements

About the company

Apply for this position

Good distractions

Moments

Videos View all