Senior ML Infrastructure Engineer - Training Algorithms, SIML

Apple Inc.
Seattle, United States of America
6 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Senior

Job location

Seattle, United States of America

Tech stack

Algorithm Design
Computer Programming
Computer Engineering
Distributed Computing Environment
Machine Learning
Software Systems
Software Organization
PyTorch
Large Language Models
Generative AI
Infrastructure Automation Frameworks
Information Technology
Machine Learning Operations

Job description

Are you passionate about Generative AI? Are you interested in working on groundbreaking generative modeling technologies to enrich billions of people? We are the Intelligence System Experience (ISE) team within Apple's software organization. The team operates at the intersection of multimodal machine learning and system experiences. Our multidisciplinary ML teams focus on a broad spectrum of areas, including Visual Generative Foundation Models, Multimodal Understanding, Visual Understanding of People, Text, Handwriting, and Scenes, Personalization, Knowledge Extraction, Conversation Analysis, Behavioral Modeling for Proactive Suggestions, and Privacy-Preserving Learning. These innovations form the foundation of the seamless, intelligent experiences our users enjoy every day.

We are seeking engineers experienced in building infrastructure for training, adapting and deploying large-scale generative models. In this role, you will be working with closely with a cross functional team of algorithm design and infrastructure engineers to benchmark, prototype and steer algorithmic choices to best fit our training & deployment infrastructure., In this role you will be technically hands on, with deep subject matter expertise in ML infrastructure.

Responsibilities Include:

  • Training optimizations & profiling targeting vision/language pre-training

  • Researching training recipes for effective scheduling of multimodal training workloads

  • Experimentation & tooling for post-training ablations including reward modeling, distillation and prompt optimization

  • Coordinating with post-training algorithm owners for analyzing quality / performance tradeoffs of downstream capabilities

  • Ablations involving optimization aware fine-tuning

Requirements

  • Bachelors, Masters, or PhD in Electrical Engineering/Computer Science or a related field (mathematics, physics or computer engineering), with a focus on machine learning; or comparable professional experience

  • Experienced in training / adapting LLM and Diffusion models

  • Advanced Fluency in PyTorch

  • Excellent programming skills and experience contributing software to large projects

  • Experience with distributed training of large models

Preferred Qualifications

  • Strong ML Fundamentals

  • Experience working with large cross-functional and diverse teams.

About the company

Apple

Apply for this position