Sr. Engineering Program Manager, ML Compute Infrastructure, Apple Services Engineering
Role details
Job location
Tech stack
Job description
We're looking for someone with passion in this space and an experience building large-scale infrastructure for cloud platforms with a strong knowledge of ML workflow. You will partner with the engineering teams and other product/program managers across the org to drive and influence our compute roadmap for improving engineering efficiencies, reducing cost, and ensuring resiliency for Apple's ML use cases. You will also have the opportunity to be the leader with vision, identifying new technologies for adoption and be the central PM voice for interactions with public cloud providers, such as AWS and GCP, as well as internal Apple Cloud. Come, join us and build the new frontier technologies that support Apple Intelligence!
Requirements
- 3+ years of Product and/or technical program management experience covering some or as many of the following areas as possible: distributed computing, large scale cloud infrastructure, GPU/TPU usage for ML training, software and computing architectures, container stack, and networking
- Experience in working with large scale GPU based AI applications, like Natural Language Processing and Recommendation, for training and inference or direct experience in building or managing cloud computing infrastructure and technologies
- Proficiency in multitasking and leading sophisticated programs with cross functional teams with a track record in developing and bringing outstanding platforms to market
- MS/PhD in EE, CS, Math, or Physics or equivalent work experience, * 5+ years of work experience in Product/Program and/or solutions architecture or developer roles
- Strong desire to learn, aptitude for problem solving, and the ability to make sophisticated trade-offs
- Self-motivated, independent, and proactive; demonstrated creative and critical thinking capabilities; can quickly (realtime) triage, prioritize, and lead cross-functional teams under pressure
- Highly developed drive to improve how things work, with a proven track record of driving dramatic improvements for team quality, performance, agility, or effectiveness
- Demonstrated ability to define product vision, strategy, and roadmap along with the ability to deliver and execute
- Experience driving technical partnerships with internal and external cloud, software stakeholders
- Knowledge of computer systems and cloud infrastructure architecture
- Familiarity with AI frameworks (e.g. Tensorflow, Pytorch or MxNet) and/or GPU development experience
- Excellent interpersonal skills including ability to explain sophisticated technical topics to non-experts