Senior AI/ML Capacity and Performance Engineer
Role details
Job description
GM is looking for a Senior Performance Engineer to join the AV Capacity and Performance Engineering (AVCPE) team in the AV Infrastructure org to support our critical efforts in developing autonomous vehicles (AV). The mission of the AVCPE team is to provide input into large-scale ML infrastructure strategy, advise on key decisions affecting our cloud budget, identify and execute optimization projects, and provide capacity planning and engineering expertise to support GM's efforts in developing autonomous vehicles.
- Strategic Infrastructure Development: Adopt and run AV models to support GM's long-term GPU system strategy and "evergreen" infrastructure roadmap.
- Performance Optimization: Conduct deep-dive analyses of production workloads to identify bottlenecks and propose high-impact optimization strategies.
- Cross-Functional Collaboration: Partner with AI/ML Research, Infrastructure Engineering, and Cloud Vendors to spearhead projects that enhance engineering velocity and cost-efficiency.
- Proactive System Scaling: Identify opportunities for architectural improvements to ensure the scalability and reliability of large-scale ML training and inference environments.
Requirements
- Experience: 5+ years of professional experience in high-scale infrastructure or ML systems.
- Education: Bachelor's Degree in Computer Science, a related technical field, or equivalent practical experience.
- Software Proficiency: Expert-level coding skills in Python and the ability to architect/debug within the PyTorch ecosystem.
- Systems Engineering: Proven track record of resolving performance issues within large-scale distributed production environments.
- Architectural Knowledge: Deep understanding of distributed systems, specifically modern ML system design and high-performance computing (HPC).
- Containerization: Hands-on experience with Kubernetes for orchestrating complex workloads.
- GPU Monitoring: Technical proficiency with Nvidia DCGM, nvidia-smi, and Grafana for real-time telemetry and observability.
- Cloud Platforms: Extensive experience working within major cloud ecosystems (AWS, GCP, or Azure).
What will give you a competitive edge (Preferred Qualifications)
- Advanced Experience: 8+ years of relevant industry experience.
- Hardware Expertise: Working knowledge of enterprise-grade Nvidia GPU architectures, including H100, B200, and GB200.
- Model Deployment: Experience deploying and scaling open-source models via the Hugging Face ecosystem.
- Data Analytics: Proficiency in BigQuery for large-scale data analysis and reporting.
- Profiling Tools: Practical experience utilizing Nvidia Nsight and Nsight Compute for kernel-level performance tuning.
- Soft Skills: Strong technical communication skills with the ability to translate complex infrastructure needs into actionable business insights.
Hybrid/Remote: This role is categorized as hybrid/remote. The successful candidate is expected to report to the Sunnyvale Technical Center a minimum of three days per week, or at the hiring manager's discretion. The role may also be performed remotely from Seattle, WA.
Benefits & conditions
Compensation: The compensation information is a good faith estimate only. It is based on what a successful applicant might be paid in accordance with applicable state laws. The compensation may not be representative for positions located outside of New York, Colorado, California, or Washington.
- Salary Range: The salary range for this role is $144,700 to $261,300. The actual base salary a successful candidate will be offered within this range will vary based on factors relevant to the position.
- Bonus Potential: An incentive pay program offers payouts based on company performance, job level, and individual performance.
- Benefits: GM offers a variety of health and wellbeing benefit programs. Benefit options include medical, dental, vision, Health Savings Account, Flexible Spending Accounts, retirement savings plan, sickness and accident benefits, life insurance, paid vacation and holidays, tuition assistance programs, employee assistance program, GM vehicle discounts, and more.
About GM
Our vision is a world with Zero Crashes, Zero Emissions and Zero Congestion and we embrace the responsibility to lead the change that will make our world better, safer and more equitable for all.
Why Join Us
We believe we all must make a choice every day - individually and collectively - to drive meaningful change through our words, our deeds and our culture. Every day, we want every employee to feel they belong to one General Motors team.