Senior Software Engineer - PyTorch and AI Frameworks

NVIDIA Ltd.
Santa Clara, United States of America
3 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Senior
Compensation
$288K

Job location

Santa Clara, United States of America

Tech stack

Artificial Intelligence
C++
Compilers
NVIDIA CUDA
Computer Programming
Distributed Computing Environment
Python
Performance Tuning
Software Engineering
AI Infrastructure
High Performance Computing
PyTorch
Large Language Models
Deep Learning
Generative AI
GPU Programming
Build Management
AI Platforms
Information Technology
Machine Learning Operations
TensorRT

Job description

We are looking for experienced engineers to help build and scale next-generation AI infrastructure using PyTorch, one of the world's most widely used deep learning frameworks. This role sits at the intersection of machine learning systems, compilers, and high-performance computing, enabling researchers and product teams to train and deploy large-scale models efficiently. You will work on core components of the PyTorch ecosystem, including model execution, distributed training, performance optimization, and developer experience.

What you'll be doing:

  • Design and build core PyTorch capabilities across runtime, autograd, distributed training, and model execution
  • Optimize performance across GPU/accelerator backends (CUDA, Triton, etc.)
  • Contribute to or lead development of large-scale ML systems and infrastructure
  • Improve model training efficiency, scalability, and reliability across multi-node environments
  • Work on compilers, graph transformations, and kernel optimizations to accelerate deep learning workloads
  • Partner with researchers and applied teams to translate cutting-edge models into production systems
  • Drive open-source contributions and collaborate with the broader PyTorch community
  • Influence roadmap and architecture for next-gen AI platforms
  • Work at the forefront of AI and accelerated computing
  • Have a direct impact on how PyTorch runs on the world's most advanced GPU platforms
  • Collaborate across hardware, systems software, and AI research to push performance boundaries and enable breakthroughs in generative AI, autonomous systems, and high-performance computing

Requirements

  • PhD or MSc degree in Computer Science, Applied Math, Physics, or related science or engineering field (or equivalent experience)
  • 8+ years of software development experience
  • Strong programming skills in C++ and Python
  • Deep understanding of deep learning frameworks, preferably PyTorch
  • Experience with GPU programming (CUDA or similar) and performance optimization

Ways to stand out from the crowd:

  • Contributions to PyTorch core or ecosystem libraries
  • Experience with NVIDIA AI stack (TensorRT, Triton Inference Server, cuBLAS, cuDNN, NCCL)
  • Familiarity with ML compilers (TorchInductor, Triton, XLA, TVM)
  • Experience optimizing LLMs or large-scale recommendation / vision models
  • Background in hardware-aware software optimization

Benefits & conditions

With competitive salaries and a generous benefits package, we are widely considered to be one of the technology world's most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us and, due to outstanding growth, our best-in-class engineering teams are rapidly growing.

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 184,000 USD - 287,500 USD for Level 4, and 224,000 USD - 356,500 USD for Level 5.

You will also be eligible for equity and benefits.
