Software Engineer, TensorRT Specialized Platforms...

NVIDIA Ltd.
Santa Clara, United States of America
2 days ago

Role details

Contract type: Internship / Graduate position
Employment type: Full-time (> 32 hours)
Working hours: Regular working hours
Languages: English
Compensation: $196K

Job location

Santa Clara, United States of America

Tech stack

Artificial Intelligence
C++
Profiling
Nvidia CUDA
Computer Engineering
Memory Management
Python
Performance Tuning
System Programming
Deep Learning
Parallel Computation
Information Technology
TensorRT
C++14
Software Performance

Job description

Are you passionate about driving innovation in deep learning and eager to work on cutting-edge AI technology? Join NVIDIA's TensorRT team as a Software Engineer, and be at the forefront of technology, contributing to high-performance AI inference solutions for specialized platforms and applications. Your fresh perspective and technical skills will help shape the performance and functionality of our products, ensuring NVIDIA remains synonymous with innovation. If you're ready to tackle challenging projects, push the boundaries of AI performance, and make a significant impact in a company that values creativity, excellence, and teamwork, we want to hear from you!

What you'll be doing:

  • Contribute to the design and development of high-performance deep learning inference software using modern C++

  • Collaborate with teams across the hardware and software stack to understand and leverage new technologies to improve TensorRT's functionality and performance

  • Participate in the development of robust, high-quality C++ code in alignment with Modern C++ standards

  • Support systematic reasoning about test plans from unit to integration level

  • Assist in documenting the properties of functions, classes, and systems to improve robustness

  • Contribute to performance optimization and benchmarking efforts

  • Help develop new features and capabilities for TensorRT to serve specialized customer needs

Requirements

  • Master's or PhD in a relevant field (Computer Engineering, Computer Science, Electrical Engineering, AI) or equivalent experience

  • Strong foundational C++ skills, including familiarity with C++11 and C++14 or newer standards

  • Familiarity with the C++ Standard Template Library (STL)

  • Familiarity with modern deep learning models and inference frameworks

  • Interest in performance optimization and systems programming

  • Demonstrated ability to take initiative and see projects through to completion

  • Excellent interpersonal skills and a collaborative, pragmatic approach to solving problems

Ways to stand out from the crowd:

  • Experience with Python and/or CUDA through coursework, internships, or personal projects

  • Exposure to systems programming, embedded systems, and/or compiler concepts

  • Experience in software performance analysis, profiling, or optimization techniques

  • Knowledge of C++17 or later standards

  • Understanding of computer architecture, memory management, or parallel computing concepts

Benefits & conditions

NVIDIA is widely considered to be one of the technology world's most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us. If you're creative, autonomous, and love a challenge, come join our team and help us build the future of high-performance AI inference technology!

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 124,000 USD - 195,500 USD.

You will also be eligible for equity and benefits (https://www.nvidia.com/en-us/benefits/).
