Senior Linux Kernel Systems Software Engineer...

NVIDIA Ltd.
Santa Clara, United States of America
28 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Senior
Compensation
$288K

Job location

Santa Clara, United States of America

Tech stack

Artificial Intelligence
Computing Platforms
Big Data
C++
Cloud Engineering
Communications Protocols
Nvidia CUDA
Computer Programming
Computer Engineering
Data Centers
Software Debugging
Device Drivers
Microprocessors
Middleware
Embedded Software
Ethernet
Firmware
General-Purpose Computing on Graphics Processing Units
Python
Linux kernel
PCI Express
Performance Tuning
Cloud Services
Software Engineering
System Software
USB
Virtualization Technology
High Performance Computing
Deep Learning
Kubernetes
Information Technology

Job description

NVIDIA is seeking a Senior Software Engineer to join our CSP Engagements team, focusing on system software for data center products such as GB200. This role combines deep technical expertise in embedded firmware, Linux kernel development, and middleware development with customer-facing responsibilities to enable cloud service providers (CSPs) with next-generation computing platforms. You will work at the intersection of hardware and software, driving technical solutions from concept through deployment.

What you'll be doing:

  • Design and develop software solutions for data center servers including Linux kernel modifications, device drivers, and system optimizations for GB200 and next-gen platforms.

  • Lead hardware bring-up activities, BSP development, and hardware-software co-design for Cloud Service Provider deployments.

  • Partner directly with CSPs to deliver technical solutions, co-develop & co-debug features and optimizations, and provide support during new product introductions.

  • Collaborate with cross-functional teams in designing end-to-end solutions spanning firmware, OS, middleware, and applications, with a focus on AI/ML and HPC workloads.

  • Perform advanced system debugging, root cause analysis, and performance optimization for large-scale data center environments.

  • Collaborate with AE, FAE, and Solution Architect teams to deliver integrated customer solutions and technical documentation.

Do you want to join a team of highly motivated and experienced program managers who drive the successful introduction of NVIDIA's next generation GPU/CPU based products? We work closely with internal leaders in Software, Hardware, Firmware, Marketing and Operations to ensure the SW team delivers outstanding products while operating across multiple functional units and all levels of management to achieve Time-To-Market. As part of the team, your knowledge of driver, firmware, diagnostics and the SW stack development processes and priorities will enable you to swiftly make the course adjustments needed to keep these complex projects on track!

Requirements

  • Deep expertise in data center server architectures, HPC systems, and hardware-software co-design.

  • Expert knowledge of Linux kernel internals, device drivers, and communication protocols (PCIe, USB, Ethernet).

  • Deep understanding of computer architecture, microprocessor concepts, and expert knowledge of ARM (aarch64) and x86 architectures.

  • Deep understanding of NUMA architectures including memory topology, processor-memory locality, and performance optimization for multi-CPU systems in data center environments.

  • Strong programming skills in C/C++ and Python, plus experience with virtualization, Kubernetes, and cloud-native architectures.

  • Skilled in complex system-level debugging, performance analysis, and test design.

  • BS or MS in Computer Engineering, Computer Science, or related field (or equivalent experience).

  • 8+ years of system software development experience.

Ways to stand out from the crowd:

  • Experience with GPU computing (CUDA) and deep learning workloads.

  • Expertise in out-of-band and in-band management architectures.

  • Knowledge of memory fabric and CXL architectures.

Benefits & conditions

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 184,000 USD - 287,500 USD for Level 4, and 224,000 USD - 356,500 USD for Level 5.

About the company

NVIDIA is leading the way in groundbreaking developments in Artificial Intelligence, High-Performance Computing and Visualization. The GPU, our invention, serves as the visual cortex of modern computers and is at the heart of our products and services. We have some of the most forward-thinking and hardworking people on the planet working for us. If you're creative, hardworking and self-motivated, we want to hear from you!
