Senior Network Solution Architect - AI Fabrics

NVIDIA Ltd.
Santa Clara, United States of America
yesterday

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Senior
Compensation
$ 288K

Job location

Remote
Santa Clara, United States of America

Tech stack

Artificial Intelligence
Systems Engineering
Bash
Border Gateway Protocol
Big Data
C++
Common Lisp Object Systems
Cloud Computing
Computer Engineering
Customer Data Management
Data Centers
Software Debugging
Linux
Network Interface Controllers
InfiniBand
Python
Network Control
Network Planning and Design
Open Shortest Path First
System Software
Virtualization Technology
Cloud-native Network Functions (CNF)
Graphics Processing Unit (GPU)
Computer Networking Systems
Cloud Platform System
Data Center Networking
Information Technology
Hardware Acceleration

Job description

NVIDIA is looking for an experienced Network Solutions Architect Engineer to help bring our next-generation AI networking platforms into production at customer data centers. Do you want to be part of a team that brings new AI hardware and software technologies to production in customer data centers? As part of the NVIDIA SA organization, you will be driving deployment of our end-to-end technology solutions integration at some of NVIDIA's most strategic technology customers, as well as offering recommendations to business and engineering teams on our product roadmap.

What you will be doing:

  • Partner with AI-native / consumer internet customers on large data center GPU and networking deployments. Guide architecture decisions across network, compute, and storage, including fabric design. Support on-site bring-up of server, network, and cluster infrastructure in customer data centers.
  • Demonstrate expertise on advanced GPU and network systems (Spectrum-X, BlueField DPU, InfiniBand/RoCE, etc.) for key accounts. Run regular technical account reviews covering roadmap alignment, cluster issues, feature discussions, and new technology introductions. Capture customer-specific requirements and translate them into concrete feedback for product, architecture, and engineering teams.
  • Analyze and debug configuration and performance issues in RoCE and InfiniBand environments. Work across NICs, switches, Linux, and system software to deliver performant, reliable AI clusters.
  • Identify and shape new project opportunities for NVIDIA GPUs, networking, and software in AI and data center use cases. Collaborate closely with Systems Engineering, Product Management, and Sales to align solutions with customer outcomes. Build targeted POCs that showcase the value of NVIDIA's networking stack (e.g., Spectrum-X fabrics, BlueField DPUs) in real customer environments.

Requirements

  • BS/MS/PhD in Electrical/Computer Engineering, Computer Science, or other Engineering fields or equivalent experience.
  • 6+ years of hands-on network engineering experience in data center or cloud environments.
  • Proven, expert-level troubleshooting of data center networks (packet-level, control plane, and fabric behavior).
  • Deep protocol knowledge of BGP, OSPF, and L2/L3 switching in large-scale data center or cloud networks (ECMP, Clos/leaf-spine). Experience with high-density switching at cloud or hyperscale is strongly preferred. Experience with InfiniBand or RoCE is a major plus.
  • Solid understanding of CPU/GPU server architecture, NICs, Linux, system software, and kernel drivers
  • Strong time management and ability to context-switch across multiple customers and projects.
  • Excellent written and verbal communication, including clear design docs, customer presentations, and root-cause summaries.

Ways to stand out from the crowd:

  • Advanced certifications: CCIE, JNCIE, or equivalent expert-level certifications.
  • Automation & tooling: Experience in Python, Bash, or C/C++ for automating network workflows, validation, and debug.
  • NVIDIA platform experience: Hands-on work with NVIDIA GPUs, NICs, DPUs, or ARM-based CPU platforms.
  • Customer-facing background: Pre-sales, post-sales, field engineering, or consulting experience with external enterprise or cloud customers.
  • Large-scale deployments: Direct experience bringing up and operating large clusters or supercomputing environments.
  • Virtualization / cloud: Familiarity with virtualization, containers, and cloud networking concepts.

We make extensive use of conferencing tools, but occasional (20%) travel is required for on-site visit to customers and industry events. We are open to remote work location and look forward to have you join our team!

Benefits & conditions

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 184,000 USD - 287,500 USD for Level 4, and 224,000 USD - 356,500 USD for Level 5.

About the company

NVIDIA is widely considered to be one of the technology world's most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us. If you're creative and autonomous, we want to hear from you!

Apply for this position