Principal Software Developer - GPU AI/HPC kernels

Advanced Micro Devices, Inc.
Austin, United States of America
31 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Senior
Compensation
$ 200K

Job location

Austin, United States of America

Tech stack

Artificial Intelligence
C++
Code Generation
Nvidia CUDA
Computer Programming
Computer Engineering
Software Debugging
Firmware
Python
Knowledge Management
OpenCL
Software Engineering
Graphics Processing Unit (GPU)
Application Specific Integrated Circuits
Information Technology
Api Design
Software Performance
Code Restructuring
Software Library

Job description

AMD is looking for a principal software developer to join our growing team. As a key contributor you will be part of our ROCm GPU-compute mathematical libraries team working on innovative hardware and software technologies. You will help deliver exceptional performance and feature enhancements via maintainable code development, optimizations/tuning, and collaboration., * Develop software in C++, Python, HIP, assembly, and SOTA programming technologies to enable key mathematical operations on GPU

  • Design GPU computational software libraries for AI, HPC applications
  • Aid management in planning, and delivering industry-leading software for current and future processors
  • Supervise small development team
  • Carry-out performance optimizations and projections for important use-cases to maximize hardware utilization
  • Support development of programs to sustain seamless performance analysis, and performance/functional test coverage
  • Identify and help resolve quality issues working closely with libraries development teams and other internal engineering teams, THE ROLE: As GPU Software Architect,you willprovide technical leadership at the intersection of GPU architecture, multiASIC platformbringup, and software enablement for nextgeneration GPU products. This is a "Software-First" architecture role: you will reimagine andredefinethe end-to-endsoftware librarieslifecycle as itspansacross multipleASICs to create a unifiedsoftwarefabricand process supportingdevelopment of software libraries oncutting edgehardware.

You will serve as a bridging authority betweensoftwarearchitectureand the hardware ecosystem, ensuring that architectural intent translates intoworking, performant, and scalablesolutions forpartnershipsestablishedwithsoftware libraries teams.

This role isfocused on leading a team focusedonnew GPUs and new product introductions, with accountability spanning early architecture definition, presilicon modeling, multiASIC bringup strategy, and software readiness for emerging platforms.

THE PERSON: You are a deeply technical leader who thrives in ambiguous, firstofkind environments. You combine strong architectural intuition with handson experience bringing up complex hardware platforms and enabling software stacks on new silicon.

You are comfortable operating across layers - from architecture tradeoffs and interconnect topology down to firmware, drivers, and performance tooling - and you are trusted to make decisions when data is incomplete. You naturally connectand leadteamsthrough technical credibility, clarifyingintent, and reducingfriction between hardware and software organizations.

KEY RESPONSIBILITIES:

  • Architecture & PlatformLeadership: Provide technical leadership for GPU architecture decisions with direct impact on multiASIC platforms, interconnects, memory systems, and scalability. Translate architectural concepts into concrete platform requirements spanning ASIC, firmware, drivers, and software libraries.

  • MultiASIC BringUpStrategy: Define and lead bringup strategies for new GPU platforms, includingstrategies spanning multipleASICs. Partner with silicon, systems, and software teams toidentifyrisks early and drive mitigation plans from presilicon through first silicon.

  • Hardware-Software CoDesign: Drive hardware/software interface definition, ensuring architecture choicessupport and reflect the drive towards performance and quality.Influence firmware, driver, runtime, and performance software design to align with architectural intent.

  • Early SiliconEnablement: Act as a technical escalation point during early silicon bringup, debugging complex crosslayer issues spanning hardware, firmware, and software.Guidethe creation of diagnostics, validation tools, and bringup workflows that scale across teams and products.

  • CrossFunctional TechnicalLeadership: Work across architecture, design, verification, drivers, performance libraries, and product teams to ensure alignment. Provide technical mentorship and review, raising the overall effectiveness of teams working on new GPU platforms.

  • Knowledge Capture &Reuse: Capture lessons learned from new product bringup and translate them into reusable architecture patterns, best practices, and documentation.

  • Leverages AIassisted software development toolsto acceleratethedesign, implementation, review, and documentation of complex software libraries. Establishes best practices for responsible use of AIassistance, including validation, review, and traceability of generated code and technical artifacts., AMD may use Artificial Intelligence to help screen, assess or select applicants for this position. AMD's "Responsible AI Policy" is available here.

Requirements

The successful candidate would possess excellent software skills in C/C++, Python and has expert understanding of software performance and computer architecture. Must have strong desire and ability to solve problems, and be able to schedule, and work on tasks independently. Should be a great team player with excellent communication skills., * 10+ years professional software development experience

  • Demonstrated capacity to technically lead and people manage junior to mid-level developers

  • Proficient in C/C++ & Python programming employing best software design practices

  • GPU software development or validation involvingHIP, CUDA, or OpenCL

  • Experience with software libraries and API design

  • Exposure to Matrix/Tensor operations and numerical work

  • Software emulation to support FP numerical formats is a plus

  • Experience in software performance estimations, optimizations and debugging

  • Ability to closely interact with technical leads, developers, and test teams to maintain and release production software, * Master's or Doctoral degreein Computer Science, Computer Engineering, Electrical Engineering, or equivalent, * Deep experience in GPU, accelerator, or SoC architecture, including memory systems, interconnects, and scalability considerations. History of technical leadership across distributed, crossfunctional engineering teams.

  • Strong background in systems software, firmware, drivers, or performance software used to enable new silicon. Proven experience in hardware/software codesign, including defining interfaces and debugging crosslayer issues.

  • Handson programming experience in C/C++ and Python. Familiarity with lowlevel debugging tools and workflows.Experience working with performance modeling, simulators, or early validation infrastructure.

  • Applied experience using AIassisted coding tools in professional software engineering workflows, including code generation, refactoring, test creation, documentation, and design exploration.

  • Advanced degree in Computer Engineering, Electrical Engineering, Computer Science, or equivalent practical experience.

ACADEMIC CREDENTIALS:

  • Advanced degrees, such as M.Sc., M.Eng., Ph.D. are preferred

About the company

At AMD, our mission is to build great products that accelerate next-generation computing experiences-from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create something extraordinary. When you join AMD, you'll discover the real differentiator is our culture. We push the limits of innovation to solve the world's most important challenges-striving for execution excellence, while being direct, humble, collaborative, and inclusive of diverse perspectives. Join us as we shape the future of AI and beyond. Together, we advance your career.

Apply for this position