Senior Cloud Software Development Engineer

Intel Corporation
Austin, United States of America
3 days ago

Role details

Contract type
Internship / Graduate position
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Senior
Compensation
$ 245K

Job location

Austin, United States of America

Tech stack

Artificial Intelligence
Chameleon
Cloud Computing
Communications Protocols
Protocol Stack
Computer Programming
Computer Networks
Computer Engineering
Data Centers
Software Debugging
Linux
Microprocessors
Distributed Systems
Hardware Design
Machine Learning
Message Passing Interface
Performance Tuning
Scientific Computating
Software Engineering
Software Requirements Analysis
Supercomputing
Multithreading
Graphics Processing Unit (GPU)
High Performance Computing
Parallel Computation
Information Technology
Low Latency
Optimization Algorithms

Job description

Join our Communication Runtimes team as a Senior Cloud Software Development Engineer to develop cutting-edge software features and optimizations for Intel's communication libraries including Intel SHMEM (Shared Memory Access), Intel MPI (Message Passing Interface), MPICH (Message Passing Interface Chameleon), and Intel oneCCL (Collective Communications Library). This role has a primary focus on development for oneCCL, and there are opportunities to contribute to these other communication libraries., Communication Library Development

  • Design, develop, andmaintainadvanced features and performance optimizations foroneCCL, with potential to contribute toIntel SHMEM, Intel MPIandMPICHlibraries.
  • Optimizesoftware to achieve performance requirements including low latency, high bandwidth, and high reliability
  • Implement and enhance communication protocols across multiple layers of the communications stack

Cross-Functional Collaboration & Requirements

  • Collaborate with cross-functional teams to define software requirements and technical specifications
  • Work directly with scientists and engineers on high-performance computing applications and supercomputer implementations
  • Partner with hardware teams tooptimizesoftware-hardware integration for maximum performance

Performance Optimization & Analysis

  • Develop performance optimizations that improve communication latency and throughput
  • Conduct comprehensive performance analysis and benchmarking across different system configurations
  • Debug complex problems spanning multiple layers of hardware and software stack

What You'll Work On

  • Aurora Supercomputer: Direct collaboration with Argonne National Labs on one of the world's most advanced supercomputers
  • Cutting-EdgeHardware: Latest Intel GPUs and CPUs designed for data center and HPC applications
  • Impact: Meaningful contributions to scientific computing breakthroughs and machine learning advancement
  • Innovation: Development of next-generation communication libraries and optimization techniques, The Software Team drives customer value by enabling differentiated experiences through leadership AI technologies and foundational software stacks, products, and services. The group is responsible for developing the holistic strategy for client and data center software in collaboration with OSVs, ISVs, developers, partners and OEMs. The group delivers specialized NPU IP to enable the AI PC and GPU IP to support all of Intel's market segments. The group also has HW and SW engineering experts responsible for delivering IP, SOCs, runtimes, and platforms to support the CPU and GPU/accelerator roadmap, inclusive of integrated and discrete graphics.

Requirements

  • Self-driven with high motivation to learn emerging technologies
  • Outstanding analytical and problem-solving skills
  • Excellent communication skills for technical collaboration
  • Understanding of multiple levels of communications stack architecture, Minimum qualifications listed below would be obtained through a combination of industry relevant job experience, internship experience and / or schoolwork/classes/research. The preferred qualifications are in addition to the minimum requirements and are considered a plus factor in identifying top candidates, * Master's degree in Computer Science, Computer Engineering or in a STEM related field of Study
  • 3+ years of software development experience
  • 3+ years of Linux environment development experience
  • 3+ years of C and C++ programming experience
  • Experience with multithreaded programming and parallel computing concepts
  • Specialized Experience (At least one required)
  • Distributed computing systems and architectures
  • HPC (High-Performance Computing) communications libraries
  • Collective communications libraries (MPI, oneCCL/NCCL, or SHMEM)
  • GPU software development and optimization
  • Network communications stack development (one or more layers)

Preferred Qualifications

  • Ph.D. degree in Computer Science, Computer Engineering or in a STEM related field of Study
  • Experience developing performance optimizations that measurably improve communications latency or throughput
  • Experience debugging complex problems across different layers of hardware and software stack
  • Deep understanding of high-performance computing architectures and optimization techniques
  • Experience with Intel GPU and CPU architectures and their optimization characteristics
  • Knowledge of supercomputing environments and large-scale distributed systems
  • Familiarity with scientific computing and machine learning communication patterns

Benefits & conditions

We offer a total compensation package that ranks among the best in the industry. It consists of competitive pay, stock bonuses, and benefit programs which include health, retirement, and vacation. Find out more about the benefits of working at Intel (https://intel.wd1.myworkdayjobs.com/External/page/1025c144664a100150b4b1665c750003) .

Annual Salary Range for jobs which could be performed in the US: $128,880.00-245,160.00 USD

About the company

This role offers the opportunity to build expertise with the latest Intel GPUs and CPUs used in data centers, collaborate directly with scientists and engineers on the Aurora supercomputer at Argonne National Labs, and make meaningful contributions that advance scientific computing and machine learning capabilities.

Apply for this position