HPC Systems Engineer
Role details
Job location
Tech stack
Job description
- Design and architect scalable, high-performance HPC cluster solutions for global manufacturing environments
- Lead deployment, configuration, and lifecycle management of cluster infrastructure
- Collaborate with developers and cross-functional teams to understand requirements and translate them into technical solutions
- Drive solutions from design through production, including implementation, validation, and support
- Ensure system reliability, performance, and availability across compute, storage, and networking layers
- Support ongoing operations, troubleshooting, and continuous improvement of HPC systems
- Contribute to automation, standardization, and DevOps best practices across the platform
Requirements
Systems & Infrastructure
- Deep expertise in Linux operating systems (SUSE, Red Hat, Rocky Linux, Ubuntu)
- Strong experience architecting and maintaining robust storage systems
- Solid understanding of HPC hardware ecosystems, including servers, GPUs, networking, storage, schedulers, BIOS, and BMC
- Experience with virtualization technologies such as VMware, Proxmox, or XCP-ng
Networking & Core Services
- Strong understanding of TCP/IP fundamentals and network protocols (DNS, DHCP, HTTP, LDAP, SMTP)
- Experience with file sharing technologies (NFS, CIFS)
- Familiarity with net boot/PXE and high-availability Linux configurations
Automation & DevOps
- Proficiency in scripting and development using Shell and Python
- Experience with configuration management tools (Ansible, Salt, Chef, Puppet)
- Strong DevOps mindset, including CI/CD pipelines and Git-based repositories
Platforms & Tools
- Experience with HPC schedulers (SGE, SLURM)
- Familiarity with web servers and traffic management (Apache, Nginx, reverse proxy, load balancing via HAProxy)
- Monitoring and observability tools (Prometheus, Grafana, Nagios)
- Database experience with MySQL, Doctorate (Academic) Degree and related work experience of 3+ years; Master's Level Degree and related work experience of 6+ years; Bachelor's Level Degree and related work experience of 8+ years
Benefits & conditions
Base Pay Range: $159,500.00 - $271,200.00 Annually
Primary Location: USA-CA-Milpitas-KLA
KLA's total rewards package for employees may also include participation in performance incentive programs and eligibility for additional benefits including but not limited to: medical, dental, vision, life, and other voluntary benefits, 401(K) including company matching, employee stock purchase program (ESPP), student debt assistance, tuition reimbursement program, development and career growth opportunities and programs, financial planning benefits, wellness benefits including an employee assistance program (EAP), paid time off and paid company holidays, and family care and bonding leave.
Interns are eligible for some of the benefits listed. Our pay ranges are determined by role, level, and location. The range displayed reflects the pay for this position in the primary location identified in this posting. Actual pay depends on several factors, including state minimum pay wage rates, location, job-related skills, experience, and relevant education level or training. We are committed to complying with all applicable federal and state minimum wage requirements where applicable. If applicable, your recruiter can share more about the specific pay range for your preferred location during the hiring process.