Senior HPC Engineer - Linux
Role details
Job location
Tech stack
Job description
Senior HPC Engineer - High Performance Computing - Linux - Oxfordshire, hybrid, up to 80K, We're looking for an experienced (senior) HPC Infrastructure Engineer to design, deploy, and maintain high-performance computing clusters, container platforms (Kubernetes/Docker Swarm), and machine-learning infrastructure.
Working closely with Platform Engineering and Infrastructure teams, you will implement resilient infrastructure solutions, manage lifecycle activities, and provide 3rd line support across HPC and Linux environments.
You'll contribute to Cyber Security best practices, follow ITIL Change Management processes, and drive continual service improvement initiatives.
We would like you come with most of the following:
- Linux systems administration
- HPC cluster management and scheduling (Slurm, Kubernetes)
- Enterprise storage platforms and parallel filesystems
- InfiniBand or high-speed networking
- GPU compute workloads and scheduling
- Virtualisation (VMware, Nutanix AHV)
- Monitoring tools (Prometheus, Grafana)
- Infrastructure automation (Python, Bash, CI/CD)
- Hybrid/cloud HPC and containerised compute
You'll be collaborative, able to manage multiple priorities, and committed to delivering excellent service across the business.
Requirements
We need an experienced Senior HPC Engineer / Infrastructure Engineer (this is not a junior role) with strong Linux systems administration skills and hands-on experience in high-demand technical environments. Ideally, you'll have worked in advanced technology, simulation-heavy industries, or similar performance-critical sectors.