Senior AI Infrastructure Engineer

Hamilton Barnes
Birmingham, United Kingdom
5 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Shift work
Languages
English
Experience level
Senior

Job location

Remote
Birmingham, United Kingdom

Tech stack

Artificial Intelligence
Cloud Engineering
Computer Networks
Continuous Integration
Linux
DevOps
Open Source Technology
OpenStack
Role-Based Access Control
AI Infrastructure
Data Logging
System Availability
Git Flow
Kubernetes
Infrastructure Automation Frameworks
Bare Metal
Performance Monitor

Job description

  • Driving Automation: Building and maintaining infrastructure-as-code and GitOps practices to ensure seamless scalability.
  • Optimizing Performance: Enabling reliable workload scheduling through Kubernetes-native tooling, container runtime optimization, and NVIDIA integrations.
  • Ensuring Resilience: Maintaining high availability and observability through proactive monitoring, logging, and incident response.
  • Strengthening Security: Implementing strong controls, including RBAC and network policies, to ensure tenant isolation.
  • Cross-Team Collaboration: Working closely with DevOps, AI, and Product teams to align infrastructure capabilities with customer needs.

Requirements

As they scale their global footprint to meet massive demand, they are seeking a Senior Infrastructure Engineer who enjoys deep technical autonomy. This is a role for a specialist who wants to move fast, solve complex problems, and have direct ownership over the stability and scalability of business-critical systems., * OpenStack Expert: Significant hands-on experience operating OpenStack in a production environment.

  • K8s Specialist: Strong experience running production-grade Kubernetes, ideally in bare-metal or private cloud setups.
  • Systems Generalist: A solid grounding in Linux, networking, and storage with a practical approach to troubleshooting.
  • Modern Workflows: Experience with infrastructure automation, CI/CD, and Git-based workflows.
  • Scale-up Mindset: The ability to thrive in a fast-moving environment with a strong sense of accountability.

Nice to Have

  • Exposure to GPU-based infrastructure, large-scale compute platforms, or HPC.
  • Familiarity with advanced networking technologies.
  • Contributions to open-source or cloud-native communities.

Benefits & conditions

  • Impact: The opportunity to make a visible, meaningful impact on a platform used by teams running compute-heavy applications.
  • Flexibility: Flexible working arrangements, including remote or hybrid options.
  • Growth: Clear career progression and the chance to help shape the company's culture and future.
  • Culture: A collaborative, transparent, and international culture built on trust.
  • Benefits: Competitive salary, annual discretionary bonus, 25 days holiday (plus public holidays), and wellbeing benefits.

Apply for this position