Cloud and GPU Infrastructure Manager

Imperial College London
25 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Compensation
£ 68K

Job location

Tech stack

Amazon Web Services (AWS)
Azure
Cloud Computing
Data Centers
Performance Tuning
Scripting (Bash/Python/Go/Ruby)
Google Cloud Platform
Information Technology

Job description

The Department of Computing wishes to recruit a Cloud and GPU Infrastructure Manager to be responsible for the administration, maintenance, and optimization of both our internal GPU infrastructure and our cloud-based systems.

This hybrid role requires a deep understanding of GPU technologies, cloud platforms, and high-performance computing. You will work closely with various teams to support their computing needs, ensuring that our infrastructure remains robust, scalable, and efficient.

  • Internal GPU Infrastructure Management: Oversee the setup, configuration, and maintenance of large-scale GPU clusters within our data centres, ensuring optimal performance and reliability.
  • Strategy Development: Develop, execute, and evangelize a GPU strategy that considers the platform, architecture, security, and commercial aspects of adopting technologies to support wider departmental aims.
  • Commercial Management: Define, seek agreement on, and adhere to available budgets, taking both the short and long term into consideration, based on wider requirements and constraints.
  • Cloud Facilities Administration: Manage our cloud infrastructure on platforms such as AWS, GCP, or Azure, with a focus on GPU and compute-intensive resources.
  • Performance Optimization: Continuously monitor and optimize the performance of both internal and cloud-based systems, implementing best practices for resource utilization.
  • Automation and Scripting: Develop and maintain automation scripts to streamline infrastructure management tasks, reduce manual intervention, and enhance system efficiency.

You will need to stay informed about the latest advancements in GPU technologies and cloud computing, recommending and implementing improvements to our infrastructure.

Requirements

The Department of Computing is seeking a dedicated and skilled individual to manage our internal large-scale GPU infrastructure and cloud facilities.

The ideal candidate will be educated to degree level (or equivalent) in Computer Science or a closely related subject, or will have equivalent experience. You will have proven experience in managing large-scale GPU infrastructure and cloud environments, and in-depth knowledge of GPU hardware and software (NVIDIA, AMD), cloud platforms (AWS, GCP, Azure), and high-performance computing environments.

A strong understanding of technology contracts, financial acumen as it pertains to technology hardware and software, and experience of partnering with Procurement teams to drive value are also essential for this role.

Benefits & conditions

In addition to a competitive salary and a package of attractive terms and conditions of service, we offer a range of benefits.

These include:

  • generous pension scheme
  • annual cost of living review
  • holiday allowance and flexible working
  • learning and development opportunities

About the company

Welcome to Imperial, a global top ten university where scientific imagination leads to world-changing impact. Join us and be part of something bigger. From global health to climate change, AI to business leadership, here at Imperial we navigate some of the world's toughest challenges. Whatever your role, your contribution will have a lasting impact. As a member of our vibrant community of 22,000 students and 8,000 staff, you'll collaborate with passionate minds across nine London campuses and a global network. This is your chance to help shape the future. We hope you'll join us at Imperial College London.
