Cloud and GPU Infrastructure Manager
Role details
Job location
Tech stack
Job description
The Department of Computing wishes to recruit a Cloud and GPU Infrastructure Manager to be responsible for the administration, maintenance, and optimization of both our internal GPU infrastructure and our cloud-based systems., This hybrid role requires a deep understanding of GPU technologies, cloud platforms, and high-performance computing. You will work closely with various teams to support their computing needs, ensuring that our infrastructure remains robust, scalable, and efficient., * Internal GPU Infrastructure Management: Oversee the setup, configuration, and maintenance of large-scale GPU clusters within our data centres, ensuring optimal performance and reliability.
- Strategy Development: Develop, execute and evangelize GPU strategy which considers platform, architecture, security and commercial aspects of adopting technologies to support wider departmental aims.
- Commercial Management: Defining, seeking agreement and adhering to available budgets, taking into consideration both short and long term, based on wider requirements and constraints.
- Cloud Facilities Administration: Manage our cloud infrastructure on platforms such as AWS, GCP, or Azure, with a focus on GPU and computer-intensive resources.
- Performance Optimization: Continuously monitor and optimize the performance of both internal and cloud-based systems, implementing best practices for resource utilization.
- Automation and Scripting: Develop and maintain automation scripts to streamline infrastructure management tasks, reduce manual intervention, and enhance system efficiency.
You will need to stay informed about the latest advancements in GPU technologies and cloud computing, recommending and implementing improvements to our infrastructure.
Requirements
Do you have experience in Google Cloud Platform?, Department of Computing is seeking a dedicated and skilled individual to manage our internal large-scale GPU infrastructure and cloud facilities., The ideal candidate will be educated to degree level (or equivalent) in a Computer Science or a closely related subject, or equivalent experience. You will have proven experience in managing large-scale GPU infrastructure and cloud environments, in-depth knowledge of GPU hardware and software (NVIDIA, AMD), cloud platforms (AWS, GCP, Azure), and high-performance computing environments.
Strong understanding of technology contracts, financial acumen as it pertains to technology hardware/software, and partnering with Procurement teams to drive value would also be essential for this role.
Benefits & conditions
In addition to a competitive salary and package of attractive terms and conditions of service, we offer many benefits to enjoy.
These include:
- generous pension scheme
- annual cost of living review
- holiday allowance and flexible working
- learning and development opportunities