HPC Systems Administrator (High-Performance Computing)
Role details
Job location
Tech stack
Job description
Discipline: Intelligent Networked Systems Institute
Location: Northeastern University - London, London, UK
Term: Full-time, fixed term (18 months), with possibility of extension
Direct Reports: N/A
Reports to: To be determined
Benefits: The university supports staff maintaining a good work-life balance, offering: (i) flexible working and parental leave opportunities; (ii) an employee assistance programme which provides free, confidential advice on both home and work concerns, as well as optional private medical insurance; (iii) season ticket loans; and (iv) being part of the cycle-to-work scheme.
Start: July 1, 2026 (or earlier based on candidate availability)
The Role
Northeastern University - London (NU London), in collaboration with Northeastern University's Research Computing (RC) team and the Intelligent Networked Systems Institute (INSI), invites applications for a highly motivated and adaptable HPC Systems Administrator to design, operate, and evolve the AI and high-performance computing infrastructure that underpins experiential research and scholarship at NU London.
The successful candidate will play a leading role in the architecture, deployment, administration, and growth of NU London's research computing cluster, while integrating Northeastern University's Explorer cluster and cloud-based computing resources into a cohesive and resilient computer ecosystem supporting research and teaching.
The primary objective of the role is to ensure the reliability, security, performance, and scalability of research computing systems by proactively monitoring, maintaining, and enhancing HPC and AI infrastructure. The role involves close collaboration with IT support specialists, research support teams, and academic researchers, providing both operational expertise and strategic guidance. You will be expected to stay current with emerging trends in HPC, AI, and cloud technologies, contributing insights that inform long-term infrastructure planning and service development for NU London's research enterprise. You will also provide hands-on, on-site support and equipment maintenance as required.
Key Responsibilities
- Manage, monitor, and maintain research computing hardware and software systems, ensuring reliability, performance, scalability, and security across NU London's HPC ecosystem.
- Deploy, configure, and maintain HPC workload managers and schedulers (e.g., Slurm, PBS, LSF), including queue configuration, fair-share resource allocation, job monitoring, and performance optimization.
- Install, maintain, and optimize GPU software stacks, including NVIDIA drivers, CUDA, cuDNN, NCCL, and GPU-aware MPI libraries.
- Develop and maintain automation, configuration management, and infrastructure-as-code solutions to improve reliability and operational efficiency.
- Deploy and maintain scientific software environments using tools such as Spack, EasyBuild, Conda, and environment modules.
- Design, deploy, configure, and document core services, including cluster resource management and scheduling, high-performance storage and backup systems, data lifecycle management, user lifecycle management, and authentication and authorization frameworks.
- Implement and maintain secure access controls for research systems, including identity federation, key-based authentication, and authorization mechanisms, while ensuring compliance with institutional policies, data protection regulations, and research governance requirements.
- Diagnose, troubleshoot, and resolve system issues across hardware, software, networking, storage, and distributed computing environments, ensuring a stable and performant research computing platform.
- Provide technical support and training to staff and students, supporting skills development and promoting effective use of research computing resources.
- Collaborate closely with faculty to understand evolving computational requirements and develop new systems, workflows, and solutions that support both research and teaching.
- Work with Northeastern's Research Computing team to define and deliver short- and long-term strategies for expanding infrastructure, services, and capabilities at NU London.
- Write and curate technical documentation, including internal administrative documentation and external user-facing guides.
- Communicate progress, risks, and outcomes through regular updates, technical reviews, and strategic discussions with researchers and senior management.
- Participate in conferences, workshops, and regional collaborations, contributing to professional development, external partnerships, and funding opportunities while helping expand NU London's research computing portfolio.
About the Institute
The Intelligent Networked Systems Institute (INSI) at NU London is a newly established transatlantic research hub focused on next-generation connectivity and intelligent computing and learning systems. The INSI London Hub acts as a bridge between the US and Europe, advancing interdisciplinary collaboration across continents. INSI is dedicated to addressing pressing challenges in telecommunications, artificial intelligence, quantum information, and information processing-all with the objective of creating a more efficient, connected, and secure world.
About the University
Founded in 2012, Northeastern University London (formerly known as New College of the Humanities) has established itself as a prestigious higher education institution based in the heart of London. Northeastern University London is part of Northeastern University's Global Campus Network. Positioned as top ranking in the USA, Northeastern has campuses in Arlington, Boston, Charlotte, Miami, Oakland, San Jose, and Seattle in the USA, and Toronto and Vancouver in Canada.
After becoming part of Northeastern University's global network in early 2019, the University has undergone a period of rapid growth, having been granted Taught Degree Awarding Powers in February 2020 and moved to a new, state-of-the-art campus in 2021.
Northeastern University London offers a vibrant, collegiate community delivering broad and academically rigorous degree programmes, designed to equip graduates with the combination of skills and knowledge that are increasingly desired by organisations, employers, and society. It offers state of the art audio visual technology in its teaching and meeting spaces across the organisation.
Person Specification Criteria
To undertake this role, the following should apply - should you not have the experience below, please do highlight where transferable skills would assist with you undertaking the role.
Qualifications
- Bachelor's degree in a computation field with Linux systems experience, or equivalent professional experience in Linux system administration.
Key Criteria
- Demonstrated experience or foundational knowledge administering and supporting HPC Linux clusters in a production research environment, including cluster provisioning, configuration, and lifecycle management
- Good working knowledge in Linux system administration, including operating systems, networking, storage, security, and user management
- Experience with or interest in working with accelerator-based systems for AI and HPC (e.g., multi-GPU servers)
- Experience configuring, managing, and supporting batch scheduling systems (mainly, Slurm)
- Proficiency in scripting and automation using at least one scripting or programming language (e.g., Bash, Python, Ruby, C/C++)
- Basic working knowledge of identity and access management systems (e.g., LDAP, Active Directory)
- Familiarity with storage systems and filesystems (e.g., GPFS, Lustre, or equivalent)
- Experience with or interest in working with automation, configuration management, and infrastructure-as-code tools (e.g., Ansible, Puppet, Chef, Salt)
- Familiarity with common HPC software stacks, system management tools, and research computing workflows
- Familiarity with virtualization, containerisation, and cloud computing platforms, including technologies such as Docker, AWS, Azure, or GCP
- Basic knowledge of database concepts and administration (e.g., MySQL, PostgreSQL, MongoDB)
- Good analytical and troubleshooting skills, with the ability to diagnose and resolve complex system, performance, and integration issues
- Excellent verbal and written communication skills, with the ability to work effectively with technical colleagues and end users
- Ability to work both independently and collaboratively in a fast-paced, evolving technical environment
- Ability to manage multiple priorities and work streams while maintaining high standards of reliability and attention to detail
- Demonstrated commitment to continuous professional development and maintaining up-to-date technical skills
Additional Information
Enquiries
Informal enquiries may be made to Bipin Rajendran (b.rajendran@northeastern.edu) and Alexandros Koliousis (a.koliousis@northeastern.edu). However, all applications must be made in accordance with the application process specified below.
Application Process
Applications should be made via the 'Apply' button above by 23:59 on 20 May 2026. Participation in the equal opportunities section is encouraged, but voluntary. Applications must include:
- full curriculum vitae
- cover letter that sets out concisely and in ways relevant to this role how the applicant meets the person specification criteria
- names and contact information for at least two individuals who can each provide a letter of recommendation (references will only be sought for short-listed candidates)
The panel will be shortlisting for this position on a rolling basis so please apply as soon as possible. We reserve the right to close this post before the closing date if we receive many applications.
We do not require assistance from recruitment agencies. Any applications made by recruitment agencies will be returned.
Please note this role may require a basic or enhanced DBS check. Our organisation acknowledges the duty of care to safeguard, protect and promote the welfare of our students and staff, and is committed to ensuring safeguarding practice reflects statutory responsibilities, government guidance and complies with best practice and Ofsted requirements. You must adhere to the above if you are offered a role with NU London.
Applications are welcome from all sections of the community and will be judged on merit alone. We welcome applications from underrepresented groups. Candidates must be able to demonstrate their eligibility to work in the UK in accordance with the Immigration, Asylum and Nationality Act 2006.
Job sponsorship. Visa sponsorship is not available for a successful candidate for this position.
£40,000 to £50,000 per annum, commensurate with experience and qualifications
Requirements
- Bachelor's degree in a computation field with Linux systems experience, or equivalent professional experience in Linux system administration., * Demonstrated experience or foundational knowledge administering and supporting HPC Linux clusters in a production research environment, including cluster provisioning, configuration, and lifecycle management
- Good working knowledge in Linux system administration, including operating systems, networking, storage, security, and user management
- Experience with or interest in working with accelerator-based systems for AI and HPC (e.g., multi-GPU servers)
- Experience configuring, managing, and supporting batch scheduling systems (mainly, Slurm)
- Proficiency in scripting and automation using at least one scripting or programming language (e.g., Bash, Python, Ruby, C/C++)
- Basic working knowledge of identity and access management systems (e.g., LDAP, Active Directory)
- Familiarity with storage systems and filesystems (e.g., GPFS, Lustre, or equivalent)
- Experience with or interest in working with automation, configuration management, and infrastructure-as-code tools (e.g., Ansible, Puppet, Chef, Salt)
- Familiarity with common HPC software stacks, system management tools, and research computing workflows
- Familiarity with virtualization, containerisation, and cloud computing platforms, including technologies such as Docker, AWS, Azure, or GCP
- Basic knowledge of database concepts and administration (e.g., MySQL, PostgreSQL, MongoDB)
- Good analytical and troubleshooting skills, with the ability to diagnose and resolve complex system, performance, and integration issues
- Excellent verbal and written communication skills, with the ability to work effectively with technical colleagues and end users
- Ability to work both independently and collaboratively in a fast-paced, evolving technical environment
- Ability to manage multiple priorities and work streams while maintaining high standards of reliability and attention to detail
- Demonstrated commitment to continuous professional development and maintaining up-to-date technical skills
Benefits & conditions
£40,000 to £50,000 per annum, commensurate with experience and qualifications