Systems Research Engineer

Spacelabs Healthcare
Dunfermline, United Kingdom
5 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Junior
Compensation
£ 70K

Job location

Dunfermline, United Kingdom

Tech stack

Artificial Intelligence
C++
Cloud Computing
Distributed Systems
Fault Tolerance
Systems Theories
Python
Machine Learning
Performance Tuning
Systems Development Life Cycle
Systems Architecture
Load Balancing
PyTorch
Large Language Models
Low Latency

Job description

  • Architect and implement distributed system components for AI workloads across CPU, GPU, and NPU clusters.
  • Conduct in-depth profiling and performance tuning of inference pipelines, focusing on KV cache management.
  • Develop low-latency, fault-tolerant AI serving frameworks using vLLM, Ray Serve, and PyTorch Distributed.
  • Research and prototype novel techniques for cache sharing, data locality, and resource orchestration.
  • Translate innovative designs into publishable contributions at top-tier venues (e.g., OSDI, NSDI, MLSys).
  • Collaborate with global research teams to drive the internal adoption of novel system architectures.

Requirements

  • Preferably a PhD, at minimum a masters degree, in Computer Science, distributed systems, or related field.
  • Strong knowledge of Distributed Systems, OS internals, and Machine Learning systems architecture.
  • Hands-on experience with LLM serving frameworks (vLLM, Ray Serve, TensorRT-LLM, or TGI).
  • Proficiency in C/C++ for systems development and Python for research prototyping.
  • Solid grounding in distributed algorithms, load balancing, and state management.
  • Proven ability to conduct systems research, ideally evidenced by publications in top-tier conferences.

Keywords: Systems Research Engineer / Distributed Systems / AI Infrastructure / LLM Serving / vLLM / KV Cache / Performance Optimization / Machine Learning Systems / C++ / Python / Ray Serve / PyTorch Distributed / Cloud Infrastructure

About the company

We are currently partnered with a globally leading research centre in the UK looking to expand their team with a Systems Research Engineer. This is an incredibly high calibre team reshaping how large-scale models are trained and served through next-generation AI-native infrastructure and "super-node" clusters., Systems Research Engineer | C++ Systems Engineering One of the largest telecommunications companies in the world is looking for an experienced researcher to join the company in Edinburgh. The Vision We are currently scaling a world-class research team in Edinburgh to..., SeeByte, the global leader in developing of smart software solutions for uncrewed maritime systems, are looking to recruit an Algorithms Engineer to join their team in their Edinburgh office (60% of time in office). Why join us You will have a unique opportunity to play a..., JOB DESCRIPTION Company Profile Oceaneering is a global provider of engineered services and products, primarily to the offshore energy industry. We develop products and services for use throughout the lifecycle of an offshore oilfield, from drilling to decommissioning. We...

Apply for this position