Machine Learning Engineer - LLM post-training/mid-training

EPM Ltd.

Bristol, United Kingdom

2 days ago

Role details

Contract type

Permanent contract

Employment type

Full-time (> 32 hours)

Working hours

Regular working hours

Languages

English

Experience level

Intermediate

Job location

Bristol, United Kingdom

Tech stack

Artificial Intelligence

Python

Machine Learning

PyTorch

Large Language Models

Information Technology

Data Generation

Job description

Our team is partnered with a materials discovery stealth venture based in San Francisco and London, led by former Oxford and Isomorphic Labs leaders in AI and experimental science. The team is pioneering large-scale language models that reason, adapt, and accelerate discovery workflows. They are combining experimental validation, synthetic data generation, and scalable infrastructure to push the boundaries of autonomous research. This is a rare chance to be part of the founding team, shaping technical direction and building systems that redefine how science is done., * You will design and implement novel approaches for model adaptation and reasoning, exploring techniques that improve generalization, controllability, and scientific understanding.

This includes mid-training strategies, post-training alignment, and inference-time optimization for complex workflows.
You'll also develop new reasoning paradigms such as retrieval-augmented and tool-augmented approaches and build robust evaluation frameworks for applied scientific contexts.

Requirements

PhD. or equivalent experience in Computer Science, Machine Learning, or a closely related discipline
Practical experience working with LLM training pipelines, including pre-training, mid-training, or post-training stages
Strong grasp of transformer architectures, optimization techniques, and representation learning principles
Proficiency in Python and familiarity with major ML frameworks such as PyTorch, DeepSpeed, or JAX
Knowledge of alignment and reasoning strategies, including in-context learning, chain-of-thought, tool integration, or retrieval-augmented approaches
Ability to combine innovative research thinking with pragmatic engineering to deliver scalable, high-performance systems