Senior Machine Learning Engineer

Staffworx Ltd

Charing Cross, United Kingdom

yesterday

Role details

Contract type

Permanent contract

Employment type

Full-time (> 32 hours)

Working hours

Regular working hours

Languages

English

Experience level

Senior

Job location

Charing Cross, United Kingdom

Tech stack

Java

C++

Encodings

Data Infrastructure

ETL

Data Transformation

Distributed Systems

FFmpeg

Python

Machine Learning

Search Technologies

SQL Databases

Parquet

PyTorch

React

Machine Learning Operations

Front End Software Development

Software Version Control

Data Pipelines

Job description

Build and evolve a data platform (LanceDB, DataFusion, SQL and vector search) for large-scale multimodal datasets
Design ML pipelines for video indexing and processing (face detection, quality assessment, tracking)
Improve training performance across single and multi-node setups using PyTorch and Ray
Build evaluation and experimentation systems (Parquet/Iceberg) for model output analysis
Own model versioning, lifecycle management, and promotion to production
Optimise inference pipelines using Triton; build model ensembles and define request protocols

Requirements

Proven ML engineering background with a focus on infrastructure and productionisation (not just model training)
Strong Python skills, plus experience with a robust production language such as C++ or Java
Solid understanding of data pipeline performance trade-offs: I/O, compute, batching, memory layout
Hands-on PyTorch experience: training pipelines, data loading, preprocessing
Practical distributed systems experience (Ray, DDP, or similar)
Experience handling TB-scale or high-throughput data pipelines
Familiarity with columnar formats: Arrow, Parquet, Iceberg

Nice to Have

Exposure to video or visual media pipelines (FFmpeg, encoding, frame extraction)
Vector search or embedding system experience
Triton or production inference background
React/frontend for internal tooling

About the company

A privately backed AI technology company operating at the intersection of machine learning in the media industry. You will own core ML infrastructure end to end, from data ingestion and curation through to distributed training and production inference, working with large-scale multimodal datasets (video, embeddings, metadata). This is not a research role. The focus is on productionising models, building reliable platforms, and making ML systems fast and scalable in a real production environment. The ideal profile is an ML engineer transitioning from research into platform ownership - someone who is product-minded and outcome-driven rather than tech-for-tech's-sake. You should be comfortable bridging the gap between experimentation and production.

Role details

Job location

Tech stack

Job description

Requirements

About the company

Apply for this position

Good distractions

Moments

Videos View all