Senior Machine Learning Engineer
Staffworx Ltd
Charing Cross, United Kingdom
yesterday
Role details
Contract type
Permanent contract Employment type
Full-time (> 32 hours) Working hours
Regular working hours Languages
English Experience level
SeniorJob location
Charing Cross, United Kingdom
Tech stack
Java
C++
Encodings
Data Infrastructure
ETL
Data Transformation
Distributed Systems
FFmpeg
Python
Machine Learning
Search Technologies
SQL Databases
Parquet
PyTorch
React
Machine Learning Operations
Front End Software Development
Software Version Control
Data Pipelines
Job description
- Build and evolve a data platform (LanceDB, DataFusion, SQL and vector search) for large-scale multimodal datasets
- Design ML pipelines for video indexing and processing (face detection, quality assessment, tracking)
- Improve training performance across single and multi-node setups using PyTorch and Ray
- Build evaluation and experimentation systems (Parquet/Iceberg) for model output analysis
- Own model versioning, lifecycle management, and promotion to production
- Optimise inference pipelines using Triton; build model ensembles and define request protocols
Requirements
- Proven ML engineering background with a focus on infrastructure and productionisation (not just model training)
- Strong Python skills, plus experience with a robust production language such as C++ or Java
- Solid understanding of data pipeline performance trade-offs: I/O, compute, batching, memory layout
- Hands-on PyTorch experience: training pipelines, data loading, preprocessing
- Practical distributed systems experience (Ray, DDP, or similar)
- Experience handling TB-scale or high-throughput data pipelines
- Familiarity with columnar formats: Arrow, Parquet, Iceberg
Nice to Have
- Exposure to video or visual media pipelines (FFmpeg, encoding, frame extraction)
- Vector search or embedding system experience
- Triton or production inference background
- React/frontend for internal tooling
About the company
A privately backed AI technology company operating at the intersection of machine learning in the media industry.
You will own core ML infrastructure end to end, from data ingestion and curation through to distributed training and production inference, working with large-scale multimodal datasets (video, embeddings, metadata).
This is not a research role. The focus is on productionising models, building reliable platforms, and making ML systems fast and scalable in a real production environment.
The ideal profile is an ML engineer transitioning from research into platform ownership - someone who is product-minded and outcome-driven rather than tech-for-tech's-sake. You should be comfortable bridging the gap between experimentation and production.