Senior Data Science Engineer
Role details
Job location
Tech stack
Job description
We're looking for a Machine Learning Engineer with strong experience building and deploying scalable, production-ready ML systems. You'll work with data scientists, engineers, and product teams to develop models that power dunnhumby's personalisation, recommendations, forecasting, and customer insight products, across areas such as basket understanding, sequence modelling, NLP, multimodal applications, and generative AI.
As part of the AI Strategy, Research & Enablement team, you'll help shape dunnhumby's AI direction - turning new research into practical capabilities that influence products, platforms, and long-term strategy. You'll work across data science, engineering, and product, partnering with senior leaders to guide priorities and drive innovation.
Requirements
Do you have experience in Software deployment?, * Strong hands-on experience with modern deep learning frameworks (preferably PyTorch)
- Deep expertise in transformer architectures such as BERT, GPT, T5, and Time-Series Transformers
- Experience delivering end-to-end ML pipelines, including testing, data processing, training, validation, versioning, deployment, and monitoring
- Solid understanding of MLOps tooling (e.g., MLflow, Kubeflow, Airflow, Docker, Kubernetes)
- Strong Python engineering skills, writing clean, modular, production-ready code (plus bash and Git/GitLab)
- Experience working in cloud environments, ideally GCP or Azure
- Excellent communication skills, able to explain complex concepts to both technical and non-technical audiences
- Collaborative approach when working with data scientists, product teams, and senior stakeholders
- Experience with large-scale recommendation systems or other retail-focused ML applications
- Knowledge of distributed training (e.g., DeepSpeed, PyTorch Distributed)
- Familiarity with vector databases, embeddings, and retrieval techniques
- Understanding of feature stores, metadata stores, and model registries
- Experience working with large datasets, including efficient loading, batching, and streaming (PySpark preferred)
- A practical, delivery-focused mindset that balances innovation with scalability and robustness
- Curiosity and enthusiasm for applying cutting-edge ML research to commercial problems
- A drive to raise engineering standards and help evolve dunnhumby's ML ecosystem