Data Engineer
Role details
Job location
Tech stack
Job description
Are you a skilled Data Engineer looking to apply your expertise to real-world scientific challenges? Join the Research Informatics Team at the Centre for Medicines Discovery and help power cutting-edge drug discovery and translational science.
As a Data Engineer, you will play a central role in designing and delivering scalable data pipelines and systems that enable scientists to work efficiently with complex datasets. You will collaborate closely with multidisciplinary teams across the Centre for Medicines Discovery, the Centre for Human Genetics, and the wider Nuffield Department of Medicine. You will also contribute to high-profile initiatives such as OpenBind UK, working alongside leading organisations including Diamond Light Source and EMBL-EBI. You will design, implement, and maintain scalable ETL pipelines, incorporating automation, validation, quality checks, and workflow orchestration. You will also develop performant and reliable data products and services, contribute to FAIR-compliant data management strategies, and support improvements in metadata standards, interoperability, and data provenance.
Requirements
It is essential that you hold Master's degree in Computer Science, Bioinformatics, Data Science or another relevant computational discipline together with experience in designing and maintaining ETL workflows and handling large, complex or heterogeneous datasets. You will have strong proficiency in Python, with experience in building production-grade data pipelines. You will be able to manage multiple concurrent projects and deliver high-quality results to deadlines. Having a working knowledge of biology with a clear interest in biomedical data management is highly desirable.