Data Engineer
Role details
Job location
Tech stack
Job description
A skilled Data Engineer is required to support a major data-transformation workstream within a clinical screening and diagnostics environment. The project focuses on modernising an end-to-end screening service through modern, data-driven digital capabilities. You will join a small specialist team responsible for analysing complex datasets spread across multiple system instances, resolving data-quality challenges, and shaping future data models and structures. The core responsibility is to build secure, repeatable ingestion and transformation pipelines, apply robust data-cleansing logic, and produce auditable, reproducible outputs. Essential Skills
Requirements
Experience establishing import/export patterns, including handling data extracts, schema discovery, incremental loading and normalising data across multiple source systems. . Strong capability in data-transformation-heavy pipelines covering profiling, cleansing, standardisation, conformance and final data publishing. . Advanced SQL expertise including profiling, joins/merges, deduplication, anomaly detection and performance tuning. . Practical Python Scripting experience for automation, parsing, rules engines and data-quality checks, using libraries such as Pandas/Polars, scikit-learn or matplotlib. . Experience with modern data tooling (eg, Spark, Azure Data Factory) or the ability to deliver equivalent functionality in code-based environments. . Proven experience working with geospatial datasets (vector, raster, GeoJSON, shapefiles), including coordinate systems, spatial data handling and geospatial analysis workflows. . Ability to interpret geographical context and aggregate/upscale local or regional geospatial insights into coherent national- or region-level datasets. . Experience working with publicly available official datasets (eg, census boundaries, geographic lookups, deprivation indices, population estimates). . Capability to design rules for completeness, validity and consistency, and to implement exception handling and reconciliation flows. . Ability to build version-controlled pipelines with deterministic transformations, logging, lineage and full traceability of data changes. . Comfortable working in a secure environment with least-privilege access principles, secure storage/transfer practices and handling of sensitive personal data. Soft Skills
Technical skills alone are not enough-success in this role requires: . Strong communication and collaboration skills. . A team-focused, cooperative approach. . Enthusiasm, engagement and a positive attitude. . Proactivity-comfortable working independently without constant direction. . Ability to handle ambiguity and adapt to change effectively. Nice-to-Have Skills
. Experience working with healthcare or medical-sector datasets, including patient/episode-style records or longitudinal histories. . Experience building automated data-profiling dashboards or reporting frameworks. Additional Requirements
Candidates must be eligible for UK Security Clearance due to the sensitive nature of the data involved.