Data Engineer
Role details
Job location
Tech stack
Job description
As a Data Engineer, you will support the development and operation of uMed's data pipelines and analytics-ready datasets. You will work closely with the Senior Data Engineer, who will provide technical guidance and mentorship while offering opportunities to grow your skills in data ingestion, transformation, and analytics enablement.
This role is ideal for someone who enjoys building reliable data pipelines and wants to deepen their experience working on a modern, AWS-based data platform.
Requirements
Data Engineering & Pipelines
- Build and maintain data pipelines for structured and semi-structured data (e.g., surveys, EHR extracts, events, and operational systems).
- Support ETL/ELT workflows from source ingestion through to analytics-ready outputs.
- Troubleshoot pipeline failures and data issues, escalating and collaborating where needed.
Data Modelling & Analytics Support
- Implement data transformations and models based on designs and guidance from the Senior Data Engineer and Product Architect.
- Help maintain analytics-ready datasets in the data warehouse to support reporting and insights.
- Contribute to datasets that support cross-study and cross-region (UK and US) analysis.
Data Quality & Reliability
- Assist in implementing data quality checks and validation rules.
- Help investigate data discrepancies and ensure data accuracy and consistency.
- Contribute to documentation and data lineage as part of standard delivery.
Collaboration & Learning
- Work closely with the Senior Data Engineer through code reviews, pairing, and technical feedback.
- Collaborate with product and analytics teams to understand data requirements.
- Follow and apply established best practices in data engineering, testing, and documentation.
Requirements
- 3-4 years of experience in data engineering or closely related roles.
- Solid SQL skills and experience working with analytics-focused datasets.
- Experience building or maintaining data pipelines in a cloud environment (AWS preferred).
- Familiarity with semi-structured data (e.g., JSON, event data).
Technical Stack (Experience With Some of the Following)
- AWS: S3, Redshift, RDS, Athena, DocumentDB (or equivalent).
- Data transformation tools (e.g., dbt or similar).
- Orchestration tools (e.g., Airflow, Step Functions, or similar).
- Python (or similar) for data processing and automation.
- BI tools and analytics enablement (e.g., Zoho Analytics, Tableau, Power BI, or similar)., * Experience in healthcare, clinical research, or regulated data environments.
- Exposure to data quality frameworks or monitoring.
- Experience working with multi-region data platforms (UK and US).
- Interest in analytics, insights, or AI-enabled data products.