Data Engineer (Python)

Hays plc
Charing Cross, United Kingdom
8 days ago

Role details

Contract type
Temporary contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Compensation
£ 117K

Job location

Charing Cross, United Kingdom

Tech stack

Query Performance
Unit Testing
Azure
Cloud Storage
Information Engineering
Data Systems
Python
Data Processing
Test Driven Development
Data Lake
PySpark
Serverless Computing
Docker

Job description

  • Develop and maintain data ingestion, transformation, and validation pipelines using Python and PySpark.
  • Implement unit and behavior-driven testing with Behave, ensuring robust mocking and patching of dependencies.
  • Design and maintain Delta Lake tables for optimised query performance, ACID compliance, and incremental data loads.
  • Build and manage containerised environments using Docker for consistent development, testing, and deployment.
  • Develop configurable, parameter-driven codebases to support modular and reusable data solutions.
  • Integrate Azure services, including:
  • Azure Functions for serverless transformation logic
  • Azure Key Vault for secure credential management
  • Azure Blob Storage for data lake operations

Requirements

About the RoleWe are looking for a Python Data Engineer with strong hands-on experience in Behave-based unit testing, PySpark development, Delta Lake optimisation, and Azure cloud services. This role focusses on designing and deploying scalable data processing solutions in a containerised environment, emphasising maintainable, configurable, and test-driven code delivery., * Proven experience in Python, PySpark, and Delta Lake.

  • SC Cleared
  • Strong knowledge of Behave for test-driven development.
  • Experience with Docker and containerised deployments.
  • Familiarity with Azure cloud services and data engineering best practices.
  • Ability to deliver scalable, maintainable, and testable solutions in a fast-paced environment.

Apply for this position