Data Platform Engineer
Job description
We are looking for a skilled and experienced Data Platform Engineer to join our growing team. Data Platform Engineers take full ownership of delivering high-performing, high-impact data platform products and services, from a description of the problem customer Data Engineers are trying to solve all the way through to final delivery (and ongoing monitoring and operations). They are standard bearers for software engineering and quality coding practices within the team and are expected to mentor more junior engineers; they may even coordinate the work of more junior engineers on a large project. They devise useful metrics to ensure their services meet customer demand and have an impact, and they iterate in an agile fashion to deliver and improve on those metrics.
The Data Platform team builds and manages reusable components and architectures designed to make it both fast and easy to build robust, scalable, production-grade data products and services in the challenging biomedical data space. A Data Platform Engineer is a technical individual contributor who builds modern, cloud-native systems for standardizing and templatizing data engineering, with the following skills and experience.
Key Responsibilities
- Building a next-generation, metadata- and automation-driven data experience for GSK's scientists, engineers, and decision-makers, increasing productivity and reducing time spent on "data mechanics"
- Providing best-in-class AI/ML and data analysis environments to accelerate our predictive capabilities and attract top-tier talent.
- Aggressively engineering our data at scale, as one unified asset, to unlock the value of our unique collection of data and predictions in real-time.
- Automation of end-to-end data flows: faster, more reliable ingestion of high-throughput data in genetics, genomics, and multi-omics, to extract value from investments in new technology
- Enabling governance by design for external and internal data: practical, engineered solutions for controlled use and monitoring
- Innovative disease-specific and domain-specific data products: enabling computational scientists and their research unit collaborators to reach key insights sooner, shortening biopharmaceutical development cycles
- Supporting end-to-end code traceability and data provenance: increasing assurance of data integrity through automation and integration
- Improving engineering efficiency: driving extensible, reusable, scalable, maintainable, and traceable data and code through data engineering innovation and better resource utilization
Requirements
- Proficiency in Google Cloud Platform (GCP) - including Cloud Run, GKE, Cloud Storage, Artifact Registry, IAM, and related services
- Strong Python development skills for scripting, automation, and tooling around pipeline infrastructure
- Hands-on experience building and optimizing Docker containers, including multi-stage builds, image optimization, and container security best practices
- Solid understanding of CI/CD pipelines for automated container builds and deployments
- Demonstrated expertise in debugging and observability - including structured logging, distributed tracing, metrics collection, and use of tools such as Cloud Logging, Cloud Monitoring, or equivalent
- Experience diagnosing performance and reliability issues in containerized, cloud-native environments
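As a loose illustration of the structured-logging expectation above, the sketch below emits one JSON object per log record so that a log collector such as Cloud Logging can parse entries as structured fields rather than free text. The `pipeline` logger name and the chosen fields are illustrative assumptions, not a prescribed format.

```python
import json
import logging

class JsonFormatter(logging.Formatter):
    """Render each log record as a single JSON line.

    Field names are illustrative; "severity" mirrors the level name,
    which Cloud Logging recognizes when parsing structured entries.
    """
    def format(self, record):
        entry = {
            "severity": record.levelname,
            "message": record.getMessage(),
            "logger": record.name,
        }
        return json.dumps(entry)

# Attach the formatter to a stream handler on a hypothetical "pipeline" logger.
handler = logging.StreamHandler()
handler.setFormatter(JsonFormatter())
logger = logging.getLogger("pipeline")
logger.addHandler(handler)
logger.setLevel(logging.INFO)

logger.info("container build finished")
```

Emitting one JSON object per line keeps logs machine-parseable end to end, which is what makes downstream metrics collection and tracing correlation practical in containerized environments.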
Preferred Skills:
- Familiarity with Nextflow for workflow orchestration
- Exposure to bioinformatics, genomics data, cell imaging, and histopathology image-processing workflows
- Experience with GCP Batch for running large-scale computational workloads