Data Platform Engineer

Allegis Global Solutions
Charing Cross, United Kingdom
2 days ago

Role details

Contract type
Temporary contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English

Job location

Charing Cross, United Kingdom

Tech stack

Artificial Intelligence
Data analysis
Cloud Computing
Cloud Storage
Customer Data Management
Information Engineering
Data Infrastructure
Data Integrity
Software Debugging
Identity and Access Management
Python
Software Engineering
Data Streaming
Workflow Management Systems
Data Logging
Scripting (Bash/Python/Go/Ruby)
Google Cloud Platform
Cloud Monitoring
Dynatrace
Docker

Job description

We are looking for a skilled and experienced Data Platform Engineer to join our growing team. Data Platform Engineers take full ownership of delivering high-performing, high-impact data platforms as products and services, from a description of the problem that customer Data Engineers are trying to solve all the way through to final delivery (and ongoing monitoring and operations). They are standard bearers for software engineering and quality coding practices within the team and are expected to mentor more junior engineers; they may even coordinate the work of more junior engineers on a large project. They devise useful metrics to ensure their services are meeting customer demand and having an impact, and they iterate to deliver and improve on those metrics in an agile fashion. The Data Platform team builds and manages reusable components and architectures designed to make it both fast and easy to build robust, scalable, production-grade data products and services in the challenging biomedical data space. A Data Platform Engineer is a technical individual contributor who builds modern, cloud-native systems for standardizing and templatizing data engineering, and who brings the skills and experience listed below.

Key Responsibilities

  • Building a next-generation, metadata- and automation-driven data experience for GSK's scientists, engineers, and decision-makers, increasing productivity and reducing time spent on "data mechanics"
  • Providing best-in-class AI/ML and data analysis environments to accelerate our predictive capabilities and attract top-tier talent.
  • Aggressively engineering our data at scale, as one unified asset, to unlock the value of our unique collection of data and predictions in real time.
  • Automation of end-to-end data flows: faster, more reliable ingestion of high-throughput data in genetics, genomics and multi-omics, to extract value from investments in new technology
  • Enabling governance by design of external and internal data: practical, engineered solutions for controlled use and monitoring
  • Innovative disease-specific and domain-expert-specific data products: enabling computational scientists and their research unit collaborators to reach key insights faster, leading to faster biopharmaceutical development cycles.
  • Supporting e2e code traceability and data provenance: increasing assurance of data integrity through automation and integration
  • Improving engineering efficiency: extensible, reusable, scalable, updateable, maintainable, virtualized and traceable data and code, driven by data engineering innovation and better resource utilization.

Requirements

  • Proficiency in Google Cloud Platform (GCP) - including Cloud Run, GKE, Cloud Storage, Artifact Registry, IAM, and related services
  • Strong Python development skills for scripting, automation, and tooling around pipeline infrastructure
  • Hands-on experience building and optimizing Docker containers, including multi-stage builds, image optimization, and container security best practices
  • Solid understanding of CI/CD pipelines for automated container builds and deployments
  • Demonstrated expertise in debugging and observability - including structured logging, distributed tracing, metrics collection, and use of tools such as Cloud Logging, Cloud Monitoring, or equivalent
  • Experience diagnosing performance and reliability issues in containerized, cloud-native environments

Preferred Skills

  • Familiarity with Nextflow for workflow orchestration
  • Exposure to bioinformatics, genomics data, and cell imaging and histopathology image processing workflows
  • Experience with GCP Batch for running large-scale computational workloads

About the company

GSK is a science-led global healthcare company with a special purpose: to help people do more, feel better, live longer. We are on an audacious journey to impact the health of 2.5 billion people over the next decade. Our R&D division is at the forefront of this mission, dedicated to the discovery and development of groundbreaking vaccines and medicines. We are transforming the landscape of medical research by integrating cutting-edge science and technology and harnessing the power of genetics and new data. By fostering a collaborative environment that unites the talents of our people, we are revolutionizing R&D to pre-empt and defeat diseases. Join us in our commitment to uniting science, technology, and talent to get ahead of disease together.

GSK is a global biopharma company with a special purpose - to unite science, technology and talent to get ahead of disease together - so we can positively impact the health of billions of people and deliver stronger, more sustainable shareholder returns - as an organisation where people can thrive. We prevent and treat disease with vaccines, specialty and general medicines. We focus on the science of the immune system and the use of new platform and data technologies, investing in four core therapeutic areas (infectious diseases, HIV, respiratory/immunology and oncology).

Our success absolutely depends on our people. While getting ahead of disease together is about our ambition for patients and shareholders, it's also about making GSK a place where people can thrive. We want GSK to be a place where people feel inspired, encouraged and challenged to be the best they can be. A place where they can be themselves - feeling welcome, valued, and included. Where they can keep growing and look after their wellbeing. So, if you share our ambition, join us at this exciting moment in our journey to get Ahead Together.

Inclusion at GSK

GSK is an Equal Opportunity/Affirmative Action Employer. All qualified applicants will receive equal consideration for employment without regard to race, color, national origin, religion, sex, pregnancy, marital status, sexual orientation, gender identity/expression, age, disability, genetic information, military service, covered/protected veteran status or any other federal, state or local protected class. If you need any adjustments in the recruitment process, please get in touch with our Recruitment team (EMEA-GSKLink@allegisglobalsolutions.com) to discuss this further.
