(Or Senior) Data Platform Architect - Generative Biology Institute
Job description
At EIT we are seeking an experienced and detail-orientated Data Platform Architect to play a pivotal part in designing and implementing cutting-edge data platforms to support the GBI mission. You'll collaborate closely with cross-functional teams to understand research requirements and translate them into robust data models and architectures.
As a Data Platform Architect, you'll have the opportunity to shape the future of our data platform and collaborate with research and product teams to deliver analytical and AI products that transform and accelerate bioscience discovery and translational research. You'll be responsible for defining and implementing data standards, data models and best practices to ensure the integrity, security, and accessibility of our data assets. Additionally, you'll play a key role in optimising data processes and workflows, driving efficiencies, and fostering a data-driven culture within the organisation.
This is a role for computing systems engineers and researchers who think long-term and want to help build a research infrastructure that will underpin the next generation of scientific and technological discovery.
- Formulating the data model and standards to be used by GBI's data platform to support interoperability, automation, and bioscience research.
- Collaborating with various stakeholder groups to ensure GBI's data platform works seamlessly with similar systems across EIT and at external collaborators, and producing architecture artifacts and presenting the work through architecture governance.
- Developing the data platform, including its data flows, data lifecycle, data security, durability, and provenance, as well as applying consistent documentation standards and architecture methods.
- Supporting developers and researchers, making sure they can fully utilise the data platform through a combination of mentoring and direct involvement.
- (senior level hires only) Managing GBI's data platform on HPC environments, including Linux-based clusters, schedulers (e.g., Slurm), and high-performance storage systems (e.g., Lustre, BeeGFS, GPFS).
- (senior level hires only) Supporting reproducible research through data provenance, containerization (Singularity, Docker, etc.), workflow orchestration (Nextflow, Kubernetes, OpenHPC, etc.), and MLOps.
Requirements
Do you have experience in Metadata?
Essential Knowledge, Skills and Experience:
- Knowledge of master data, metadata, and reference data management
- Knowledge of architecting and delivering modern data platform standards, tools and patterns, including data lakes, lakehouses, Apache Iceberg, and data mesh
- Ability to work collaboratively with multidisciplinary research teams and translate computational needs into technical solutions.
Desirable Knowledge, Skills and Experience:
- Experience architecting, building, and delivering modern data platforms at scale
- (senior level hires only) 3+ years of relevant experience managing HPC systems in a research, biological or biomedical, or academic environment
- (senior level hires only) Extensive experience designing, deploying, and managing storage systems for HPC clusters (or cloud computing) in scientific or research settings.
Key Attributes:
- Collaboration
- Ability to work in a fast-paced environment
- Willingness to learn and cross-train / upskill in new technology
- Willingness to be hands-on to explore new technology or develop POCs
Benefits & conditions
Our Benefits:
- Competitive salary + travel allowance + bonus
- Enhanced holiday pay
- Pension
- Life Assurance
- Income Protection
- Private Medical Insurance
- Hospital Cash Plan
- Therapy Services
- Perk Box
- Electric Car Scheme
Working Together - What It Involves:
- You must have the right to work permanently in the UK with a willingness to travel as necessary. In certain cases, we can consider sponsorship, and this will be assessed on a case-by-case basis.
- You will live in, or within easy commuting distance of, Oxford (or be willing to relocate).