Associate Director Data Engineer

AstraZeneca
Barcelona, Spain
10 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English

Job location

Barcelona, Spain

Tech stack

Query Performance
Java
Artificial Intelligence
Amazon Web Services (AWS)
Automation of Tests
Bioinformatics
C++
Cloud Computing
Code Review
Computer Programming
Databases
Continuous Integration
Data as a Services
Data Architecture
Information Engineering
Data Governance
Data Integration
Data Systems
Software Design Patterns
DevOps
Dimensional Modeling
Python
NoSQL
Performance Tuning
Query Optimization
Software Engineering
SQL Databases
Data Streaming
Systems Integration
Unstructured Data
Data Processing
Large Language Models
Snowflake
GIT
Cloudformation
Information Technology
Data Management
Terraform
Software Version Control
Databricks

Job description

We are seeking a hands-on Associate Director of Data Engineering to lead data architecture, modeling, warehousing, and platform engineering that accelerates scientific decision-making across Clinical Pharmacology & Safety Science (CPSS). You will design and deliver scalable, FAIR-aligned data solutions on enterprise infrastructure, driving positive, disruptive transformation toward AstraZeneca's Bold Ambition for 2030. This role partners closely with R&D IT and DS&AI and collaborates globally with colleagues in Sweden, the United Kingdom, and the United States.

What You'll Do

  • Data platform architecture: Design, implement, and operate robust, secure, and scalable data platforms and services that enable discovery, access, and reuse (FAIR), with clear SLOs for reliability and performance.

  • Modeling and warehousing: Define canonical data models, dimensional schemas, and lakehouse/warehouse layers; implement semantic modeling; optimize storage, compute, and query performance.

  • Data integration: Build and harden ingestion frameworks for structured and unstructured data; standardize metadata, lineage, and cataloging; ensure interoperability across domains.

  • Governance and quality: Establish and enforce standards for data quality, access control, retention, and compliance; implement monitoring, observability, and automated data quality checks.

  • Infrastructure engineering: Operate solutions across Unix/Linux HPC and cloud (AWS preferred), leveraging infrastructure-as-code to ensure reliability, scalability, and cost efficiency.

  • Collaboration: Translate scientific and business requirements into well-architected designs; co-create solutions with CPSS stakeholders, R&D IT, and DS&AI; set technical direction and roadmap.

  • Engineering excellence: Apply software engineering best practices (version control, CI/CD, automated testing, design patterns, code review) to deliver maintainable, resilient systems.

  • Enablement: Produce high-quality documentation, reusable components, and guidance; mentor engineers and uplift data engineering practices across teams.

Requirements

Do you have experience in Terraform?, Do you have a Master's degree?, * Education: Degree in Computer Science, Engineering, or related field, or equivalent industry experience.

  • Programming: Strong Python expertise; familiarity with Java or C++; ability to write clean, testable, performant code.

  • Platform architecture: Proven experience architecting and building data platforms and data-driven solutions at scale.

  • Software engineering: Track record delivering production-grade systems in data, AI, or scientific domains; proficiency with Git, CI/CD, automated testing, design patterns, and DevOps/SRE practices.

  • Data modeling and warehousing: Experience with dimensional modeling, semantic layers, and warehouse/lakehouse technologies (e.g., Snowflake, Databricks, TileDB).

  • Databases: Hands-on experience with SQL and NoSQL systems, query optimization, and performance tuning.

  • Compute environments: Practical experience with Unix/Linux HPC and cloud platforms (AWS preferred), including infrastructure-as-code (e.g., Terraform/CloudFormation).

  • Translation of needs: Ability to convert scientific/business requirements into robust technical solutions with measurable outcomes.

  • Technical leadership: Demonstrated experience leading end-to-end delivery, setting engineering standards, and guiding teams while remaining hands-on.

  • Core skills: Excellent problem-solving, analytical, and critical-thinking capabilities; attention to detail; strong communication and stakeholder management skills.

Desirable Skills & Experience

  • Generative and agentic AI: Exposure to LLM-enabled data services or agentic workflows.

  • Data processing and integration: Experience integrating structured and unstructured data at scale; familiarity with streaming and batch patterns.

  • Life sciences: Experience with clinical or pre-clinical drug discovery, imaging and bioinformatics data; understanding of domain ontologies and scientific data standards.

  • Governance and compliance: Experience with data governance, privacy, security-by-design, and relevant regulatory frameworks.

Ways of Working :

We value in-person collaboration to accelerate learning and decision-making. We typically work a minimum of three days per week from the office while balancing flexibility for individual needs.

About the company

We follow the science to explore and innovate, fusing data and technology with the latest scientific advances to achieve the next wave of breakthroughs. We listen and learn from people living with the diseases we treat to better understand needs and design the right interventions. If your passion is science and impact on patients' lives, this is the place to build a career that matters. Ready to make an impact? Apply now and join us in shaping the future of data architecture and infrastructure at AstraZeneca.   You must create an Indeed account before continuing to the company website to apply

Apply for this position