Senior Delivery Consultant - Data , ProServe EMEA
Role details
Job location
Tech stack
Job description
The Amazon Web Services Professional Services (ProServe) team is seeking a Delivery Consultant specializing in Data to join our Healthcare and Life Sciences (HCLS) practice. You will be at the center of the most consequential shift in enterprise technology: making organizations truly AI-ready. Every agentic AI system, every foundation model grounded in enterprise knowledge, and every GenAI application that moves from prototype to production depends on the data layer beneath it - and that's what you build.
You will design and implement modern data platforms (lake, lakehouse, mesh), architect data pipelines that transform raw, fragmented data estates into governed, AI-ready assets; and design and implement enterprise RAG architectures, vector stores, semantic ontologies, and knowledge graph architectures that allow foundation models and AI agents to reason accurately, access data securely, and execute autonomously within regulated environments. You will work hands-on inside HCLS customer environments with complex data lineage, regulatory overlays (GxP, HIPAA, CDISC), and legacy systems, and ship production-grade data products that serve multiple downstream consumers, from ML model training to agentic orchestration layers.
The AWS Professional Services organization is a global team of experts that help customers realize their desired business outcomes when using AWS services. We work together with customer teams and the AWS Partner Network (APN) to execute enterprise cloud computing and AI transformation initiatives.
Key job responsibilities
Design and implement production-grade data pipelines, data lakes, lakehouses, and data mesh architectures within enterprise HCLS environments, integrating with legacy systems and existing data governance frameworks
Build data products that serve multiple downstream applications and use cases - from AI/ML model training to agentic AI systems, ensuring data quality, lineage, and reliability at scale
Operate with a high degree of autonomy within fast-moving delivery engagements, making judgment calls on data modeling, pipeline design, and architecture without waiting for perfect specifications or constant oversight
Navigate complex data access, security, and privacy requirements unique to pharma and healthcare including GxP compliance constraints, HIPAA, and regulatory data governance frameworks
Architect contextual knowledge layers, including ontologies and knowledge graphs leveraging AWS Context, Amazon Bedrock Knowledge Bases, and custom ontology extensions to equip AI agents with the vocabulary and guardrails to reason accurately and execute autonomously within regulated environments
Collaborate across organizational boundaries to secure data access, understand source system context, and resolve data quality challenges with teams across customer IT, business, and partner organizations
Deliver iteratively when requirements are ambiguous, translating incomplete business needs into well-architected data solutions that can evolve as customer understanding matures
Apply AI-DLC (AI-accelerated Development Life Cycle) methodologies to data delivery to redesign data workflows to become AI-native for accelerated scale and pace
Requirements
- 5+ years of experience in data engineering, data architecture, and/or data platform development, with hands-on implementation of production data pipelines
- Bachelor's degree in Computer Science, Engineering, Data Science, related field, or equivalent experience
- Proficiency in modern data platform design patterns, including data lakes, lakehouses, data mesh, and zero-ETL patterns and streaming architectures, using services such as Amazon SageMaker Lakehouse, SageMaker Unified Studio, Amazon S3 Tables, Amazon Redshift, and zero-ETL integrations.
- Experience with architecting and engineering ontologies and knowledge graphs in enterprise environments, * AWS certifications in Data Analytics or Machine Learning Specialty preferred
- Experience in the healthcare and life sciences industry, including familiarity with compliance and security frameworks (HIPAA, GxP) and clinical data standards (OMOP, CDISC, FHIR)
- Hands-on experience with Apache Iceberg, Spark, Databricks, Snowflake, Kafka, or equivalent distributed data processing frameworks
- Experience designing and implementing knowledge graph architectures, ontology models, or semantic data layers that support AI/ML and agentic AI systems
- Experience with data governance and cataloging tools (e.g., AWS Glue Data Catalog, Collibra, Alation) and data lineage tracking and designing data access patterns that support identity and least-priviledge access
- Experience collaborating with customer business teams, IT, and partner organizations to understand data requirements and resolve access challenges and conveying technical concepts to both technical and business audiences.
- Proficiency in AI-DLC or equivalent AI-accelerated development methodologies - including prompt engineering as a development discipline, mob programming with AI, and experience validating AI-generated data pipeline code for production deployment in regulated environments