Lead Data Platform Engineer
Role details
Job location
Tech stack
Job description
- Infrastructure & platform: AWS (EKS, MSK, S3), Kubernetes-based workloads, hybrid infrastructure setup-with a strong focus on scalability, resilience and cost-efficient cloud architecture
- Event streaming & ingestion: Kafka (AWS MSK), event-driven ingestion from mobile, web and backend systems-ensuring reliable, high-volume event processing
- Data processing & orchestration: Apache Spark for distributed data processing, Dagster for pipeline orchestration and dbt for structured transformation and modeling
- Warehouse & experimentation: Snowflake as the central warehouse layer and GrowthBook for experimentation evaluation-supporting reliable analytics and product experimentation
- Monitoring & observability: Prometheus, Grafana, InfluxDB, Graylog / ELK-for pipeline visibility, incident investigation and proactive reliability management
- AI & feature enablement foundations: Infrastructure supporting scalable feature computation, low-latency access patterns and production-ready AI-driven product capabilities
- Collaboration: GitHub-based workflows, structured RFC processes, cross-team architecture discussions and outcome-driven delivery, * Tools and support to do your best work: MacBook, monitor, and €1,000 annual learning budget, plus regular feedback and career development
- A truly international team with English as our company language and a shared commitment to growth and well-being - supported by Sunday Supplements and Nilo Health, Excited to take Yazio to the next level and help people worldwide lead healthier lives? We'd love to hear from you! Please send us:
- Your CV in English
- Any additional materials that showcase your work (e.g., LinkedIn profile, portfolio)
Requirements
Do you have experience in Spark?, * Extensive experience designing and operating large-scale, cloud-based data platforms in production
- Strong background in distributed systems, event-driven architectures and streaming pipelines
- Proven ownership of production reliability (SLAs/SLOs, monitoring, incident handling)
- Experience architecting systems that successfully scaled through significant growth phases
- Clear cost-performance awareness when running compute-intensive workloads
- Experience enabling data-driven product features or AI-powered use cases (e.g., feature computation, low-latency access patterns)
- Solid people leadership experience-you have led engineers before and taken responsibility for their growth and performance
- Strong architectural judgment and the confidence to make and defend technical decisions
- Ability to translate infrastructure work into measurable business outcomes
You've likely worked in a product-driven, high-traffic environment and have seen what breaks at scale-and learned how to design systems that don't.
Benefits & conditions
Yazio is entering a new phase where our data platform must support increasing scale, complexity and AI-powered product features.
You will evolve our data platform from a reactive, analytics-support setup into a proactive, production-grade infrastructure that enables experimentation velocity and product innovation.
You will:
- Own reliability, scalability and cost discipline of ingestion and transformation systems
- Design and deliver infrastructure for real-time/near-real-time feature computation
- Define and enforce SLAs/SLOs, monitoring and incident practices
- Set architecture standards and prevent reactive migrations through foresight and trade-offs
- Translate platform work into measurable OKRs that deliver product impact
- Partner closely with Analytics to define clear ownership boundaries
- Lead and grow a small, ambitious team while raising technical standards
You will be hands-on where it matters: RFCs, architecture reviews, incident response and mentoring.