data engineer
Role details
Job location
Tech stack
Job description
- Own and evolve the company's data infrastructure: event tracking, ingestion pipelines, and analytics tooling
- Manage data collection pipelines via Snowplow, GTM (server-side and client-side), and Airflow-orchestrated workflows running on Cloud Composer
- Build and maintain BigQuery datasets: partitioning, clustering, cost optimization, scheduled queries, and access controls
- Write and maintain Python scripts for data ingestion, transformation, and reporting
- Manage Cloud Storage (GCS) for data staging, Snowplow enriched event storage, and pipeline artifacts
- Configure and maintain GCP IAM, service accounts, and access controls across services and external APIs
- Monitor and debug pipelines using Cloud Logging & Monitoring; set up alerting and ensure pipeline observability
- Work with relational and non-relational databases: schema design, query optimization, performance tuning
- Collaborate with frontend, QA, DevOps, and product teams
- Participate in code reviews and uphold code quality standards
- Contribute to technical improvements (testing, observability, performance, maintainability)
- Mentor mid-level engineers and contribute to architectural decisions
Requirements
-
Python - hands-on experience writing production scripts and data pipelines
-
SQL - strong practical experience: queries, indexes, performance tuning, and analytical workloads
-
Google Cloud Platform - working experience across GCP services including:
-
BigQuery - partitioning, clustering, cost optimization, scheduled queries, access controls
-
Cloud Composer - managed Airflow: DAG deployment, environment configuration, GCP Console monitoring
-
Cloud Storage (GCS) - data staging, pipeline artifacts, enriched event storage
-
IAM & service accounts - secure access management across GCP services and external APIs
-
Cloud Logging & Monitoring - pipeline observability, alerting, and debugging
Apache Airflow / Cloud Composer - building and maintaining DAGs and data workflowsSnowplow - event tracking setup and data collection pipelinesGoogle Tag Manager - server-side and client-side configuration and managementAzure Databricks - data processing and analytics workloadsUnderstanding of clean code principles (maintainable, testable pipelines and scripts)Comfortable with Git and code review workflows
Nice-to-have skills
- C# and .NET Core - backend/API development experience is a plus
- REST APIs - design, implementation, authentication/authorization basics
- SQL Server / T-SQL - experience with Microsoft SQL Server environments
- MongoDB
- Elasticsearch
- RabbitMQ or similar queuing system
- Docker / containers
- GraphQL (building or consuming APIs)
- Background job processing (e.g. Hangfire)
- Observability tools (e.g. New Relic or similar)
- PowerShell scripting
- Agile tooling (e.g. Jira)
- AI-assisted development tools (Cursor, etc.)
Benefits & conditions
- We prioritise office presence and emphasise in-person collaboration, but also offer appropriate adjustments where needed, in line with company policy
- Extra wellbeing days on top of your annual leave allowance
- Up to 3 paid volunteering days each year
- 24/7 confidential Employee Assistance Programme (wellbeing, mental health, legal & financial support)
- Learning & development support via the Frontiers Learning Hub
- Competitive local benefits country dependent (e.g. healthcare and pension/retirement provision)