Data Warehouse & Reporting Developer
Fujitsu
Brussels, Belgium
2 days ago
Role details
Contract type
Permanent contract Employment type
Full-time (> 32 hours) Working hours
Regular working hours Languages
English Experience level
SeniorJob location
Brussels, Belgium
Tech stack
API
Airflow
Apache HTTP Server
Databases
System Configuration
Data Architecture
Data Mining
Data Warehousing
Relational Databases
Python
Enterprise Messaging Systems
Online Analytical Processing
Open Source Technology
RabbitMQ
Standard Sql
Data Streaming
Unstructured Data
Spark
Containerization
Data Lake
Kubernetes
Luigi
Production Code
Kafka
Data Management
Data Lakehouse
Data Pipelines
Docker
Job description
Fujitsu is looking for a Data Warehouse & Reporting Developer to support a major EU Institution in Brussels. The role focuses on the design, development, and maintenance of a fully open-source Data Lakehouse platform , enabling scalable analytics and reporting across large volumes of structured and unstructured data.
️ Main Responsibilities
- Design, develop, and maintain a fully open-source Data Lakehouse
- Build and optimize scalable data pipelines for structured and unstructured data
- Integrate data from multiple sources (databases, APIs, streaming services, cloud platforms)
- Optimize queries and workflows for performance and efficiency
- Write modular, testable, production-grade code
- Ensure data quality through monitoring, validation, and consistency checks
- Define and execute test programs
- Produce clear and comprehensive technical documentation
- Support deployment and system configuration activities
Requirements
- Extensive experience as a Data Engineer or Data Architect
- Strong background in data warehouse and/or data lakehouse architecture
- Excellent SQL skills
- Strong experience with open-source data transformation tools :
- dbt
- Apache Spark
- Trino
- Good knowledge of Python
- Experience building end-to-end ELT data pipelines
Data Platforms & Infrastructure
- Open-source orchestration tools (Airflow, Dagster, Luigi)
- Event streaming & messaging platforms (Kafka, RabbitMQ)
- Storage frameworks (Apache Iceberg, Delta Lake)
- Containerization & orchestration (Docker / Podman, Kubernetes)
- Relational databases and data modelling tools
- OLAP and data mining tools