Data Architect

Community Of
Municipality of Madrid, Spain
4 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English

Job location

Remote
Municipality of Madrid, Spain

Tech stack

Artificial Intelligence
Airflow
Amazon Web Services (AWS)
Apache HTTP Server
Computing Platforms
Cloud Computing
Data Architecture
Data Governance
ETL
DevOps
Distributed Data Store
Distributed Systems
Fault Tolerance
Information Lifecycle Management
NETCONF
Performance Tuning
OPC Unified Architecture
Simple Network Management Protocol (SNMP)
Spark
Data Management
Data Pipelines

Job description

Responsibilities

Define and lead the design of edge-native data architectures for industrial network telemetry, covering ingestion, modeling, storage, and local inference layers.
Translate and adapt modern cloud lakehouse patterns (Apache Iceberg) into scalable, edge-compatible architectures, ensuring efficiency in constrained environments.
Own the end-to-end data architecture, including data modeling, partitioning strategies, lifecycle management, and evolution of datasets.
Design and oversee data orchestration frameworks (Airflow, Prefect, Dagster, or similar), ensuring reliability, observability, and scalability across distributed edge systems.
Architect modular, reusable, and protocol-agnostic ingestion frameworks tailored for industrial edge gateways and heterogeneous data sources.
Partner closely with AI/ML teams to define robust data pipelines that enable on-device inference, real-time analytics, and anomaly detection use cases.
Produce and maintain architecture decision records (ADRs), technical blueprints, and reference architectures, aligning with product strategy and long-term scalability.
Drive the evolution of decentralized, real-time, low-latency data platforms, minimizing dependency on centralized cloud infrastructures.
Act as a technical reference and decision-maker, guiding best practices in data architecture, governance, and platform design.

Benefits

Permanent full-time contract.
100% remote work, based anywhere in Spain.
Flexible working hours and reduced working hours on Fridays.
Competitive salary aligned with a senior/architect-level profile.
Private health insurance fully covered by the company.
Free access to an e-learning platform covering cloud, DevOps, security, languages, etc.
Gympass-Wellhub subscription with access to gyms and sports activities.

Requirements

Proven experience as a Data Architect (or Senior Data Engineer evolving into architecture roles) in complex, distributed data platforms.
Strong hands-on expertise with Apache Iceberg, including schema evolution, partitioning strategies, performance optimization, and time travel.
Deep knowledge of data architecture patterns (lakehouse, data mesh, distributed systems, edge computing constraints).
Solid experience designing and governing data orchestration frameworks with a focus on scalability, observability, and fault tolerance.
Experience building ETL/ELT architectures in resource-constrained or edge/near-edge environments.
Strong understanding of data modeling, data lifecycle, governance, and data quality frameworks.
Knowledge of industrial/OT protocols (SNMP, NETCONF, OPC-UA) is a strong plus.
Familiarity with low-level or performance-oriented languages (Rust, C) is a plus for edge execution components.
Exposure to cloud ecosystems (AWS, Glue, Lambda, Spark) is valued but not mandatory.
Fluent English required.
Based in Barcelona or Madrid with a valid work permit.

Equal Opportunity

No barrier should hold back talent.
Dedicated to equal opportunities, our Group embraces an inclusive approach: every position is open to all, and we actively support the integration of people with disabilities.
