Distributed Systems Engineer - Exascale Data Services
Role details
Job location
Tech stack
Job description
ECMWF is hiring a Distributed Systems Engineer (A2) to join the Data Infrastructure Services Team. The team builds and maintains high-performance data services that underpin ECMWF's real-time weather forecasting and climate data operations. This role focuses on developing and scaling systems that provide efficient access to petabyte-scale datasets used by scientists, policymakers, and operational services worldwide.
Your core responsibility is the development and maintenance of software and service components that power the Copernicus Climate Data Store (CDS). The CDS provides scalable access to ERA5 (12PiB), ERA6 (~25PiB) and many other climate datasets. It serves over 100 TiB of data, responds to around half a million user requests, and supports over 5,000 active users every day.
You will contribute to advancing our data infrastructure by developing reusable components that support fast, scalable, and consistent data access. This work will align technologies across ECMWF's datastores, including those for our core mandate, external services, and Destination Earth. The focus is on harmonising data handling solutions and access interfaces to ensure performance and consistency across all of ECMWF's datastore platforms.
Beyond internal development, you will act as a technical catalyst and architectural authority for software contributions from external partners. By providing technical stewardship and rigorous oversight, you will ensure that the innovations align with ECMWF's engineering vision and integrate seamlessly into our operational pipelines. This collaborative oversight allows you to multiply your technical impact.
In this role, you would occupy a pivotal position in the digital transformation of weather and climate science. The systems you create will be designed to meet the extreme demands of modern analysis, delivering high-fidelity access to the world's most significant environmental datasets. You will directly influence how civilizational challenges are addressed by redefining the efficiency and intuitive nature of weather and climate data access., * Design technical architecture and define requirements for ECMWF datastore software that delivers climate data to a global user base, working closely with production and infrastructure teams.
- Develop and maintain robust and scalable software and services for distributed, high-volume datastores.
- Provide technical stewardship and oversight to the work of external contributors, ensuring technical alignment, code quality, and operational readiness
- Develop integrations between the Climate Data Store and other technologies, including ECMWF and Destination Earth data systems.
- Collaborate with operational teams to deploy, monitor, and support production services that provide reliable data access.
- Present ECMWF data systems at international conferences and workshops, showcasing technical advancements and fostering collaboration., At ECMWF, we consider an inclusive environment as key for our success. We are dedicated to ensuring a workplace that embraces diversity and provides equal opportunities for all, without distinction as to race, gender, age, marital status, social status, disability, sexual orientation, religion, personality, ethnicity and culture. We value the benefits derived from a diverse workforce and are committed to having staff that reflect the diversity of the countries that are part of our community, in an environment that nurtures equality and inclusion.
Requirements
Do you have experience in System design?, * Excellent analytical and problem-solving skills with a proactive and constructive approach.
- Flexibility, with the ability to adapt to changing priorities and user needs.
- Excellent interpersonal and communication skills are vital, to communicate with a wide range of technically skilled colleagues, including remote teams, and to non-technical staff.
- Dedication and enthusiasm to work in a team and across teams.
- Highly organised with the capacity to work on a diverse range of tasks., * An advanced university degree (EQ7 level or above) or equivalent experience in computer science or engineering, computational science, physics or natural sciences, mathematics, or a related discipline.
- Proven experience in Python or Rust for high-performance applications.
- Good understanding of distributed systems and system design (e.g., consistency models, asynchronous I/O, networked services).
- Experience in technical leadership or code review, ensuring high standards across internal and external contributions.
- Familiarity with container orchestration or microservice patterns in high-availability, operational settings is advantageous.
- Familiarity with full software lifecycle maintenance, including testing, deployment, and long-term support, is highly desirable.
- Candidates must be able to work effectively in English