Platform / DevOps Engineer

Xebia
Municipality of Bilbao, Spain
2 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Intermediate

Job location

Municipality of Bilbao, Spain

Tech stack

Application Release Automation
Cloud Engineering
Continuous Integration
DevOps
Programming Tools
Distributed Systems
Fault Tolerance
Site Reliability Engineering Practices
Data Streaming
Data Logging
System Availability
Event Driven Architecture
Deployment Automation
Avro
Dynatrace

Requirements

Job Description - Platform / DevOps Engineer (Event-Driven Platforms)

Role Overview

We are looking for a Platform or DevOps Engineer with strong experience in event-driven architectures and cloud-native platforms.

This role focuses on enabling resilient, observable, and continuously deployable event-driven systems operating at scale. The successful candidate will help deliver core platform capabilities including end-to-end distributed tracing, experimentation frameworks, workflow orchestration, and operational tooling across multiple environments.

The ideal candidate will bring practical experience operating event-driven systems in production environments, with particular emphasis on observability, deployment strategies, and platform reliability within GCP-based ecosystems.

Key Responsibilities

Design, build, and maintain DevOps and platform engineering capabilities within an event-driven, n8n-based architecture
Implement end-to-end traceability across distributed workflows, tracking requests from client entry points through workflow execution and downstream services
Deliver distributed tracing and observability capabilities using OpenTelemetry (OTel)
Support multi-environment observability, monitoring, logging, and operational diagnostics
Implement experimentation and continuous learning capabilities, including A/B testing, canary deployments, and progressive rollout strategies
Enable execution and orchestration of multiple workflows in parallel
Develop and maintain deployment definitions, release automation, and operational tooling
Generate and publish SDKs from Avro schemas across multiple programming languages
Implement resilience patterns within shared libraries and platform components, including retries, circuit breakers, and flow control
Collaborate with engineering teams to establish platform standards, reusable tooling, and operational best practices
Provide ongoing platform support, maintenance, troubleshooting, and continuous improvement

Required Skills & Experience

Strong experience in Platform Engineering and DevOps within cloud-native environments
Hands-on experience with event-driven architectures and distributed systems at scale
Minimum 2 years of experience with GCP, particularly Pub/Sub-based systems
Strong understanding of distributed tracing and observability concepts
Experience designing resilient distributed systems and fault-tolerant integration patterns
Familiarity with canary deployments, progressive delivery, and experimentation frameworks
Experience with CI/CD automation and deployment orchestration
Understanding of workflow orchestration and asynchronous processing patterns
Strong troubleshooting, operational support, and production engineering skills
Experience with n8n workflow automation platforms

Nice to have

Experience working with Avro schemas and schema registry solutions
Familiarity with shared platform libraries and internal developer platforms
Experience supporting high-throughput messaging or event streaming systems
Exposure to SRE practices and platform reliability engineering
Experience building multi-language developer tooling and SDK publishing pipelines
Experience with SDK generation or schema-driven development approaches
Experience implementing OpenTelemetry (OTel) instrumentation and tracing
