Platform / DevOps Engineer (Gerona)

Xebia
Oña, Spain
2 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Intermediate

Job location

Oña, Spain

Tech stack

A/B testing
Application Release Automation
Cloud Computing
Continuous Integration
DevOps
Programming Tools
Distributed Systems
Fault Tolerance
Site Reliability Engineering Practices
Data Streaming
Workflow Management Systems
Data Logging
Cloud Platform System
Software Troubleshooting
Event Driven Architecture
Deployment Automation
Avro
Dynatrace
Programming Languages

Job description

We are looking for a Platform or DevOps Engineer with strong experience in event-driven architectures and cloud-native platforms. This role focuses on enabling resilient, observable, and continuously deployable event-driven systems operating at scale. The successful candidate will help deliver core platform capabilities including end-to-end distributed tracing, experimentation frameworks, workflow orchestration, and operational tooling across multiple environments.

The ideal candidate will bring practical experience operating event-driven systems in production environments, with particular emphasis on observability, deployment strategies, and platform reliability within GCP-based ecosystems.

Key responsibilities

Design, build, and maintain DevOps and platform engineering capabilities within an event-driven, n8n-based architecture
Implement end-to-end traceability across distributed workflows, tracking requests from client entry points through workflow execution and downstream services
Deliver distributed tracing and observability capabilities using OpenTelemetry (OTel)
Support multi-environment observability, monitoring, logging, and operational diagnostics
Implement experimentation and continuous learning capabilities, including A/B testing, canary deployments, and progressive rollout strategies
Enable execution and orchestration of multiple workflows in parallel
Develop and maintain deployment definitions, release automation, and operational tooling
Generate and publish SDKs from Avro schemas across multiple programming languages
Implement resilience patterns within shared libraries and platform components, including retries, circuit breakers, and flow control
Collaborate with engineering teams to establish platform standards, reusable tooling, and operational best practices
Provide ongoing platform support, maintenance, troubleshooting, and continuous improvement

Requirements

Strong experience in Platform Engineering and DevOps within cloud-native environments
Hands-on experience with event-driven architectures and distributed systems at scale
Minimum 2 years of experience with GCP, particularly Pub/Sub-based systems
Strong understanding of distributed tracing and observability concepts
Experience designing resilient distributed systems and fault-tolerant integration patterns
Familiarity with canary deployments, progressive delivery, and experimentation frameworks
Experience with CI/CD automation and deployment orchestration
Understanding of workflow orchestration and asynchronous processing patterns
Strong troubleshooting, operational support, and production engineering skills
Experience with n8n workflow automation platforms

Nice to have

Experience working with Avro schemas and schema registry solutions
Familiarity with shared platform libraries and internal developer platforms
Experience supporting high-throughput messaging or event streaming systems
Exposure to SRE practices and platform reliability engineering
Experience building multi-language developer tooling and SDK publishing pipelines
Experience with SDK generation or schema-driven development approaches
Experience implementing OpenTelemetry (OTel) instrumentation and tracing
