Software Engineer (DevOps) - Observability Platform
Job description
Our Technology team builds the systems and experiences that keep Revolut moving. From the infrastructure behind our innovative app to the features used by millions of people around the world, they bring sharp thinking, speed, and a focus on meaningful impact to everything they do.
- Designing, implementing, and assembling scalable and resilient observability solutions across logs, metrics, and traces, leveraging existing market solutions or building technologies from scratch
- Building robust APIs and data pipelines to ingest, process, and expose observability data to product teams
- Collaborating closely with product teams to understand their observability needs and integrate solutions that empower them to monitor, alert, and debug their components effectively
- Optimising the observability infrastructure for performance, accuracy, cost-effectiveness, and an exceptional user experience
- Developing and maintaining tooling to automate onboarding components to, and sunsetting them from, the observability platform, and to streamline data collection
- Contributing to the strategic roadmap of the observability platform, identifying and implementing new features and improvements
Requirements
- 7+ years of experience as a software engineer, with 3+ years focused on building and maintaining observability platforms or highly distributed systems
- Familiarity with monitoring, alerting, and incident response best practices
- Expertise in designing and implementing APIs and data pipelines for high-throughput, real-time data ingestion
- A practical understanding of distributed systems and their unique observability challenges
- Hands-on experience with core observability tools, such as Prometheus, Grafana, Loki, ELK stack (Elasticsearch, Logstash, Kibana), Jaeger, OpenTelemetry, etc.
- Experience with containerisation and orchestration technologies (Docker, Kubernetes) and infrastructure as code tools (e.g., Ansible, Terraform)
- Proficiency in Python as your primary engineering language
Nice to have
- Previous experience in a DevOps, SRE, or developer experience role
- Experience with multiple cloud platforms (AWS, GCP, Azure) and their native observability services
- Contributions to open-source observability projects
- A track record of prototyping and sketching new solutions to complex problems
Building a global financial super app isn't enough. Our Revoluters are a priority, and that's why in 2021 we launched our inaugural D&I Framework, designed to help us thrive and grow every day. We're not just doing this because it's the right thing to do. We're doing it because we know that seeking out diverse talent and creating an inclusive workplace is the way to create exceptional, innovative products and services for our customers. That's why we encourage applications from people with diverse backgrounds and experiences to join this multicultural, hard-working team.