Site Reliability Engineer

Everi, Inc.

2 days ago

Role details

Contract type

Temporary contract

Employment type

Full-time (> 32 hours)

Working hours

Regular working hours

Languages

English

Experience level

Senior

Job location

Remote

Tech stack

Java

Amazon Web Services (AWS)

Cloud Computing

Cloud Computing Security

Continuous Integration

Distributed Systems

Python

Queueing Systems

RabbitMQ

Reliability Engineering

Prometheus

TypeScript

Software Vulnerability Management

Amazon Web Services (AWS)

Data Logging

Grafana

Gitlab

Kubernetes

Kafka

Video Streaming

Appdynamics

Docker

Jenkins

Job description

As a Site Reliability Engineer, you'll help shape and strengthen Evri's cloud infrastructure, reliability engineering practices and operational excellence. You'll work hands-on across AWS, container orchestration, observability platforms, and CI/CD ecosystems to ensure our systems are resilient, secure and optimised for scale.

Your work will be critical in enabling product teams to ship fast, safely and with confidence - all while improving performance, reducing risk, and ensuring Evri remains one of the most reliable logistics platforms in the UK., * Drive architectural and technical decision-making, ensuring infrastructure and platform designs support long-term scalability, reliability and security.

Partner with Delivery to plan and prioritise platform and infrastructure work for maximum technical and operational impact.
Mentor engineers and uplift technical capability, championing strong engineering practices and continuous improvement.
Shape technical strategy by contributing to architectural roadmaps, standards, and patterns-balancing innovation with long-term risk and resilience.
Embed quality, security, performance and compliance into all engineering designs, processes and operational workflows, ensuring reliability at scale.

Requirements

5+ years' experience in a DevOps or SRE role, ideally within AWS-based environments.
Strong proficiency with AWS CDK and Infrastructure as Code to deploy and optimise cloud infrastructure.
Hands-on experience with Docker and container orchestration such as Kubernetes (EKS) or Amazon ECS.
Proven experience building and maintaining CI/CD pipelines using GitLab, Jenkins or similar tooling.
Deep knowledge of monitoring, observability and logging tools such as Prometheus, Grafana, AppDynamics and OpenSearch.
Proficiency in Python, TypeScript or Java for building automation, tooling and reliability improvements.
Solid understanding of cloud security, including WAF, patching, vulnerability management and AWS Shield.
Working knowledge of message queues and streaming technologies such as RabbitMQ, Kinesis or Kafka.
Strong analytical and operational problem-solving skills, with the ability to identify performance constraints, eliminate single points of failure and scale distributed systems.
Experience participating in incident response, including root-cause analysis and driving long-term reliability improvements.
Excellent communication and collaboration skills to work effectively across architecture, delivery, and engineering teams.

About the company

At Evri, we know we only grow if our people do too. That's why we're committed to building a truly inclusive and diverse workplace where everyone can bring - and be - their whole authentic selves. We're on a journey to better represent the customers we serve around the UK. We're committed to removing barriers and ensure that each person at Evri is valued for who they are, and what they bring to our business.