Site Reliability Engineering Manager (Data Infra)

ComplyAdvantage

Charing Cross, United Kingdom

yesterday

Role details

Contract type

Permanent contract

Employment type

Full-time (> 32 hours)

Working hours

Regular working hours

Languages

English

Job location

Remote

Charing Cross, United Kingdom

Tech stack

API

Amazon Web Services (AWS)

Application Performance Management

Cloud Computing

Cloud Engineering

Data Warehousing

Elasticsearch

PostgreSQL

Machine Learning

Platform as a Service (PAAS)

Redis

Reliability Engineering

Newrelic

Datadog

CircleCI

Cloud Platform System

Istio

Grafana

Spark

Multi-Cloud

GIT

Data Layers

Kubernetes

Low Latency

Kafka

Terraform

Data Pipelines

Service Stack

Microservices

Job description

We are looking for a driven and experienced Site Reliability Engineering Manager to join our innovative Tech Team. You will lead and empower a team of SREs, partnering closely with Engineering, Product, and Security to ensure our platforms are resilient, scalable, and secure. You will play a key role in shaping reliability strategy, improving system performance, and embedding best practices in observability, incident management, and automation to support the delivery of high-impact solutions in the fight against financial crime., * Take ownership of your team, being responsible for current team members' growth and development, plus hiring and onboarding new team members

Create a positive environment where your team members thrive to deliver the best outcomes and innovations
Be a role model for your team, mentoring and coaching them, whilst having a learning mindset yourself, being open to new ideas and technologies
Within the context of our broader technology vision, set the direction for your team and take accountability for tech decisions
Use your specific experience working with cloud systems to input into technical decision-making
Work with other stakeholders across engineering to ensure the systems and services your team provides meet the needs of your internal customers
Collaborate, both within your team and across the tribe to ensure your team's implementation meets industry standards

The role reports to the Director of Infrastructure. You'll be managing a team of Engineers focused on the provision and support of our Stateful / Data layer technologies powering all of our services, both in development and production. The main technologies we use are YugaByte (sharded Postgres), Kafka (via Strimzi), Elasticsearch (via ECK), Redis and Spark/data warehousing on GCP and AWS using their PaaS systems. As the technology stack underpins all other engineering work, a collaborative mindset is a must.

Our tech stack:

ComplyAdvantage is fully cloud-based, with a modern kubernetes-focused tech stack. All compute workloads run in Kubernetes, with clusters in multiple regions to support the needs of our global client base. Our production services are multi-cloud by design and are currently hosted in AWS and GCP.

We make heavy use of Terraform and Helm to define our infrastructure and services, and lean heavily on GitOps paradigms - production and non-production environments are defined in git and changes to these environments (both cloud infrastructure and Kubernetes applications) are managed via git.

ArgoCD is our tool of choice for controlling our deployments, and paired with our Istio mesh, allows us for advanced deployment patterns used by our development teams such a progressive rollouts. Our observability stack consists of Grafana Cloud, along with some on-prem Mimir, amongst others. We focus on Open Telemetry for application metrics, with SLO and metric driven alerting at all levels, from Cloud infra through to application performance.

Across the wider Technology team, teams build and release containerised applications to support the wide array of activities that our teams are engaged in - from developing low latency client-facing APIs, to machine learning models and data processing pipelines.

Requirements

Do you have experience in Terraform?, As an Site Reliability Engineering (SRE) Manager, you will

Have experience of managing and growing high performing engineering teams
Have experience with Kubernetes and Terraform
Have experience hosting microservices-based architectures
Have experience of working with cloud native architectures (AWS and GCP are preferred)
Have good communication and writing skills including experience writing technical documentation

Nice to haves:

Experience of working in a start-up/ scale-up environment
Have experience managing observability platforms, whether self-hosted or third party - eg Grafana stack, Datadog, NewRelic
Have experience managing pipeline tools, whether self-hosted or third party - eg CircleCI, ArgoCD, Harness, etc

Benefits & conditions

Pulled from the full job description

Annual leave
Life insurance
Company pension, * Equity participation in our innovative mission to combat financial crime
Unlimited Time Off Policy to promote work-life balance and well-being
We embrace a hybrid approach that requires employees to be in the office for two days a week. We strongly believe that this approach fosters collaboration and enables the building of meaningful relationships
Opportunities for collaboration and career development with smart, like-minded professionals
Annual learning budget to support professional growth
A home office budget to support working from home
Enhanced parental leave and childcare benefits
Life insurance and medical coverage through BUPA, including pre-existing conditions
Pension contribution through The People's Pension

About the company

Our mission is to empower every business to eliminate financial crime. By harnessing AI, a unified platform, and an extensive partner ecosystem, we help customers turn compliance into a catalyst for growth, operational resilience, and enduring regulatory trust. More than 3,000 enterprises across 75 countries rely on our end-to-end platform and the world's most comprehensive financial crime risk intelligence. With full-stack agentic automation, we help organizations automate up to 95% of KYC, AML, and sanctions reviews, cut onboarding times by 50%, reduce false positives by 70%, and handle 7x more work with the same staff. ComplyAdvantage is headquartered in London and has global hubs in New York, Lisbon, Singapore, and Cluj-Napoca. It is backed by Balderton Capital, Index Ventures, Ontario Teachers' Pension Plan, Goldman Sachs, and Andreessen Horowitz. Learn more about compliance re-engineered for the age of AI at complyadvantage.com.

Role details

Job location

Tech stack

Job description

Requirements

Benefits & conditions

About the company

Apply for this position

Good distractions

Moments

Videos View all