Senior Site Reliability Engineer (AWS, DBT)

Initialize IT
Charing Cross, United Kingdom
2 days ago

Role details

Contract type
Contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Senior

Job location

Charing Cross, United Kingdom

Tech stack

Airflow
Amazon Web Services (AWS)
Amazon Web Services (AWS)
Amazon Web Services (AWS)
Continuous Integration
Software Debugging
Identity and Access Management
Python
Reliability Engineering
Prometheus
Datadog
Snowflake
Grafana
Amazon Web Services (AWS)
Gitlab
Functional Programming
Cloudwatch
Terraform
Splunk
Docker
Jenkins

Job description

Senior Site Reliability Engineer (AWS, DBT) - London - 2 days a week - up to £600

Tech Stack: Snowflake, dbt, AWS (Migration from GCP + Snowflake + dbt)

Role: Ensure reliability, performance, and operational excellence of Snowflake + dbt workloads on AWS.

MUST HAVE Skills:

AWS core services expertise (EC2, S3, IAM, VPC, CloudWatch, Lambda).

Snowflake operations, tuning, cost governance, incident response.

dbt operations & model dependency debugging.

Observability with Grafana/Prometheus/Datadog/Splunk.

CI/CD with Terraform, GitLab/Jenkins.

Strong SRE fundamentals: SLO/SLA/Error Budgets, RCA, automation.

GOOD TO HAVE Skills:

Airflow/Prefect, Secrets Management.

Docker/EKS, Python automation.

Zero-downtime migration patterns.

Responsibilities:

Reliability for Snowflake + dbt workloads.

Migration reliability engineering.

Monitoring, alerting, dashboards.

Incident ownership & RCA.

Experience: 7-12+ years engineering, 3-5+ years SRE.

Requirements

AWS core services expertise (EC2, S3, IAM, VPC, CloudWatch, Lambda).

Snowflake operations, tuning, cost governance, incident response.

dbt operations & model dependency debugging.

Observability with Grafana/Prometheus/Datadog/Splunk.

CI/CD with Terraform, GitLab/Jenkins.

Strong SRE fundamentals: SLO/SLA/Error Budgets, RCA, automation.

GOOD TO HAVE Skills:

Airflow/Prefect, Secrets Management.

Docker/EKS, Python automation.

Zero-downtime migration patterns.

Responsibilities:

Reliability for Snowflake + dbt workloads.

Migration reliability engineering.

Monitoring, alerting, dashboards.

Incident ownership & RCA.

Experience: 7-12+ years engineering, 3-5+ years SRE.

Apply for this position