Senior Site Reliability Engineer
Role details
Job location
Tech stack
Job description
Senior Site Reliability Engineer
Location: London (Hybrid - 1-2 days onsite)
Contract: Initial 6 Months
IR35 Status: Outside IR35
TechNET IT has partnered with a leading enterprise organisation within the UK media and telecommunications sector, supporting a major cloud and data platform transformation programme across a highly scalable environment. We are currently looking for an experienced Senior Site Reliability Engineer (SRE) to join on an initial 6-month Outside IR35 contract, supporting the migration of critical Snowflake and dbt workloads from GCP to AWS.
This is an excellent opportunity to work within a large-scale engineering function focused on operational excellence, reliability engineering, automation, and modern cloud infrastructure practices.
Key Responsibilities:
Ensure reliability, scalability, and operational performance across Snowflake + dbt workloads
Support cloud migration reliability engineering initiatives from GCP to AWS
Build and enhance monitoring, alerting, and observability frameworks
Drive incident management, RCA processes, and platform automation improvements
Work closely with platform and engineering teams to improve resilience and performance across the wider data ecosystem
Essential Skills & Experience:
Strong AWS expertise across EC2, S3, IAM, VPC, Lambda, and CloudWatch
Proven hands-on experience operating and optimising Snowflake environments
Strong dbt operational knowledge including model dependency troubleshooting
Experience with observability tooling such as Grafana, Datadog, Prometheus, or Splunk
CI/CD and Infrastructure-as-Code experience using Terraform, GitLab, or Jenkins
Strong understanding of SRE best practices including SLOs, SLAs, Error Budgets, automation, and incident response
Desirable Experience:
Docker/Kubernetes/EKS
Python automation
Airflow or Prefect
Secrets Management
Experience supporting zero-downtime migration programmes
Background:
7-12+ years within software engineering or platform engineering environments
3-5+ years operating within dedicated SRE functions
Experience within large-scale enterprise or cloud-native environments preferred
If interested, please apply directly or contact me for further information.
Requirements
Strong AWS expertise across EC2, S3, IAM, VPC, Lambda, and CloudWatch
Proven hands-on experience operating and optimising Snowflake environments
Strong dbt operational knowledge including model dependency troubleshooting
Experience with observability tooling such as Grafana, Datadog, Prometheus, or Splunk
CI/CD and Infrastructure-as-Code experience using Terraform, GitLab, or Jenkins
Strong understanding of SRE best practices including SLOs, SLAs, Error Budgets, automation, and incident response
Desirable Experience:
Docker/Kubernetes/EKS
Python automation
Airflow or Prefect
Secrets Management
Experience supporting zero-downtime migration programmes
Background:
7-12+ years within software engineering or platform engineering environments
3-5+ years operating within dedicated SRE functions
Experience within large-scale enterprise or cloud-native environments preferred