Site Reliability Engineer (SRE) - Azure/Kubernetes - Urgent!
Job description
We're looking for a Lead Cloud Site Reliability Engineer (SRE) with strong expertise in Azure, Kubernetes, Terraform, and GitHub to lead large-scale projects and mentor a growing team.
- Lead SRE activities for large-scale cloud projects, providing technical guidance to engineers.
- Deliver solutions across VMs and Kubernetes, ensuring efficient deployment, scaling, and management.
- Implement CI/CD pipelines using GitHub Actions or similar tools.
- Design and manage Infrastructure as Code (IaC) using Terraform (preferred), Ansible, Jenkins, etc.
- Assess networking requirements and design secure solutions (load balancing, firewalls, routing).
- Troubleshoot and resolve complex cloud infrastructure and application issues.
- Mentor junior engineers and promote knowledge sharing within the team.
- Collaborate with stakeholders, vendors, and cross-functional teams (Cyber Security, Testing, Application).
- Support cloud migration initiatives using frameworks and platforms such as CAF, AzureRM, and Google Cloud.
- Represent the team during project delivery and ensure adherence to change control processes.
- Participate in the 24/7 on-call support rota and provide occasional support for previous adoption work.
Requirements
- Strong DevOps background with an automation-first mindset
- Expertise in Azure, Kubernetes, Terraform, and GitHub
- Experience in cloud migration and networking solutions
- Ability to lead projects and communicate effectively
- Familiarity with change control processes
Nice to Have
- Cloud certifications (Azure, GCP, etc.)
- Experience with multi-tenant solutions
- Passion for continuous learning and innovation