DevOps Engineer- Kubernetes
Role details
Job location
Tech stack
Job description
You will play a crucial role in managing and optimizing our on-premises infrastructure and ensuring the smooth operation of our software development and deployment processes. You will work closely with cross-functional teams to automate, monitor, and enhance our on-premises systems, ensuring they meet high availability, scalability, and security standards. Create standard set of tools and processes which enable our customers to rapidly manage services across their pipelines in an automated, secure and compliant way. What you'll be doing
-
Infrastructure Automation: Design, implement, and maintain infrastructure as code (IAC) using tools like Ansible and Terraform to automate the provisioning and configuration of on-premises servers and resources.
-
Continuous Integration/Continuous Deployment (CI/CD): Set up and manage Jenkins/GitLab,argo CD, Flux, pipelines to automate the build, test, and deployment of software applications across various environments.
-
Scripting and Development: Utilize Python and other scripting languages to develop custom automation scripts and tools.
-
Kubernetes: Kubernetes administration- Install, configure, and maintain Kubernetes clusters (on-prem or cloud), Troubleshoot cluster issues, node failures, and workload performance problems, Implement security best practices (image scanning, secrets management, container hardening), Handle backup/restore (Velero )
-
Monitoring and Logging: Implement and maintain monitoring solutions (e.g., Splunk,Prometheus,Dynatrace) to proactively identify and resolve issues in the on-premises infrastructure. Create alerts and dashboards to ensure system health.
-
Server and OS Management: Manage Linux and Windows servers, including OS patching, software installation, and configuration management.
-
Security and Compliance: Collaborate with security teams to implement and maintain security measures, including compliance standards.
-
Agile Collaboration: Work within Agile development methodologies, participating in sprint planning, stand-ups, and retrospectives to align DevOps efforts with the development and delivery pipeline., When you visit any website, it may store or retrieve information on your browser, mostly in the form of cookies. Because we respect your right to privacy, you can choose not to allow some types of cookies. However, blocking some types of cookies may impact your experience of the site and the services we are able to offer. Required Cookies These cookies are required to use this website and can't be turned off. Show More Details Required Cookies Provider Description Enabled SAP as service provider We use the following session cookies, which are all required to enable the website to function:
-
"route" is used for session stickiness
-
"careerSiteCompanyId" is used to send the request to the correct data centre
-
"JSESSIONID" is placed on the visitor's device during the session so the server can identify the visitor
-
"Load balancer cookie" (actual cookie name may vary) prevents a visitor from bouncing from one instance to another
Requirements
- Hands-on experience with Kubernetes in production clusters
- Strong understanding of Kubernetes core concepts including controllers, scheduling, storage, networking, RBAC, and security
- Install, configure, upgrade, and maintain Kubernetes clusters (on-prem or cloud)
- Implement and manage Network Policies, Ingress controllers, Storage Classes, certificate management, and add-ons (cert-manager, metrics, logging, ingress)
- Troubleshoot cluster, node, and workload performance issues
- Experience with backup/restore tools (e.g., Velero)
- Proficiency with Helm, Kustomize, GitOps (ArgoCD/Flux), and Ingress controllers
- Strong CI/CD skills (GitLab, Jenkins, GitHub Actions)
- Solid experience with Terraform and Ansible
- Experience with monitoring and logging tools (Prometheus/Grafana, ELK, Dynatrace, Splunk)
- Strong Linux administration background
- Proficiency in Bash and one of: Python / Go
- Understanding of networking fundamentals (DNS, load balancers, routing, TLS)
- Knowledge of security best practices and container security techniques (image scanning, secrets management, container hardening)
- Strong problem-solving, analytical, and troubleshooting skills
- Excellent communication and collaboration abilities
- Ability to work in a fast-paced Agile environment and under pressure
- Self-driven, able to learn new technologies independently, with strong planning and organisational skills
NICE TO HAVE
- CKA / CKAD / CKS certifications
- Bachelor's degree in Computer Science, IT, or equivalent experience
- Knowledge of TV and Broadcast systems
- Familiarity with security tools (Qualys, CrowdStrike, Trivy, etc.)
- Experience with VMware Kubernetes Services
- Knowledge of Vault (HashiCorp) or External Secrets
- Familiarity with configuration management tools (Ansible, Puppet, Chef)
- Experience with CI/CD and IaC tooling (Jenkins, GitLab, Terraform, ArgoCD, Flux)
- Knowledge of VMware environment
Benefits & conditions
- 10% on target bonus
- BT Pension scheme, minimum 5% Employee contribution, BT contribution 10%
- Life Assurance Cover
- Exclusive colleague discounts on our latest and greatest BT broadband packages, BT TV with TNT Sports and NOW Entertainment
- Equal family leave: receive 18 weeks at full pay, 8 weeks at half pay and 26 weeks at the statutory rate. It's for all parents, no matter how your family is made up.
- Enhanced women's health support: including help with menopause symptoms, cancer screenings, period care and more.
- 25 days annual leave (not including bank holidays), increasing with service
- 24/7 private virtual GP appointments for UK colleagues
- 2 weeks carer's leave
- World-class training and development opportunities
- Option to join BT Shares Saving schemes