OpenShift Site Reliability Engineer

Infinity Quest
2 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Compensation
£ 68K

Job location

Tech stack

Microsoft Windows
Configuration Management
Continuous Integration
Linux
Distributed Systems
VMware ESX Servers
Hypervisor
Python
OpenShift
Red Hat Enterprise Linux - RHEL
Reliability Engineering
Ansible
Prometheus
Ruby
Shell Script
YAML
Data Logging
Scripting (Bash/Python/Go/Ruby)
Grafana
Containerization
GitLab CI
Kubernetes
Terraform
Docker
Jenkins
Go
Microservices

Job description

We are seeking a skilled OpenShift Site Reliability Engineer (SRE) to join our team. In this role, you will be responsible for ensuring the reliability, availability, and performance of our OpenShift-based virtual and container platforms and services, with a focus on automation.

  1. Work and collaborate across teams, such as Applications, Hardware, and Network.

  2. Develop secure service architecture using cloud-native technologies.

  3. Develop systems, primarily in Shell scripting, YAML, Ruby, Python, and Go, to prevent outages through automatic scanning and remediation.

  4. Establish and enforce SRE best practices through platform constraints and high-fidelity system modeling.

  5. Participate in an on-call rotation.

  6. Take accountability for the control and compliance of the engineering process.

  7. Promote innovation and adoption of cutting-edge specialist technologies and practices within the domain.

  8. Promote the development of engineers through coaching and mentoring.

  9. Consult as required in other areas to assist and provide a different perspective to programmes or projects that require it.

Requirements

  1. Hands-on experience with OpenShift virtualization and Kubernetes administration.

  2. Understanding of distributed systems and common distributed-system failure domains. Experience managing a production service with Red Hat, Windows, and ESXi.

  3. Strong knowledge of Linux systems and networking.

  4. Experience with monitoring, logging, alerting, and observability tools (e.g., OpenTelemetry, Prometheus, Grafana, Splunk).

  5. Proficiency in scripting and automation languages and tools such as Python, Shell, Go, and Terraform.

  6. Familiarity with CI/CD tools (e.g., Jenkins, GitLab CI).

  7. Understanding of containerization (Docker) and microservices architecture.

  8. Experience with Ansible for configuration management and deployment.

  9. Good problem-solving and communication skills.

Soft Skills:

  1. Experience with, and an affinity for, improving team performance

  2. Active listening skills

  3. Mindsets and Behaviors/Self-mastery

  4. Proven experience in Compute, OpenShift, Kubernetes, Hypervisors, Storage, Windows, Networks and Linux

  5. Work with industry groups and vendors outside the Client to establish and maintain the Client's involvement and influence.
