Site Reliability Engineer
EMBL-EBI
Hinxton, United Kingdom
yesterday
Role details
Contract type: Permanent contract
Employment type: Full-time (> 32 hours)
Working hours: Regular working hours
Languages: English
Experience level: Senior
Compensation: £75K
Job location: Hinxton, United Kingdom
Tech stack
Amazon Web Services (AWS)
Unix
Cloud Computing
Continuous Integration
Linux
DevOps
Elasticsearch
Monitoring of Systems
HP Systems Insight Manager
Python
Linux System Administration
Open Web Application Security
Red Hat Enterprise Linux - RHEL
Reliability Engineering
Ansible
Prometheus
Web Analytics
Web Applications
Web Hosting Services
Web Services
Private Cloud Environment
GitLab
Git
Kubernetes
Information Technology
Deployment Automation
Puppet
Rundeck
Terraform
Jenkins
Programming Languages
Job description
- Build and maintain the web hosting platform based on Kubernetes, allowing users to deploy web applications.
- Implement and manage infrastructure and application monitoring based on Prometheus.
- Oversee the web analytics platform, currently based on Elasticsearch.
- Use CI/CD tools such as GitLab for automation and operational efficiency.
- Ensure documentation meets standard requirements.
- Drive SRE best practices throughout the team.
- Contribute directly to projects and tasks related to production automation.
- Assist and guide team members with the daily prioritisation of tasks.
Technologies:
- Ansible
- CI/CD
- Cloud
- Elasticsearch
- Git
- GitLab
- Jenkins
- Kubernetes
- Linux
- Prometheus
- Puppet
- Python
- RHEL
- Terraform
- Web
- AWS
- DevOps
- Support
- OWASP
- Security
- Unix
Requirements
- Bachelor's degree or higher in computer science or a related discipline, or equivalent experience.
- At least 3 years of experience in the design, implementation, and operation of large-scale web hosting platforms.
- Experience managing public-facing production services.
- At least 3 years of experience with automated deployment/configuration methods (e.g., Ansible, Puppet, Terraform).
- Solid experience in Kubernetes deployment and administration in public or private cloud.
- Strong Linux administration skills, ideally with RHEL or a RHEL clone.
- Solid skills in automation tools like Jenkins, Rundeck, or similar.
- Hands-on experience using Git in CI/CD and infrastructure-as-code workflows.
- Solid skills in at least one programming language, ideally Python.
- Experience with methodologies for infrastructure monitoring.
- Solid interpersonal and written English communication skills.
- Proven ability to work well in a team, building positive relationships and sharing knowledge.
- Ability to plan and prioritise workloads.
We are seeking a skilled individual to join our Applications Group as a Site Reliability Engineer. Our team is responsible for providing the platforms on which all EBI web services are hosted in Hinxton, near Cambridge, UK. We offer numerous benefits, including generous time off, private medical insurance, and a supportive campus life with recreational activities. Our hiring process is inclusive and we encourage applications from diverse backgrounds. There is potential for contract renewal up to 9 years, and we provide a relocation package for international applicants.
About the company
Working at EMBL-EBI gives you the opportunity to focus your energy and skills on something that really matters: using technology to contribute to discoveries that benefit humankind. We empower researchers everywhere to realise the potential of ‘big data’ in biology, and build sophisticated tools for exploring life at the atomic level.