Site Reliability Engineer Performance Testing
Stott and May
1 month ago
Role details
Contract type: Permanent contract
Employment type: Full-time (> 32 hours)
Working hours: Regular working hours
Languages: English
Experience level: Senior
Job location: Remote
Tech stack
Amazon Web Services (AWS)
Azure
Bash
Cloud Computing
Cloud Engineering
Linux
HP LoadRunner
JMeter
Python
Reliability Engineering
Ansible
Prometheus
Scripting (Bash/Python/Go/Ruby)
Load Balancing
Performance Testing
Grafana
Gatling
Reliability of Systems
GitLab CI
Kubernetes
Terraform
Jenkins
Job description
We are seeking a skilled Site Reliability Engineer (SRE) - Performance Testing to join our team in Farnborough. The successful candidate will play a key role in designing, implementing, and maintaining performance testing frameworks to ensure application reliability, scalability, and efficiency. This role combines expertise in performance engineering, infrastructure automation, and observability to optimise system performance across production and pre-production environments.
- Design and implement performance testing strategies using tools such as JMeter, Gatling, or LoadRunner.
- Monitor and analyse system performance metrics to identify bottlenecks and optimise infrastructure.
- Develop and maintain CI/CD pipelines to support performance testing and automated deployments.
- Collaborate with development and operations teams to ensure applications meet performance and reliability standards.
- Automate operational tasks and environment provisioning using Ansible, Terraform, or Python scripting.
- Build and maintain monitoring and alerting solutions using Prometheus, Grafana, and related tools.
- Participate in incident response and conduct root cause analysis for performance-related issues.
- Document performance benchmarks, testing procedures, and system configurations to support continuous improvement.
Requirements
- Proven experience in performance testing and system reliability engineering.
- Strong hands-on experience with JMeter, Gatling, or LoadRunner.
- Solid understanding of CI/CD pipelines and automation tools such as Jenkins or GitLab CI.
- Proficiency in scripting languages such as Python or Bash.
- Experience with Infrastructure as Code tools (Ansible, Terraform).
- Strong knowledge of monitoring and observability tools (Prometheus, Grafana).
- Familiarity with Linux environments and cloud platforms (AWS, Azure, or GCP).
- Excellent analytical, problem-solving, and communication skills.
Desirable skills
- Experience with Kubernetes and container-based environments.
- Exposure to chaos engineering or resilience testing.
- Understanding of network performance tuning and application load balancing.
- Relevant certifications in performance testing or cloud engineering.