Site Reliability Engineer

TEKsystems
Charing Cross, United Kingdom
1 month ago

Role details

Contract type
Temporary contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English

Job location

Charing Cross, United Kingdom

Tech stack

API
Amazon Web Services (AWS)
Build Automation
Azure
Databases
Monitoring of Systems
Python
Performance Tuning
Reliability Engineering
Software Deployment
Software Systems
Programming Languages

Job description

Our client are hiring a Site Reliability Engineering to join their Site Reliability Engineering group whose main objective is to ensure their services are consistently reliable for their customers. As part of a dynamic and self-organizing team, you will have the opportunity to work with modern programming languages and manage software within public cloud environments. Our clients cross-functional teams cover the entire technology stack, from front ends to APIs and databases, providing the resources to build, deploy, and operate their own software., * Support applications in production, including responding to incidents and conducting post-incident reviews.

  • Apply observability engineering to proactively detect system degradation, understand system state, and quickly diagnose issues.
  • Investigate and resolve production issues effectively.
  • Build automation tools to reduce operational toil and enhance developer productivity.
  • Scope technical projects and break them down into user stories and tasks within an engineering team.
  • Directly contribute to the design and coding of software systems.
  • Contribute to building systems that are secure, reliable, scalable, and extensible.
  • Make informed technical decisions with input from teammates and engage in technical discussions with other engineering teams.
  • Build and maintain CI/CD pipelines to automate software deployment.
  • Automate the provisioning and management of infrastructure using Infrastructure as Code tools.
  • Define, implement, and maintain observability solutions to ensure proactive system monitoring and issue diagnosis.
  • Diagnose and resolve production issues, including performance tuning and capacity planning.

Requirements

  • Proficiency in reliability engineering and Python.
  • Good understanding of observability, including inputting probes to detect production issues.
  • Experiencewith Infrastructure as Code in cloud environments such as AWS, GCP, or Azure.

Additional Skills & Qualifications

  • Familiarity with programming languages.
  • Ability to understand and program applications.
  • Experiencein the Fintech/Payments industry.

Benefits & conditions

Join a collaborative and innovative environment where you are empowered to make impactful decisions. Our clients commitment to work-life balance, continuous learning, and career development ensures that you can grow personally and professionally while contributing to cutting-edge solutions.

Work Environment

Work in a technologically advanced environment using modern languages and public cloud platforms. Enjoy aflexiblework schedule that promotes a healthy work-life balance. The dress code is casual, reflecting our clients relaxed and inclusive culture.

Apply for this position