Site Reliability Engineer

TEKsystems

Charing Cross, United Kingdom

6 months ago

Role details

Contract type

Temporary contract

Employment type

Full-time (> 32 hours)

Working hours

Regular working hours

Languages

English

Job location

Charing Cross, United Kingdom

Tech stack

API

Amazon Web Services (AWS)

Build Automation

Azure

Databases

Monitoring of Systems

Python

Performance Tuning

Reliability Engineering

Software Deployment

Software Systems

Programming Languages

Job description

Our client is hiring a Site Reliability Engineer to join their Site Reliability Engineering group, whose main objective is to ensure their services are consistently reliable for their customers. As part of a dynamic and self-organizing team, you will have the opportunity to work with modern programming languages and manage software within public cloud environments. Our client's cross-functional teams cover the entire technology stack, from front ends to APIs and databases, providing the resources to build, deploy, and operate their own software.

Support applications in production, including responding to incidents and conducting post-incident reviews.
Apply observability engineering to proactively detect system degradation, understand system state, and quickly diagnose issues.
Investigate and resolve production issues effectively.
Build automation tools to reduce operational toil and enhance developer productivity.
Scope technical projects and break them down into user stories and tasks within an engineering team.
Directly contribute to the design and coding of software systems.
Contribute to building systems that are secure, reliable, scalable, and extensible.
Make informed technical decisions with input from teammates and engage in technical discussions with other engineering teams.
Build and maintain CI/CD pipelines to automate software deployment.
Automate the provisioning and management of infrastructure using Infrastructure as Code tools.
Define, implement, and maintain observability solutions to ensure proactive system monitoring and issue diagnosis.
Diagnose and resolve production issues, including performance tuning and capacity planning.

Requirements

Proficiency in reliability engineering and Python.
Good understanding of observability, including implementing probes to detect production issues.
Experience with Infrastructure as Code in cloud environments such as AWS, GCP, or Azure.

Additional Skills & Qualifications

Familiarity with programming languages.
Ability to understand and program applications.
Experience in the Fintech/Payments industry.

Benefits & conditions

Join a collaborative and innovative environment where you are empowered to make impactful decisions. Our client's commitment to work-life balance, continuous learning, and career development ensures that you can grow personally and professionally while contributing to cutting-edge solutions.

Work Environment

Work in a technologically advanced environment using modern languages and public cloud platforms. Enjoy a flexible work schedule that promotes a healthy work-life balance. The dress code is casual, reflecting our client's relaxed and inclusive culture.