Senior Site Reliability Engineer

SS&C Technologies, Inc.

Charing Cross, United Kingdom

2 days ago

Role details

Contract type

Permanent contract

Employment type

Full-time (> 32 hours)

Working hours

Regular working hours

Languages

English

Experience level

Senior

Job location

Charing Cross, United Kingdom

Tech stack

Microsoft Windows

Artificial Intelligence

Airflow

Application Layers

Bash

Linux

Monitoring of Systems

Python

Korn Shell

PostgreSQL

Microsoft SQL Server

MongoDB

NoSQL

Oracle Applications

PowerShell

Reliability Engineering

Ansible

Prometheus

Shell Script

Software Engineering

Scripting (Bash/Python/Go/Ruby)

Large Language Models

Grafana

MTTR

Containerization

Kubernetes

Information Technology

Cassandra

Terraform

Job description

Operated within the SS&C WIT business, Genesis is an all-new investment operations platform that provides extensive asset class and functional support across the front, middle, and back office. Built natively for the cloud with advanced technology, Genesis features an innovative user experience, actionable monitors, notifications, and alerts infused with AI.

Maintain shared ownership for providing production-level resilience and reliability for business-critical systems.
Leverage industry-standard observability technologies to provide a centralized view of system and service health.
Implement and continually improve monitoring and alerting based on harvested logs, metrics and traces.
Lead incident response, post-incident reviews and post-remediation improvements.
Define and establish KPIs, SLIs and SLOs in support of agreed service levels.
Develop and maintain automation, and leverage generative AI technologies to reduce operational toil and improve MTTD and MTTR.
Take on support for additional technical service components as the service evolves.
Support, mentor and train SRE Engineers.
Work with other teams to maintain a sound knowledge of all aspects of the application technical architecture.
Contribute to building up and maintaining a knowledge base in support of the technical role.
Maintain an awareness of, comply with and champion the stated service controls required to achieve audit compliance.

Requirements

The role requires an in-depth knowledge of observability principles and strong experience in implementing the observability stack across infrastructure, data and application layers for real-time, compute-intensive, distributed environments. The Senior SRE Engineer will have a solid understanding of cloud platforms and container orchestration. They will have a comprehensive grasp of incident management and operational risk mitigation, and experience in implementing automation frameworks to minimize toil and reduce MTTD/MTTR. They will have proven experience in using infrastructure as code and familiarity with AI-driven operational tooling. They will be a logical thinker with strong problem-solving and communication skills and a desire to effect continuous improvement.

Bachelor's degree in Computer Science, Software Engineering, or a related field.
ITIL foundation level or experience working in an ITIL framework preferred.
4+ years of Linux OS and Windows OS systems management experience.
4+ years of experience with observability technologies for system monitoring and alerting (e.g. Prometheus, Grafana, Loki).
2+ years working in a team environment with operational responsibilities for client-facing applications.
2+ years of experience with containerization technologies and Kubernetes.
Proven scripting skills in at least one of: Linux shell scripting (csh, ksh or Bash), Windows PowerShell, Ansible, Terraform or Python.
Working experience with workload automation / enterprise scheduling tools such as Airflow.
Working experience with, and a technical understanding of, NoSQL DBs such as MongoDB/Cassandra and traditional relational DBs such as SQL Server/Oracle/PostgreSQL.
Working experience of a cloud self-service environment.
Working experience of LLM or AI usage in monitoring and observability stacks.

About the company

As a leading financial services and healthcare technology company based on revenue, SS&C is headquartered in Windsor, Connecticut, and has 27,000+ employees in 35 countries. Some 20,000 financial services and healthcare organizations, from the world's largest companies to small and mid-market firms, rely on SS&C for expertise, scale, and technology.

SS&C is a global financial technology and software-enabled services company that provides mission-critical solutions primarily to the financial services and healthcare industries. It is headquartered in Windsor, Connecticut, USA, and is publicly listed on the NASDAQ. SS&C is widely regarded as one of the largest administrators of hedge fund and private equity operations and the largest mutual fund transfer agency globally.

Unless explicitly requested or approached by SS&C Technologies, Inc. or any of its affiliated companies, the company will not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services.

SS&C Technologies is an Equal Employment Opportunity employer and does not discriminate against any applicant for employment or employee on the basis of race, color, religious creed, gender, age, marital status, sexual orientation, national origin, disability, veteran status or any other classification protected by applicable discrimination laws.