Senior Site Reliability Engineer (SRE) - Operations

SS&C Technologies, Inc.

Kansas City, United States of America

12 days ago

Role details

Contract type

Permanent contract

Employment type

Full-time (> 32 hours)

Working hours

Regular working hours

Languages

English

Experience level

Senior

Job location

Remote

Kansas City, United States of America

Tech stack

Microsoft Windows

Amazon Web Services (AWS)

Systems Engineering

Azure

Bash

Cloud Computing

Configuration Management

Computer Programming

DevOps

Disaster Recovery

Distributed Systems

DNS

Python

Reliability Engineering

Ansible

Prometheus

Shell Script

Datadog

Data Logging

Computer Networking Systems

Google Cloud Platform

Load Balancing

Grafana

Containerization

Kubernetes

Information Technology

Puppet

Terraform

Splunk

Docker

VMware

Microservices

Job description

We are seeking a highly skilled Site Reliability Engineer (SRE) to join our Operations team. In this role, you will be responsible for ensuring the availability, performance, scalability, and reliability of our systems and services. You will work closely with infrastructure, Engineering, DevOps, and security teams to build robust systems, automate operations, and implement best practices for incident response, monitoring, and disaster recovery., + Maintain and improve the uptime, performance, and availability of production systems.

Define and track SLIs, SLOs, and SLAs to ensure service reliability and user satisfaction.

Monitoring & Incident Response

Implement and manage monitoring, alerting, and observability tools (e.g., Prometheus, Grafana, Datadog, ELK).
Participate in on-call rotations and respond to incidents, performing root cause analysis and postmortems.

Automation & Tooling

Automate repetitive tasks and processes using scripts, configuration management, and Infrastructure as Code (IaaC).
Develop CI/CD pipelines to streamline deployment and operational processes.

Capacity Planning & Scaling

Analyze system performance and capacity trends to plan for future growth.
Collaborate with engineering teams to design systems that scale reliably.

Infrastructure Management

Support cloud and/or hybrid infrastructure (AWS, Azure, GCP, VMware, etc.).
Manage system provisioning, configuration, and patching via tools such as Ansible, Terraform, or Puppet.

Collaboration & Culture

Act as a bridge between development and operations teams, championing DevOps and SRE principles.
Contribute to a culture of continuous improvement, reliability, and accountability.

Requirements

Bachelor's degree in Computer Science, Engineering, or related field (or equivalent experience).
3+ years of experience in a Site Reliability, DevOps, or Systems Engineering role.
Experience with Linux/Unix systems, Windows, shell scripting, and administration.
Proficiency in at least one programming/scripting language (Python, Go, Bash, etc.).
Hands-on experience with cloud platforms (AWS, Azure, or GCP).
Strong knowledge of networking, security, load balancing, and DNS.
Experience with monitoring/logging tools (e.g., Prometheus, Grafana, ELK, Splunk, Datadog).

Preferred Qualifications:

Experience with containerization and orchestration tools (e.g., Docker, Kubernetes).
Familiarity with ITIL processes, incident/change/problem management frameworks.
Exposure to compliance and security standards (e.g., ISO 27001, SOC 2, HIPAA).
Experience in large-scale distributed systems and microservices architectures.

Soft Skills:

Strong analytical and problem-solving skills.
Excellent communication and collaboration abilities.
Calm under pressure, especially during incidents and outages.
Passion for automation, continuous improvement, and innovation.

Benefits & conditions

Flexibility: Hybrid Work Model & a Business Casual Dress Code, including jeans
Your Future: 401k Matching Program, Professional Development Reimbursement
Work/Life Balance: Flexible Personal/Vacation Time Off, Sick Leave, Paid Holidays
Your Wellbeing: Medical, Dental, Vision, Employee Assistance Program, Parental Leave
Wide Ranging Perspectives: Committed to Celebrating the Variety of Backgrounds, Talents, and Experiences of Our Employees
Training: Hands-On, Team-Customized, including SS&C University
Extra Perks: Discounts on fitness clubs, travel and more!, Unless explicitly requested or approached by SS&C Technologies, Inc. or any of its affiliated companies, the company will not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services.

SS&C Technologies offers a comprehensive total rewards package designed to support your wellbeing, growth, and future. Our benefits include medical, dental, and vision coverage; a 401(k) plan with company match; paid time off, holidays, and parental leave; and professional development reimbursement opportunity.

About the company

As a leading financial services and healthcare technology company based on revenue, SS&C is headquartered in Windsor, Connecticut, and has 27,000+ employees in 35 countries. Some 20,000 financial services and healthcare organizations, from the world's largest companies to small and mid-market firms, rely on SS&C for expertise, scale, and technology.

Role details

Job location

Tech stack

Job description

Requirements

Benefits & conditions

About the company

Apply for this position

Good distractions

Moments

Videos View all