Sr. Site Reliability Engineer (SRE)

Cognizant Technology Solutions Corporation
Plano, United States of America
3 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Senior

Job location

Plano, United States of America

Tech stack

Amazon Web Services (AWS)
Business Software
Cloud Computing
DevOps
Fault Tolerance
Github
Reliability Engineering
Prometheus
Software Engineering
Datadog
Cloud Platform System
Grafana
Reliability of Systems
Kubernetes
Information Technology
Terraform
Splunk
Software Version Control
Docker
ELK

Job description

As a Senior Site Reliability Engineer , you will make an impact by designing and operating scalable, resilient, and highly available cloud-native platforms that support critical business applications. You will be a valued member of a cross-functional engineering team and work collaboratively with software engineers, architects, and product partners to embed reliability, automation, and performance best practices across the delivery lifecycle.

In this role, you will:

  • Design and implement scalable, fault-tolerant architectures aligned with business, performance, and reliability objectives
  • Embed Site Reliability Engineering (SRE) and DevOps principles into application development and operations workflows
  • Build and maintain infrastructure and automation using tools such as Terraform, Docker, and Kubernetes to reduce manual effort and improve system resilience
  • Implement and manage CI/CD pipelines and source control practices using GitHub to streamline development and deployment
  • Monitor, measure, and optimize system health using observability platforms such as Datadog, Prometheus, Grafana, Splunk, and the ELK stack

Work model

We believe hybrid work is the way forward as we strive to provide flexibility wherever possible. Based on this role's business requirements, this is a hybrid position requiring 2-3 days per week in a client or Cognizant office in Plano, TX . Regardless of your working arrangement, we are here to support a healthy work-life balance through our various wellbeing programs.

The working arrangements for this role are accurate as of the date of posting. This may change based on the project you're engaged in, as well as business and client requirements. Rest assured; we will always be clear about role expectations.

Requirements

  • Strong experience in Site Reliability Engineering (SRE) and/or DevOps supporting large-scale, cloud-native systems
  • 10+ years of experience
  • Hands-on expertise with AWS and container orchestration technologies such as Docker and Kubernetes
  • Proficiency with infrastructure-as-code and automation tools, including Terraform
  • Experience implementing monitoring, alerting, and incident response using observability tools such as Datadog, Prometheus, Grafana, Splunk, and the ELK stack
  • Bachelor's degree in Computer Science, Information Technology, or a related field

These will help you stand out

  • AWS Certified Solutions Architect or equivalent cloud certification
  • Experience implementing reliability best practices such as SLIs, SLOs, error budgets, and proactive incident management
  • Strong analytical and troubleshooting skills with the ability to diagnose complex infrastructure and application issues
  • Experience mentoring engineers and contributing to a culture of continuous improvement and knowledge sharing

Benefits & conditions

The annual salary for this position depends on experience and other qualifications of the successful candidate., Cognizant offers the following benefits for this position, subject to applicable eligibility requirements:

  • Medical/Dental/Vision/Life Insurance
  • Paid holidays plus Paid Time Off
  • 401(k) plan and contributions
  • Long-term and Short-term Disability
  • Paid Parental Leave
  • Employee Stock Purchase Plan

Disclaimer

The salary, other compensation, and benefits information is accurate as of the date of this posting. Cognizant reserves the right to modify this information at any time, subject to applicable law.

Apply for this position