Site Reliability Engineer (SRE)

Apex Systems LLC
Plano, United States of America
4 days ago

Role details

Contract type
Temporary contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Senior
Compensation
$ 153K

Job location

Plano, United States of America

Tech stack

Proxy Servers
Application Services
Continuous Delivery
Continuous Integration
Linux
DNS
Elasticsearch
Perl
Design of User Interfaces
Monitoring of Systems
Python
Reliability Engineering
Site Reliability Engineering Practices
Logstash
Ansible
Shell Script
Software Engineering
Load Balancing
Cloud Platform System
Firewalls (Computer Science)
GIT
Kibana
Terraform
Splunk
Dynatrace
Jenkins

Job description

We are seeking a Site Reliability Engineer (SRE) to operate hands-on across the stack to improve platform and application observability, drive reliability improvements, and deliver measurable gains in operational efficiency. This role will work closely with core teams to execute platform modernization, harden production systems, and evolve support tooling. This position is critical to maintaining execution velocity, reducing operational risk, and ensuring reliability and performance objectives are met., * Collaborate with engineers and architects to design, develop, test, and implement secure, robust, and scalable solutions for applications and platforms.

  • Design and implement deployment approaches using automated continuous integration and continuous delivery pipelines.
  • Take responsibility for all aspects of reliability, collaborating with technical experts to resolve complex problems.
  • Utilize SRE practices, service level indicators, and service level objectives to proactively resolve issues.
  • Gather, analyze, and develop visualizations from large, diverse data sets to support continuous platform improvement.
  • Identify opportunities to eliminate toil and automate the triage of issues to improve operational stability.
  • Collaborate with a global team to identify, analyze, and resolve platform vulnerabilities.
  • Promote the adoption of site reliability engineering best practices within the team and organization.

Requirements

Experience: A minimum of 5 years of combined experience in SRE, software development, or infrastructure engineering.

Technical Skills:

  • Experience in implementing, monitoring, and maintaining highly scalable and resilient application services and platforms.
  • Experience with monitoring tools such as OpenTelemetry (OTel), ELK (Elasticsearch, Logstash, Kibana), Splunk, and Dynatrace.
  • Knowledge of Python, Shell, or Perl scripting.
  • Proficiency in implementing CI/CD pipelines with tools such as Git and Jenkins.
  • Advanced knowledge of networking, including firewalls, DNS, Load Balancing, and Proxies.
  • Advanced understanding of the Linux operating system, including shell scripting and core commands for automation.
  • Experience with Ansible for writing playbooks and using core modules.

Professional Skills:

  • Excellent interpersonal, organizational, and communication skills are required.
  • Must be self-motivated and results-oriented with analytical and problem-solving skills.

Preferred Qualifications

  • UI/UX experience to provide oversight on best practices for tooling.
  • Hands-on experience with Terraform for Infrastructure as Code (IaC).
  • Background in a large enterprise environment.
  • Ability to analyze and resolve complex infrastructure issues.
  • Capacity to work in a fast-paced environment and meet deadlines.

Benefits & conditions

The pay rate for this position is $73.68 per hour. Contract employees are eligible for benefits, including medical, dental, and vision insurance options.

About the company

Apex Systems is a world-class IT services company that serves thousands of clients across the globe. When you join Apex, you become part of a team that values innovation, collaboration, and continuous learning. We offer quality career resources, training, certifications, development opportunities, and a comprehensive benefits package. Our commitment to excellence is reflected in many awards, including ClearlyRated's Best of Staffing in Talent Satisfaction in the United States and Great Place to Work in the United Kingdom and Mexico. Apex uses a virtual recruiter as part of the application process. Click for more details.

Apply for this position