Lead Site Reliability Engineer
EVER FORTH LLC
Charlotte, United States of America
2 days ago
Role details
Contract type
Temporary to permanent Employment type
Full-time (> 32 hours) Working hours
Regular working hours Languages
English Experience level
Senior Compensation
$ 154KJob location
Charlotte, United States of America
Tech stack
Tomcat
Application Performance Management
Systems Engineering
Confluence
JIRA
CA Workload Automation Ae
Azure
Cloud Computing
Linux
JMeter
Python
Load Testing
Lookup Table
Powershell
Reliability Engineering
Site Reliability Engineering Practices
Ansible
Shell Script
Datadog
Data Logging
Grafana
Containerization
Blazemeter
Kubernetes
Atlassian Tools
Kafka
Splunk
Appdynamics
Docker
Job description
- Drive the adoption of Site Reliability Engineering practices and culture.
- Build and evolve observability, monitoring, logging, synthetic monitoring, and chaos engineering capabilities.
- Design and deliver Grafana and Splunk dashboards that integrate multiple telemetry sources.
- Enable self-healing and autonomic capabilities through automation and analytics.
- Automate key SRE and IT Service Operations metrics, including customer impact, availability of critical business flows, SLO/SLI adherence, and error budgets.
- Integrate alerts with incident management, notification, and unified communications systems.
- Participate in 24x7 application support, on-call rotations, incident response, and remediation.
- Conduct blameless post-mortems and root cause analysis to introduce continuous improvement and eliminate repeat incidents.
Requirements
- 5-7 years of Infrastructure Engineering or Systems Engineering experience, supporting complex enterprise environments.
- Strong hands-on experience with Observability tooling, specifically Splunk (dashboards, reports, lookup tables, summary indexes) and Grafana.
- 4+ years of application production support experience.
- Strong experience supporting Linux-based platforms.
- 2+ years using Agile tools such as JIRA and/or Confluence.
- Ability to operate in a 24x7 support environment with on-call rotations.
Preferred Qualifications
- Experience with Site Reliability Engineering (SRE) practices.
- Azure cloud experience, especially with observability in cloud-native environments.
- Automation experience using Python, Ansible, PowerShell, or Shell scripting.
- Experience with AIOps tooling (e.g., MoogSoft, BigPanda).
- Experience with container platforms such as Kubernetes or Docker.
- Application performance and load testing experience with tools like BlazeMeter, JMeter, or AppDynamics.
- Experience with Kafka, MQ messaging, Tomcat, or Autosys batch management.
Benefits & conditions
The pay rate for this position is between $69.00 and $74.00 per hour. Contract employees are eligible for benefits, including medical, dental, and vision insurance options.
About the company
Everforth Apex is a world-class IT services company that serves thousands of clients across the globe. When you join Everforth Apex, you become part of a team that values innovation, collaboration, and continuous learning. We offer quality career resources, training, certifications, development opportunities, and a comprehensive benefits package. Our commitment to excellence is reflected in many awards, including ClearlyRated's Best of Staffing in Talent Satisfaction in the United States and Great Place to Work in the United Kingdom and Mexico. Everforth Apex uses a virtual recruiter as part of the application process. Click for more details.