Platform Engineering & Production Support

Everforth Apex
Charlotte, United States of America
6 days ago

Role details

Contract type
Temporary contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Intermediate

Job location

Charlotte, United States of America

Tech stack

Java
Relational Databases
DevOps
Distributed Systems
Python
OpenShift
Red Hat Enterprise Linux (RHEL)
Release Management
Reliability Engineering
Site Reliability Engineering Practices
Prometheus
Cloud Platform System
React
Grafana
Spring Boot
Build Management
Kubernetes
Kafka
Splunk
AppDynamics
ServiceNow
Microservices

Job description

We are seeking a Principal Engineer for a platform engineering team. This role is responsible for stabilizing, scaling, and operating applications as they approach production release. The position requires a professional with a strong background in DevOps and Site Reliability Engineering (SRE), with expertise in observability, incident management, and cloud platforms. The individual must be prepared to operate in a fast-paced, production-critical environment.

  • Lead production support efforts for a portfolio of over 20 applications to ensure stability and performance.
  • Design and build monitoring, alerting, and observability dashboards using tools such as Splunk, Grafana, AppDynamics, and Prometheus.
  • Identify risks through gap analysis, anomaly detection, and predictive alerting to prevent production incidents.
  • Troubleshoot complex production issues across distributed microservices environments.
  • Drive the adoption of modern SRE practices, including automation and intelligent monitoring solutions.
  • Support applications running on OpenShift and cloud-native platforms, focusing on reliability and scalability.
  • Collaborate with development teams during release cycles to provide production-readiness guidance.
  • Participate in a 24x7 on-call rotation to address incidents.
  • Mentor engineers to elevate team capabilities in SRE, DevOps, and platform engineering.

Requirements

Experience: 10+ years of experience in platform engineering and production support.

Technical Skills:

  • 5+ years with Red Hat Linux, OpenShift, Kubernetes, Java, microservices, Spring Boot, and Python.
  • 5+ years of experience creating observability dashboards with Grafana, Splunk, and AppDynamics.
  • 5+ years of experience with observability alerts and incident handling, including AIOps, ServiceNow, or BigPanda.
  • 4+ years with React.js, Apache Kafka, and relational databases.
  • 4+ years with distributed systems, microservices architectures, and cloud-native platforms.

Preferred Qualifications

  • Experience in the financial services industry.
  • A background in development, particularly within Java-based ecosystems.
  • Experience with AIOps tools like ServiceNow or BigPanda.
  • Familiarity with Kafka and React.js.

About the company

Everforth Apex is a world-class IT services company that serves thousands of clients across the globe. When you join Everforth Apex, you become part of a team that values innovation, collaboration, and continuous learning. We offer quality career resources, training, certifications, development opportunities, and a comprehensive benefits package. Our commitment to excellence is reflected in many awards, including ClearlyRated's Best of Staffing in Talent Satisfaction in the United States and Great Place to Work in the United Kingdom and Mexico. Everforth Apex uses a virtual recruiter as part of the application process.
