Lead Performance Engineer

Insight Global
Cleveland, United States of America
4 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Senior
Compensation
$154K

Job location

Cleveland, United States of America

Tech stack

API
Architectural Patterns
Cloud Computing
Distributed Systems
Monitoring of Systems
Hypertext Transfer Protocols (HTTP)
Networking Basics
Oracle Applications
Reliability Engineering
SAP Applications
PL/SQL
Oracle Fusion Middleware
Data Streaming
Grafana
PeopleSoft
Performance Monitor
Oracle E-Business Suite
Oracle Integration
Dynatrace
Microservices

Job description

Insight Global is seeking a Lead Performance Reliability Engineer to join their Production Support organization. This is a critical, high-visibility individual contributor role responsible for proactively identifying and resolving performance issues across enterprise systems before they impact the business.

This position was created to fill a longstanding gap within the organization: dedicated ownership of performance monitoring across a complex ecosystem of Oracle and non-Oracle applications. The ideal candidate will bring a strong background in performance engineering, observability, and system diagnostics across multiple layers of architecture.

What You'll Do

Proactively monitor production environments to identify performance bottlenecks, latency issues, and system inefficiencies before they are reported by end users

Analyze system behavior and performance metrics (e.g., response times, percentiles, baselines) to detect anomalies and recommend improvements

Investigate performance issues across multiple layers, including:

Application (Oracle and non-Oracle systems)

Integration and APIs

Network and infrastructure

Hardware

Partner cross-functionally with application, infrastructure, and networking teams to isolate root causes and drive resolution

Support both ongoing production operations and upcoming project go-lives, ensuring performance readiness and stability

Interpret performance monitoring data and translate insights into actionable recommendations for technical teams and leadership

Provide real-time updates and visibility during high-impact production incidents ("hot seat" environment)

Salary Range: $140k-$154k

Requirements

6-10+ years of experience in performance engineering, performance monitoring, or site reliability engineering

Strong expertise in analyzing performance metrics, including:

Response time analysis (mean vs. median vs. percentile)

KPI tracking and baselining

Experience working with observability and monitoring tools such as:

Dynatrace, Grafana, Elastic, OpenTelemetry, Loki, Tempo (or similar)

Solid understanding of:

HTTP protocols and request/response cycles

Networking fundamentals and data flow across systems

Modern architectures (microservices, APIs, distributed systems)

Proven ability to troubleshoot performance issues across multiple layers of technology, not just within a single application stack

Strong analytical and problem-solving skills with the ability to diagnose complex system behavior

Familiarity with most of the following tools: Dynatrace, OpenTelemetry, Loki, Tempo, Grafana, Elastic

Experience supporting Oracle environments, especially:

Oracle Fusion Cloud (Finance, Supply Chain)

Oracle EBS (R12/11i)

Oracle Integration Cloud (OIC)

PL/SQL experience

Background in large-scale ERP environments (Oracle, SAP, PeopleSoft, etc.)

Experience in production support or L3 support environments with a strong performance focus
