Lead Performance Engineer
Job description
Insight Global is seeking a Lead Performance Reliability Engineer to join their Production Support organization. This is a critical, high-visibility individual contributor role responsible for proactively identifying and resolving performance issues across enterprise systems before they impact the business.
This position was created to fill a longstanding gap within the organization by bringing dedicated ownership to performance monitoring across a complex ecosystem of Oracle and non-Oracle applications. The ideal candidate will bring a strong background in performance engineering, observability, and system diagnostics across multiple layers of architecture.
What You'll Do
Proactively monitor production environments to identify performance bottlenecks, latency issues, and system inefficiencies before they are reported by end users
Analyze system behavior and performance metrics (e.g., response times, percentiles, baselines) to detect anomalies and recommend improvements
Investigate performance issues across multiple layers, including:
Application (Oracle and non-Oracle systems)
Integration and APIs
Network and infrastructure
Hardware
Partner cross-functionally with application, infrastructure, and networking teams to isolate root causes and drive resolution
Support both ongoing production operations and upcoming project go-lives, ensuring performance readiness and stability
Interpret performance monitoring data and translate insights into actionable recommendations for technical teams and leadership
Provide real-time updates and visibility during high-impact production incidents ("hot seat" environment)
Salary Range: $140k-$154k
Requirements
6-10+ years of experience in performance engineering, performance monitoring, or site reliability engineering
Strong expertise in analyzing performance metrics, including:
Response time analysis (mean vs. median vs. percentile)
KPI tracking and baselining
Experience working with observability and monitoring tools such as:
Dynatrace, Grafana, Elastic, OpenTelemetry, Loki, Tempo (or similar)
Solid understanding of:
HTTP protocols and request/response cycles
Networking fundamentals and data flow across systems
Modern architectures (microservices, APIs, distributed systems)
Proven ability to troubleshoot performance issues across multiple layers of technology, not just within a single application stack
Strong analytical and problem-solving skills with the ability to diagnose complex system behavior
Familiarity with most of the following tools: Dynatrace, OpenTelemetry, Loki, Tempo, Grafana, Elastic
Experience supporting Oracle environments, especially:
Oracle Fusion Cloud (Finance, Supply Chain)
Oracle EBS (R12/11i)
Oracle Integration Cloud (OIC)
PL/SQL experience
Background in large-scale ERP environments (Oracle, SAP, PeopleSoft, etc.)
Experience in production support or L3 support environments with a strong performance focus
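For candidates gauging the level of metrics analysis the requirements describe, here is a minimal Python sketch of summarizing response times and comparing them to a baseline. It is illustrative only and not part of the role's tooling: the function names, the nearest-rank percentile method, the sample data, and the 20% tolerance are all assumptions.

```python
import statistics


def summarize(samples_ms):
    """Return mean, median, p95 and p99 for a list of response times (ms)."""
    ordered = sorted(samples_ms)
    n = len(ordered)

    def percentile(p):
        # Nearest-rank method with integer math: rank = ceil(p * n / 100).
        rank = max(1, (p * n + 99) // 100)
        return ordered[rank - 1]

    return {
        "mean": statistics.fmean(ordered),
        "median": statistics.median(ordered),
        "p95": percentile(95),
        "p99": percentile(99),
    }


def regressions(current, baseline, tolerance=0.20):
    """Names of metrics whose current value exceeds baseline by more than tolerance."""
    return [
        name
        for name, value in current.items()
        if value > baseline[name] * (1 + tolerance)
    ]


# Hypothetical sample: 95 fast requests and 5 very slow ones.
samples = [100] * 95 + [2000] * 5
# Hypothetical baseline values, in milliseconds.
baseline = {"mean": 150, "median": 100, "p95": 100, "p99": 400}

current = summarize(samples)
flagged = regressions(current, baseline)
```

In this sample the median and p95 stay at the fast value while the mean and p99 are pulled up by the slow tail, which is why percentiles are tracked alongside averages when baselining.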