Site Reliability Engineer - Observability Platform
Role details
Job location
Tech stack
Job description
The Site Reliability Engineer (SRE) supports the development and operation of the organization's Application Performance Monitoring (APM) and Observability Platform. This role focuses on expanding the Grafana Tempo distributed tracing system into a comprehensive platform that delivers visibility into application performance, service dependencies, and overall system reliability.
This position operates within the Cloud and Platform Engineering team and partners closely with application engineers to improve instrumentation, monitoring, and reliability practices across services.
Key ResponsibilitiesObservability Platform Development
-
Support the deployment, configuration, and scaling of Grafana Tempo for distributed tracing
-
Integrate application services with OpenTelemetry instrumentation
-
Build and maintain Grafana dashboards and visualizations to surface performance insights
-
Assist in the design and evolution of the APM and observability platform architecture
Reliability & Performance Monitoring
-
Develop dashboards and telemetry pipelines to monitor service health and performance
-
Analyze distributed traces to identify latency bottlenecks and reliability risks
-
Support the definition and monitoring of Service Level Indicators (SLIs) and Service Level Objectives (SLOs)
-
Contribute to operational reviews and continuous reliability improvement initiatives
Platform Engineering
- Support infrastructure deployment and automation for observability and monitoring systems
Requirements
-
Bachelor's degree in Computer Science, Engineering, or a related field required
-
3 years of experience with a Master's degree or 5 years of experience with a Bachelor's degree in:
-
Backend software development
-
Application performance monitoring (APM)
-
Site reliability engineering or production systems support
-
Proficiency working in Linux environments and using command-line tools
-
Programming or scripting experience (e.g., Go, Python, Bash)
-
Foundational understanding of networking and distributed systems
Preferred Qualifications
-
Experience with Kubernetes and containerized environments
-
Familiarity with observability tools such as Grafana, Prometheus, or similar platforms
-
Exposure to OpenTelemetry or distributed tracing systems
-
Experience with cloud platforms (AWS, Azure, or Google Cloud), BE AWARE OF FRAUD: When applying for a job at Jabil you will be contacted via correspondence through our official job portal with a jabil.com e-mail address; direct phone call from a member of the Jabil team; or direct e-mail with a jabil.com e-mail address. Jabil does not request payments for interviews or at any other point during the hiring process. Jabil will not ask for your personal identifying information such as a social security number, birth certificate, financial institution, driver's license number or passport information over the phone or via e-mail. If you believe you are a victim of identity theft, contact the Federal Bureau of Investigations internet crime hotline (www.ic3.gov), the Federal Trade Commission identity theft hotline (www.identitytheft.gov) and/or your local police department. Any scam job listings should be reported to whatever website it was posted in.
Benefits & conditions
Along with growth, stability, and the opportunity to be challenged, Jabil offers a competitive benefits package that includes:
-
Competitive Base Salary
-
Annual Bonus
-
Medical, Dental, Prescription Drug, and Vision Insurance with HRA and HSA options
-
401K Match
-
Employee Stock Purchase Plan
-
Paid Time Off
-
Tuition Reimbursement