Site Reliability Engineer (Edge Services), Infrastructure Services

Apple Inc.
Austin, United States of America
2 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Compensation
$ 132K

Job location

Austin, United States of America

Tech stack

Artificial Intelligence
Amazon Web Services (AWS)
Azure
System Configuration
Continuous Integration
Data Structures
Software Debugging
Distributed Systems
Python
Linux kernel
Performance Tuning
Release Management
Reliability Engineering
Ansible
Prometheus
Service Design
Software Engineering
Pulumi
Cloud Platform System
Grafana
Generative AI
Containerization
Kubernetes
Vertica
Terraform

Job description

We are seeking a proactive Site Reliability Engineer to champion the evolution of our production ecosystems. In this role, you will help drive the vision for our visibility, moving beyond simple uptime metrics to build a sophisticated, data-driven reliability framework. You will play a pivotal role in ensuring our services are resilient, scalable, and observable, bridging the gap between complex distributed systems and seamless user experiences., As a key member of the SRE team, your mission is to treat operations as a software problem. You will focus on designing and implementing a next-generation observability and alerting strategy that prioritizes high-cardinality data and meaningful signals over noise. You will spend your time building "self-healing" systems, reducing toil through aggressive automation, and partnering with development teams to bake reliability into the CI/CD pipeline. Your goal is to move us toward a proactive stance where performance bottlenecks are identified and mitigated before they impact the customer.

Requirements

Understanding of Linux internals and deep networking expertise, including HTTP/2, HTTP/3 (QUIC), and HTTPS/TLS. You should be comfortable debugging protocol-level issues and optimizing traffic flow.

Proven ability to automate repetitive tasks and complex workflows using Python or Go

Experience configuring and managing modern monitoring suites (e.g., Prometheus, Grafana, ClickHouse) with a focus on creating actionable, high-signal quality alerting.

Grasp of Data Structures and Algorithms (DSA) to write efficient, performant code and troubleshoot complex system bottlenecks.

Practical knowledge of SLIs, SLOs, Error Budgets, Release Management and Incident Management to drive engineering priorities.

Preferred Qualifications

Experience managing cloud environments (AWS, GCP, or Azure) using Terraform, Ansible, or Pulumi.

Orchestration: Hands-on experience scaling and securing containerized workloads via Kubernetes.

A track record of leading "blameless post-mortems" and using those insights to harden the system against future failures.

Ability to consult with product teams on service design to improve long-term maintainability.

A proactive engineering mindset focused on shifting from "fixing things when they break" to "designing things so they don't break" (or so they fail gracefully).

Practical fluency in applying Generative AI tools within SRE and software engineering workflows - from accelerating observability query construction and alert design to building AI-assisted debugging and triage capabilities that encode institutional knowledge into repeatable, context-aware workflows - with the engineering rigour to validate, own, and iterate on AI-assisted outputs in production-adjacent contexts

Benefits & conditions

At Apple, base pay is one part of our total compensation package and is determined within a range. This provides the opportunity to progress as you grow and develop within a role. The base pay range for this role is between $132,100 and $244,600, and your base pay will depend on your skills, qualifications, experience, and location.

Apple employees also have the opportunity to become an Apple shareholder through participation in Apple's discretionary employee stock programs. Apple employees are eligible for discretionary restricted stock unit awards, and can purchase Apple stock at a discount if voluntarily participating in Apple's Employee Stock Purchase Plan. You'll also receive benefits including: Comprehensive medical and dental coverage, retirement benefits, a range of discounted products and free services, and for formal education related to advancing your career at Apple, reimbursement for certain educational expenses - including tuition. Additionally, this role might be eligible for discretionary bonuses or commission payments as well as relocation. Learn more about Apple Benefits

Note: Apple benefit, compensation and employee stock programs are subject to eligibility requirements and other terms of the applicable plan or program.

Apply for this position