Cloud Network Reliability Engineer

Apple Inc.
Sunnyvale, United States of America
4 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Compensation
$ 318K

Job location

Sunnyvale, United States of America

Tech stack

Adobe InDesign
Systems Engineering
Cloud Computing
Distributed Systems
Fault Tolerance
Protocol Buffers
Monitoring of Systems
Hardware Virtualization
OSI Models
JSON
Network Configuration and Change Management
Network Architecture
Network Control
Network Service
OpenStack
Service Discovery
Software Engineering
System Programming
XML
Data Logging
Cloud-native Network Functions (CNF)
Multithreading
Concurrency
Mttr
Multi-Cloud
Caching
Kubernetes
Infrastructure Automation Frameworks
SDN Network
Low Latency
Build Tools
Api Design
REST

Job description

We are seeking an experienced and visionary Cloud Network Reliability Engineer to drive the technical strategy and execution for ensuring the availability, performance, scalability, and resiliency of Apple's global network services. In this role, you will work as a technical leader solving complex networking challenges at massive scale, partnering with engineering, infrastructure, and operations teams across Apple to deliver reliable, fault-tolerant systems.., As a technical leader within the Cloud Networking organization, you will define and drive the reliability and resiliency architecture for Apple's network platform services. You will be responsible for establishing SRE and SWE best practices, architecting fault-tolerant network control and data planes, and championing data-driven decision-making through observability and automation.

You will drive resilient cloud networking solutions that operate reliably across multiple cloud providers and global regions, handling failures gracefully and maintaining service availability. Your technical leadership will ensure Apple's network services meet demanding availability, latency, resilience, and security requirements while continuously improving operational maturity.

We are looking for a technical expert who deeply understands cloud networking at scale, is passionate about operating mission-critical, globally distributed infrastructure, preventing outages through proactive engineering, and driving long-term reliability improvements through architectural excellence.

","responsibilities":"Define and drive the long-term technical vision, architecture, and reliability strategy for large-scale cloud networking platforms spanning control plane and data plane systems.

Architect and evolve fault-tolerant, highly available network services, ensuring graceful degradation and consistent performance under partial and systemic failure scenarios.

Establish platform-wide resiliency patterns including service discovery, health checking, automated failover, rate limiting, circuit breaking, and traffic management across multi-region and multi-cloud environments.

Lead the design of network configuration management, routing state distribution, traffic engineering, and capacity planning systems, balancing scalability, correctness, and operational simplicity.

Serve as a senior technical authority and architectural reviewer, influencing critical design decisions across multiple teams and ensuring network failure modes are explicitly addressed.

Build and champion automation-first reliability solutions, including topology discovery, deployment safety mechanisms, self-healing systems, and operational tooling that reduce toil and improve MTTR.

Define and own reliability metrics and observability standards (SLIs, SLOs, error budgets), using data to drive engineering trade-offs, reliability investments, and incident response improvements.

Multiply impact through cross-team technical leadership, embedding reliability early in design, mentoring engineers, and sharing deep technical knowledge through documentation and technical talks.

Requirements

Do you have experience in Systems engineering?, Expert knowledge of API design and interface technologies (JSON, ProtoBuf, REST, RPC, XML, etc)

In depth knowledge of K8s, OpenStack, system virtualization, build systems and infrastructure as code

Strong knowledge of observability systems (metrics, logging, tracing) and qualification engineering.

Broad knowledge of networking solutions across OSI layers 3 through 7.

Excellent written and verbal communication skills with the ability to clearly articulate risk, reliability trade-offs, and operational priorities.

Proven ability to manage competing priorities, drive initiatives to completion, and deliver results in fast-paced environments.

Minimum Qualifications

Extensive experience in software engineering, systems engineering, or infrastructure engineering.

Strong background in designing, operating, and supporting highly available, fault-tolerant distributed systems at hyper scale.

Strong systems programming skills including multi-threading, concurrency, caching, batching

Solid understanding of network infrastructure and software-defined networking (SDN).

Ability to lead cross-functional collaboration and influence technical decisions across teams.

Benefits & conditions

4.14.1 out of 5 stars Sunnyvale, CA $212,000 - $318,400 a year, Pulled from the full job description

  • Employee stock purchase plan
  • Health insurance
  • Retirement plan
  • Dental insurance
  • RSU, At Apple, base pay is one part of our total compensation package and is determined within a range. This provides the opportunity to progress as you grow and develop within a role. The base pay range for this role is between $212,000 and $318,400, and your base pay will depend on your skills, qualifications, experience, and location.

Apple employees also have the opportunity to become an Apple shareholder through participation in Apple's discretionary employee stock programs. Apple employees are eligible for discretionary restricted stock unit awards, and can purchase Apple stock at a discount if voluntarily participating in Apple's Employee Stock Purchase Plan. You'll also receive benefits including: Comprehensive medical and dental coverage, retirement benefits, a range of discounted products and free services, and for formal education related to advancing your career at Apple, reimbursement for certain educational expenses - including tuition. Additionally, this role might be eligible for discretionary bonuses or commission payments as well as relocation. Learn more about Apple Benefits

Note: Apple benefit, compensation and employee stock programs are subject to eligibility requirements and other terms of the applicable plan or program.

About the company

Apple Cloud Networking team builds and operates large-scale, software-defined networking platforms that enable secure, resilient, and highly available multi-cloud connectivity with a global footprint. Our infrastructure powers critical Apple services such as iCloud, iTunes, Siri, and Maps.

Apply for this position