Sr Site Reliability Engineer, Customer Systems
Apple Inc.
Austin, United States of America
2 days ago
Role details
Contract type: Permanent contract
Employment type: Full-time (> 32 hours)
Working hours: Regular working hours
Languages: English
Experience level: Senior
Job location: Austin, United States of America
Tech stack
Amazon Web Services (AWS)
Cloud Computing
Databases
Couchbase
DNS
Hypertext Transfer Protocols (HTTP)
Python
MongoDB
Network Protocols
Reliability Engineering
Ansible
Prometheus
Shell Script
Transmission Control Protocol (TCP)
Delivery Pipeline
Grafana
MTTR
Generative AI
Kubernetes
Storage Technologies
Information Technology
Low Latency
Cassandra
Hardware Infrastructure
Splunk
Job description
The Customer Systems Team is looking for an experienced Site Reliability Engineer. In this role, you will design, build, and deliver highly scalable, reliable, and secure cloud infrastructure that powers the applications and services used by Apple's customers every day. You will work closely with cross-functional teams, business leaders, and other partners across Apple to implement new solutions. If infrastructure as code, automation, and intelligent monitoring excite you, then this is the job for you.
Requirements
- 5+ years of experience in designing and building resilient, large-scale, low-latency cloud and on-prem infrastructure, including compute, storage, and network
- 3+ years of experience with deploying/managing Kubernetes using Helm
- Experience with Shell Scripting, Python, or Ansible
- Experience in monitoring using Splunk, Grafana, Prometheus, Alertmanager
- Deep understanding of networking protocols: DNS, TCP, HTTP/HTTPS
- Experience in setting up and managing CI/CD pipelines
- Bachelor's or Master's in Computer Science or equivalent experience
- Excellent problem solving, critical thinking, and interpersonal skills
- Good communication skills to collaborate with distributed teams
- Experience with Cassandra, MongoDB, or Couchbase databases, and with AWS S3 or similar storage technologies
- Experience in deploying, monitoring and supporting Java applications
- Experience with ArgoCD and the GitOps model
- Experience in defining, monitoring and achieving key operational targets such as MTTR and SLOs
- Experience with GenAI tools in workflow automation for infrastructure management
- Ability to learn new technologies in a short time
About the company
Imagine what you could do here. Apple is a place where extraordinary people gather to do their best work. Together we craft products and experiences people once couldn't have imagined - and now can't imagine living without. If you're motivated by the idea of making a real impact, and joining a team where we pride ourselves on being one of the most diverse and inclusive companies in the world, we'd love to hear from you!