Senior Site Reliability Engineer
Job description
The Platform & Reliability Engineering team is responsible for defining, measuring, and optimizing the key performance indicators of delivery customers. Your expertise in software engineering and systems administration will be instrumental in building robust and resilient infrastructure.
Partner with the best
In this role, you'll play a pivotal part in shaping the future of our products. You'll collaborate with product teams from the earliest stages of development to ensure the reliability, scalability, and performance of our systems. You'll define key performance indicators (KPIs), advance the state of monitoring, alerting, and operational response, and investigate complex performance issues.
As a Senior Site Reliability Engineer, you will be responsible for:
- Working on Internet technologies to improve the performance, availability, and scalability of large distributed content delivery systems
- Collaborating with cross-functional teams, including Product and Engineering, to define and establish measurable SLIs and SLOs
- Providing technical expertise and feedback to ensure system designs and implementations meet reliability and performance requirements
- Monitoring platform availability and performance, debugging issues by leveraging data analysis skills, and implementing corrective actions to avoid recurrence
- Developing and implementing automation solutions to improve operational efficiency and reduce toil
- Participating in design reviews and providing technical guidance to ensure designs meet requirements for scalability, performance, and robustness
Requirements
- Have 5 years of relevant experience and a Bachelor's degree in Computer Science or its equivalent
- Possess familiarity with Internet protocols (DNS, HTTP, TLS, TCP, etc.)
- Have experience utilizing Oracle SQL for data integrity checks, root cause analysis of data anomalies, and the development of data reports
- Demonstrate proficiency in scripting languages (Python, Bash, JavaScript, etc.)
- Have experience with monitoring and alerting systems (e.g., Prometheus, Grafana, ADBMS, Datadog), including metric collection, alerting, dashboarding, and troubleshooting
- Show fluency working in a UNIX/Linux computing environment
Work in a way that works for you
Benefits & conditions
Akamai is committed to fair and equitable compensation practices. For US-based candidates only: the base salary for this position ranges from $106,600 to $221,400 per year. A candidate's salary is determined by various factors including, but not limited to, relevant work experience, skills, certifications, and location. Compensation for candidates outside the US will vary. The compensation package may also include incentive compensation opportunities in the form of an annual bonus or incentives, equity awards, and an Employee Stock Purchase Plan (ESPP). Akamai provides industry-leading benefits including healthcare, a 401(k) savings plan, company holidays, vacation (in the form of PTO), sick time, family-friendly benefits including parental leave, and an employee assistance program with a focus on mental and financial wellness. Eligibility requirements apply.