Senior Site Reliability Engineer
Job description
As a Senior Site Reliability Engineer, you'll help us balance development velocity with the reliability our customers depend on. You'll partner with engineering teams to shape how their services are measured, lead the work to improve them, and use what you learn from production to build the automation and agentic tooling that improves reliability globally. You'll work fluently with agentic development tools as part of your everyday practice, using them to move faster, to investigate harder problems, and to multiply your impact.
This is a senior individual contributor role at the intersection of Engineering, Product, Customer Success and Technical Support, where you'll play a meaningful part in shaping how we practice SRE at Jamf.
- Partner with engineering teams to define service-level objectives, error budgets, and supporting indicators for their services, and help them use those measures to inform prioritization and reliability investment.
- Investigate complex production issues end-to-end across application, data, infrastructure, and network layers, using AI to correlate logs, metrics, and code and to pressure-test hypotheses before acting.
- Produce clear technical documentation, runbooks, architecture notes, postmortems and proofs of concept for both technical and non-technical audiences, in a form that engineers and AI tools can re-use.
- Identify systemic sources of toil and lead the work to eliminate them through automation, AI agents, tooling, and process change.
- Set the conditions for AI agents to do reliable work in our environment, including repository context, well-specified tasks, integrations such as MCP servers that give AI safe access to the systems it needs, and the tests and guardrails needed for AI-authored change to be trusted.
- Participate in team ceremonies to identify and refine work, communicate findings, and drive opportunities to collaborate.
- Drive cross-team and cross-department collaboration on reliability initiatives, including reviewing designs, influencing roadmaps, and mentoring engineers on SRE practices, including effective AI use in their reliability work.
- Advise senior leadership and stakeholders during critical customer escalations, translating between technical reality and business impact.
- Contribute to scaling the SRE practice itself: improving our standards, our tooling, and how we partner with product engineering teams.
Requirements
- Minimum of 5 years of experience in software engineering, SRE, or production operations roles. (Required)
- Strong production troubleshooting skills across the stack. Ability to diagnose issues from first principles using the tools available (profilers, heap and thread dumps, query plans, traces, logs, metrics). (Required)
- Experience working within an Agile development framework. (Required)
- Hands-on experience operating production services on AWS (e.g. EC2, S3, EKS, RDS/Aurora, CloudFront). (Required)
- Experience using observability tools (e.g. Grafana, Prometheus, LogicMonitor). (Required)
- Experience creating clear and concise technical documentation that is targeted at both technical and non-technical audiences. (Required)
- Experience writing infrastructure as code. (Required)
- Experience writing automation in a general-purpose language (e.g. Python, Go, Java, or similar) to a production standard. (Required)
- Strong judgement about how to apply AI effectively across the full range of SRE work, including high-stakes areas such as production access and sensitive data, knowing how to scope and verify work to make it safe. (Required)
- Hands-on experience using agentic development tools (e.g. Claude Code, Cursor, Copilot) to deliver engineering and operational work, scoping and delegating bounded tasks, verifying the output, and shipping with confidence. (Required)
- Experience improving how a team works with AI, for example authoring reusable skills, repository context files, or prompt patterns that others adopt. (Required)
- Experience optimizing SQL queries and tuning database engines. (Preferred)
- Experience with CI/CD tooling (e.g. GitHub Actions, Jenkins). (Preferred)
- Exposure to chaos engineering, fault injection and disaster recovery exercises. (Preferred)
- Familiarity with FinOps practices. (Preferred)
- 2-year / Associate's degree (Required)
- 4-year / Bachelor's degree (Preferred)
- A combination of relevant experience and education may be considered
Benefits & conditions
At Jamf, base pay is one part of our total compensation package and is set within a defined range. These ranges can vary based on hiring location. Where an individual's pay falls within that range depends on several factors, including role scope, location, budget, skills, experience, and qualifications. This approach helps ensure fair, competitive pay and provides room to grow as you develop in your role.
Pay transparency range: $113,300-$205,520 USD
What does it mean to be a Jamf? We are a team of free-thinkers, can-doers, and problem-crushers. We value humility and the relentless pursuit of knowledge. Our culture flows from a spirit of selflessness and relentless self-improvement, driving both personal growth and collective progress throughout our company. We unite around common goals while respecting personal approaches, believing that fulfilled individuals create a thriving, vibrant workplace.