Senior Software Engineer II, DevEx, OPX

Samsara Inc.

Raleigh, United States of America

4 days ago

Role details

Contract type

Permanent contract

Employment type

Full-time (> 32 hours)

Working hours

Regular working hours

Languages

English

Experience level

Senior

Compensation

$ 208K

Job location

Remote

Raleigh, United States of America

Tech stack

Artificial Intelligence

Amazon Web Services (AWS)

Data analysis

Cloud Computing

Python

Systems Development Life Cycle

Software Engineering

Datadog

Grafana

Reliability of Systems

Infrastructure as Code (IaC)

Build Management

Information Technology

Terraform

New Relic (SaaS)

Pagerduty

Job description

Senior Software Engineer II, Operational Excellence (OPX) within the Developer Experience organization. The OPX team ensures production health and system reliability across Samsara, focusing on automated safeguards, incident tooling, observability, and AI-driven operational solutions. This is a remote position for candidates in the Eastern Time Zone. Responsibilities

Design and build automated reliability and self-healing systems that protect production at scale, including rollbacks, deploy safeguards, and fault mitigation, then offer them as platform tooling for use across the company.
Own and improve incident management tooling and on-call health - reduce alert noise, surface actionable signals, and empower engineering teams to operate services with minimal burden.
Develop and evolve observability infrastructure, including monitoring, alerting, SLOs, and performance regression detection, to give teams real-time, actionable visibility into system health and latency.
Contribute to AI-driven operational tooling that moves beyond triage toward autonomous remediation where the system detects issues, takes corrective action, and self-recovers with minimal human involvement.
Drive incident prevention by identifying systemic patterns and eliminating operational toil, with a deep empathy for on-call engineers.
Partner directly with product engineering teams to diagnose reliability gaps, reduce operational burden, and help them adopt best practices for running services.
Define and champion operational excellence best practices across engineering through guardrails, scorecards, and standards that help teams run services reliably by default.
Champion, role model, and embed Samsara's cultural principles (Focus on Customer Success, Build for the Long Term, Adopt a Growth Mindset, Be Inclusive, Win as a Team) as the company scales globally.

Requirements

8+ years of experience designing and building products in a software engineering team.
Bachelor's Degree in Computer Science/Engineering or equivalent experience.
3+ years of experience in infrastructure and/or platform engineering-focused teams.
Expertise in observability and reliability, operational metrics, and data analysis.
Track record in architecting monitoring frameworks, SLO platforms, and automated response workflows; experience with Datadog (or equivalent tools such as New Relic, Grafana).
Experience with large-scale enterprise software applications.
Experience in Developer Experience (DevEx) & Internal Portals: designing and implementing tools that centralize and simplify engineering operations.
Familiarity with cloud platforms (AWS, GCP, or the like).
Experience implementing AI-driven automation across the SDLC to reduce developer friction and accelerate delivery.
Strong coding skills in Go, Python, or equivalent for infrastructure, deployment, and operations challenges.
Mentoring and supporting engineers and role-modeling engineering practices within a technical lead capacity.
Proactive growth mindset with a continual improvement focus.

An ideal candidate also has

Strong communication skills and a desire to collaborate across teams.
Experience with incident management tooling (Incident.io, PagerDuty, or equivalent).
Experience with Infrastructure as Code (IaC) - Terraform.

Benefits & conditions

Annual base salary: $154,700 - $208,000 USD, with potential RSU grants and performance bonuses. Additional benefits include a flexible remote model, professional development stipend, comprehensive health care, and parental leave. Working arrangements

Remote position for candidates residing in the Eastern Time Zone of the US or Canada. No relocation assistance provided. EEO & Accommodations

About the company

Why join us * Impact industry operations that power the global economy, delivering real-world safety and efficiency. * Build for scale with over 2.3 million IoT devices deployed worldwide. * Adopt a learning culture with a growth mindset across diverse technologies. * Engage directly with customers, product managers, and success teams to shape outcomes. * Collaborate as a team driver-owned company toward a connected-operations vision.

Role details

Job location

Tech stack

Job description

Requirements

Benefits & conditions

About the company

Apply for this position

Good distractions

Moments

Videos View all