Nele Uhlemann
Handling incidents collaboratively is like solving a rubix cube
#1about 4 minutes
The Rubik's Cube metaphor for engineering teams
Different engineering teams like backend and SREs operate on different sides of the system, requiring collaboration during incidents.
#2about 3 minutes
The first phase of resolving incidents collaboratively
The initial step in incident response is to establish a common understanding and transparency across teams before applying quick fixes.
#3about 2 minutes
Preventing future incidents with best practices
After resolving an incident, teams must collaborate on prevention by documenting best practices for patterns like service retries.
#4about 2 minutes
Discovering incidents through system observability
The discovery phase relies on making systems observable by collecting telemetry data like logs, metrics, and traces.
#5about 2 minutes
Standardizing telemetry collection with OpenTelemetry
OpenTelemetry provides a vendor-neutral standard for instrumenting applications, preventing vendor lock-in for observability backends.
#6about 2 minutes
Simplifying metrics with the Autometrics library
The open-source Autometrics library uses decorators to automatically generate key metrics like latency, errors, and request rate from functions.
#7about 5 minutes
Demo of generating metrics and SLOs from code
A live demo shows how Autometrics provides live metrics in the IDE and helps define SLOs that can be visualized in Grafana.
#8about 1 minute
Summary of collaborative incident management phases
A recap of the three key phases for collaborative incident handling: resolving, preventing, and discovering issues together.
#9about 2 minutes
Q&A on tooling and open source contribution
The speaker answers audience questions about managing tool complexity and the role of community contributions in open-source projects.
Related jobs
Jobs that call for the skills explored in this talk.
Matching moments
06:30 MIN
Applying agile and SRE principles to incident response
Applying Agile Principles to Incident Management
27:09 MIN
Actionable takeaways for SREs on incident management
Serverless Observability: where SLOs meet transforms
20:29 MIN
Using an incident console to manage response and resolvers
Applying Agile Principles to Incident Management
24:30 MIN
Fostering cross-team collaboration with SLOs
Serverless Observability: where SLOs meet transforms
29:58 MIN
How engineers handle production errors and monitoring
DevOps at Netflix
18:09 MIN
Overcoming observability challenges with a unified platform
All your telemetry data from any source in one place
28:08 MIN
Improving incident response to make on-call less painful
What Developers Get Wrong About Application Quality
13:39 MIN
Building resilience across all software stack layers
System Resilience: Surviving the Software Storm
Featured Partners
Related Videos
Applying Agile Principles to Incident Management
Tobias Dunn-Krahn
Mastering AI-Driven Problem Solving in Engineering with Observability
Jemiah Sius
Empathy: The secret sauce of Resilience
Malin Litwinski
The AI-Ready Stack: Rethinking the Engineering Org of the Future
Jan Oberhauser, Mirko Novakovic, Alex Laubscher & Keno Dreßel
SRE Methods In an Agency Environment
Martin Beránek
Metrics Handle with Care: The Paradox of Measuring Team Performance
Stefan Stelzer & Volker Zöpfel
Breaking Silos: Successful Collaboration Between Tech & Business Teams in Complex Enterprise Systems
Stefan Menschner & Alexander Weißhaupt
Unveiling the Dark Side: Navigating the Pitfalls of Digital Ambitions
Johannes Hansen
From learning to earning
Jobs that call for the skills explored in this talk.

Senior Backend Engineer Electrical Engineering
envelio
Köln, Germany
Remote
Senior
Python
Software Architecture

DevOps Engineer – Kubernetes & Cloud (m/w/d)
epostbox epb GmbH
Berlin, Germany
Intermediate
Senior
DevOps
Kubernetes
Cloud (AWS/Google/Azure)


Cloud Engineer (m/w/d)
fulfillmenttools
Köln, Germany
€50-65K
Intermediate
TypeScript
Google Cloud Platform
Continuous Integration

Backend Engineer (m/w/d)
fulfillmenttools
Köln, Germany
€35-65K
Intermediate
TypeScript
Agile Methodologies
Google Cloud Platform
![Senior Software Engineer [TypeScript] (Prisma Postgres)](https://wearedevelopers.imgix.net/company/283ba9dbbab3649de02b9b49e6284fd9/cover/oKWz2s90Z218LE8pFthP.png?w=400&ar=3.55&fit=crop&crop=entropy&auto=compress,format)
Senior Software Engineer [TypeScript] (Prisma Postgres)
Prisma
Remote
Senior
Node.js
TypeScript
PostgreSQL

Domain Architect Ricardo Platform (f/m/d) | 80-100% | Hybrid working model | Valbonne France
SMG Swiss Marketplace Group
Canton de Valbonne, France
Senior

Senior Fullstack Engineer – Angular/.Net (f/m/d)
Apaleo
München, Germany
Remote
€65-85K
Senior
.NET
Angular
JavaScript
+1

Lead Full-Stack Engineer
Hubert Burda Media
München, Germany
€80-95K
Intermediate
Senior
React
Vue.js
Node.js
MongoDB
+1