Arto Liukkonen
I broke the production
#1 about 6 minutes
A personal story of breaking production at scale
The speaker recounts causing a major production outage by running a backfill script that overwhelmed the Facebook API and halted data updates.
#2 about 2 minutes
Judging intentions versus actions during incidents
We tend to judge others by their actions but ourselves by our intentions, so we should assume good intent from colleagues during incidents.
#3 about 2 minutes
Why individual blame is a counterproductive response
A production issue is a system failure, not an individual's fault, because responsibility is shared across developers, reviewers, and processes.
#4 about 3 minutes
How to build a psychologically safe blameless culture
Shifting to a blameless culture requires fostering trust, understanding intentions, practicing self-awareness, and owning mistakes without displacing frustration.
#5 about 2 minutes
Using blameless postmortems for system-level learning
Blameless postmortems, originating from aviation and healthcare, focus on investigating root causes to strengthen systems rather than assigning individual blame.
#6 about 3 minutes
The power of positive feedback in code reviews
Applying the five-to-one ratio of positive to negative interactions can improve team dynamics, especially by adding positive comments during code reviews.
#7 about 2 minutes
Using pre-mortems to proactively prevent failures
Pre-mortems are a proactive exercise where teams imagine a project has already failed in order to identify potential risks and edge cases beforehand.
#8 about 3 minutes
Incident resolution and key cultural takeaways
The incident took 20 hours to fully resolve but was a valuable learning experience that exposed system flaws and reinforced a healthy team culture.
#9 about 2 minutes
Q&A on customer impact and worst production breaks
The speaker answers audience questions about customer reactions to the outage and shares a story about his worst production break involving a failed form.
Matching moments
29:58 MIN
How engineers handle production errors and monitoring
DevOps at Netflix
07:18 MIN
The pitfalls of a "move fast and break things" culture
Navigating the Future of Junior Developers in Tech
07:18 MIN
The negative impact of 'move fast' culture
Navigating the Future of Junior Developers in Tech
51:53 MIN
Q&A on shared systems and scaling productivity
Forget Developer Platforms, Think Developer Productivity!
04:10 MIN
Shifting from blame to learning in incident analysis
Empathy: The secret sauce of Resilience
24:47 MIN
Audience Q&A on DevOps, tools, and process
GitOps for the people
05:59 MIN
The downward spiral of platform engineering pitfalls
Forget Developer Platforms, Think Developer Productivity!
04:05 MIN
Building a safe space for teams to fail
Fail fast. How embracing failure leads to innovation at ZEISS.
Related Videos
Answering the Million Dollar Question: Why did I Break Production?
Luís Ventura
Shipping Quality Software In Hostile Environments
Luka Kladaric
What we Learned from Reading 100+ Kubernetes Post-Mortems
Noaa Barki
Chaos in Containers - Unleashing Resilience
Maish Saidel-Keesing
How to Navigate Professional Relationships in the Tech Industry
Rachel Lee Nabors
What I learned as a developer from accidents in space
Andrey Sitnik
Building a culture from chaos
Steve Upton
Empathy: The secret sauce of Resilience
Malin Litwinski