What we Learned from Reading 100+ Kubernetes Post-Mortems
Noaa Barki - 3 years ago
A smart person learns from their own mistakes, but a truly wise person learns from the mistakes of others.
When launching our product, we wanted to learn as much as possible about typical pains in our ecosystem, and did so by reviewing many post-mortems (100+!) to discover the recurring patterns, anti-patterns, and root causes of typical outages in Kubernetes-based systems.
In this talk we have aggregated for you the insights we gathered, and in particular will review the most obvious DON'Ts and some less obvious ones, that may help you prevent your next production outage by learning from others' real world (horror) stories.
When launching our product, we wanted to learn as much as possible about typical pains in our ecosystem, and did so by reviewing many post-mortems (100+!) to discover the recurring patterns, anti-patterns, and root causes of typical outages in Kubernetes-based systems.
In this talk we have aggregated for you the insights we gathered, and in particular will review the most obvious DON'Ts and some less obvious ones, that may help you prevent your next production outage by learning from others' real world (horror) stories.
Jobs with related skills

(Senior) IT Cloud Architekt /Banking (all genders)
msg
·
8 days ago
Frankfurt am Main, Germany
+8
Hybrid

Product Owner (m/w/d) Betrieb – Cloud & SaaS
PROSOZ Herten GmbH
·
25 days ago
Herten, Germany
Hybrid

Senior Software Engineer
Riverty
·
yesterday
Tallinn, Estonia
Hybrid

Head of Development (w/m/d)
aedifion GmbH
·
12 days ago
Köln, Germany
Hybrid
Related Videos