Noaa Barki
What we Learned from Reading 100+ Kubernetes Post-Mortems
#1about 6 minutes
Understanding the developer versus DevOps cultural divide
A story from a DevOps meetup illustrates the different goals and perspectives that create friction between developers and operations teams.
#2about 2 minutes
Bridge the gap with champions and failure stories
Delegate knowledge to developer champions and learn best practices by studying the post-mortem stories of other companies.
#3about 5 minutes
Common Kubernetes misconfigurations from real outages
Examples from Target and Zalando show how simple errors like incorrect CronJob concurrency policies or missing memory limits can cause major production failures.
#4about 3 minutes
How to introduce policy enforcement gradually
Avoid organizational friction by implementing new policies slowly, starting with a single pilot team to gain agreement and understanding before a wider rollout.
#5about 3 minutes
Categorizing the three types of Kubernetes failures
Kubernetes failures typically fall into three categories: simple syntax errors, gaps in knowledge of best practices, and misalignment with internal company policies.
#6about 2 minutes
Validating Kubernetes YAML for syntax and schema errors
Use tools like yq for YAML format validation and kubeconform for schema validation without requiring direct cluster access for developers.
#7about 4 minutes
The challenges of managing policies as code in Git
Managing policies in Git creates versioning nightmares and lacks features for permissions, dynamic adjustments, and providing clear remediation guidelines.
#8about 4 minutes
Using Datree for centralized policy management
Datree is an open-source tool that provides a centralized location for managing policies, which are then enforced locally and in CI for developers.
#9about 1 minute
The real meaning of shifting responsibility left
True shift-left culture is not just about tools but about delegating responsibility and empowering developers to own their configurations.
Related jobs
Jobs that call for the skills explored in this talk.
Team Lead DevOps (m/w/d)
Rhein-Main-Verkehrsverbund Servicegesellschaft mbH
Frankfurt am Main, Germany
Senior
Matching moments
13:51 MIN
Using blameless postmortems for system-level learning
I broke the production
29:43 MIN
Final thoughts and free Kubernetes learning resources
It's all about the Data
10:33 MIN
Tools and techniques for Kubernetes development
How I saved 200K/yr in direct costs writing 0 code lines in K8s
19:04 MIN
Using pre-mortems to proactively prevent failures
I broke the production
05:59 MIN
The downward spiral of platform engineering pitfalls
Forget Developer Platforms, Think Developer Productivity!
37:52 MIN
Key takeaways for an efficient Kubernetes development workflow
Developing locally with Kubernetes - a Guide and Best Practices
00:09 MIN
A DBA's journey to running SQL Server on Kubernetes
Adjusting Pod Eviction Timings in Kubernetes
13:13 MIN
Maintaining quality with automation and observability
Crew Management System for Airlines: Plan duties for pilots & flight attendants worldwide
Featured Partners
Related Videos
Answering the Million Dollar Question: Why did I Break Production?
Luís Ventura
Chaos in Containers - Unleashing Resilience
Maish Saidel-Keesing
Kubernetes Security - Challenge and Opportunity
Marc Nimmerrichter
Kubernetes Security Best Practices
Rico Komenda
Mastering Kubernetes – Beginner Edition
Hannes Norbert Göring
I broke the production
Arto Liukkonen
Kubernetes Maestro: Dive Deep into Custom Resources to Unleash Next-Level Orchestration Power!
Um e Habiba
Operating etcd for Managed Kubernetes
Mario Valderrama
From learning to earning
Jobs that call for the skills explored in this talk.

DevOps Engineer – Kubernetes & Cloud (m/w/d)
epostbox epb GmbH
Berlin, Germany
Intermediate
Senior
DevOps
Kubernetes
Cloud (AWS/Google/Azure)


Cloud & DevOps Engineer Kubernetes, GitOps & Observability
AllatNet Recruiting GmbH & Co. KG
Go
Bash
Azure
DevOps
Python
+8

Cloud & DevOps Engineer Kubernetes, GitOps & Observability
AllatNet Recruiting GmbH & Co. KG
Go
Bash
Azure
DevOps
Python
+8




Cloud and DevOps Engineer (Kubernetes)
DATATRONiQ
Senior
Linux
DevOps
Kubernetes
Scripting (Bash/Python/Go/Ruby)
