Andrew Pruski
Adjusting Pod Eviction Timings in Kubernetes
#1 about 3 minutes
A DBA's journey to running SQL Server on Kubernetes
The speaker shares his background as a SQL Server DBA and the efficiency gains that led him to explore running stateful workloads in Kubernetes.
#2 about 2 minutes
Why the default five-minute failover is unacceptable
When a node fails, Kubernetes by default waits five minutes before evicting its pods and rescheduling them elsewhere, which is too long for stateful applications such as SQL Server.
#3 about 5 minutes
Demonstrating the default pod eviction delay in action
A live demo in AKS shows an nginx pod taking five minutes to be rescheduled onto a healthy node after its original node is shut down.
#4 about 2 minutes
How to configure faster pod eviction with tolerations
Pod eviction time can be reduced from five minutes to seconds by adding tolerations with a short tolerationSeconds value for the not-ready and unreachable node taints in the deployment YAML.
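
As a rough illustration of the tolerations described here, the fragment below is a minimal sketch of an nginx deployment, not the manifest used in the talk. The deployment name, label, and image are assumptions chosen for illustration, and the 10-second value simply mirrors the ten-second failover mentioned in the demo summary. The two taint keys are the standard ones Kubernetes applies to not-ready and unreachable nodes.

```yaml
# Hypothetical manifest: the name, label, and image are illustrative assumptions.
apiVersion: apps/v1
kind: Deployment
metadata:
  name: nginx
spec:
  replicas: 1
  selector:
    matchLabels:
      app: nginx
  template:
    metadata:
      labels:
        app: nginx
    spec:
      containers:
      - name: nginx
        image: nginx
      tolerations:
      # Without these entries, Kubernetes adds both tolerations itself
      # with tolerationSeconds: 300, which is the five-minute default.
      - key: "node.kubernetes.io/not-ready"
        operator: "Exists"
        effect: "NoExecute"
        tolerationSeconds: 10   # assumed value, taken from the demo summary
      - key: "node.kubernetes.io/unreachable"
        operator: "Exists"
        effect: "NoExecute"
        tolerationSeconds: 10   # assumed value, taken from the demo summary
```

The tolerationSeconds countdown only starts once the node has actually been tainted as not-ready or unreachable, so the time observed from node failure to a new pod can be longer than the configured value.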
#5 about 3 minutes
Demo of a ten-second failover using tolerations
By adding tolerations to the nginx deployment, a new pod is spun up on a healthy node just ten seconds after the original node fails.
#6 about 7 minutes
Why fast pod eviction fails for stateful apps
A demo with SQL Server shows that even with tolerations, the new pod gets stuck in a "ContainerCreating" state due to a multi-attach error on the persistent volume.
#7 about 7 minutes
Achieving high availability with Portworx storage
A third-party tool called Portworx provides a storage class that correctly detaches and reattaches storage, enabling a SQL Server pod to fail over successfully in seconds.
#8 about 1 minute
Key considerations for stateful app high availability
Achieving high availability for stateful apps in Kubernetes requires adjusting pod tolerations and using a storage solution that can handle volume reattachment across nodes.