KubernetesIntermediatePar time: 7:00

The Volume That Won't Let Go

Database replica is down. The persistent volume is stuck on a node that no longer exists.

The Scenario

A spot instance was reclaimed by AWS 10 minutes ago. The StatefulSet rescheduled the postgres-replica-2 pod to a new node, but the pod has been stuck in ContainerCreating for 9 minutes. The PVC shows Bound but the pod events show Multi-Attach error: volume is already exclusively attached to one node. A stale VolumeAttachment object from the terminated node is preventing the EBS volume from attaching to the replacement node.

What You'll Learn

1

Why EBS volumes show Multi-Attach errors after spot instance reclamation

2

Finding and deleting stale VolumeAttachment objects that block re-scheduling

3

The difference between PVC Bound status and actual volume attachment readiness

4

Setting up spot instance drain handlers to avoid stale attachment objects

Tools You'll Use

kubectlVolumeAttachment objectsPVC describeAWS EBS events

Real-World Context

Stale VolumeAttachment objects after spot instance reclamation are a known AWS EBS + Kubernetes failure mode. The fix is simple once you know what to look for, but the symptom - a healthy-looking PVC that still won't attach - is extremely confusing.

Ready to debug this?

Free account required - sign up with GitHub or Google in 10 seconds

Play The Volume That Won't Let Go