The real failure rate of EBS
Our experience running AWS EBS at scale for critical workloads
This is the cost of separating storage and compute, and of the sheer complexity of the software and networking components that sit between the client and the disks backing the volume. Let's assume the following: each degradation event is random, meaning the level of reduced performance is somewhere between 1% and 89% of provisioned performance, and your application is designed to withstand losing 50% of its expected throughput before it starts returning errors. This doesn’t reduce the impact to zero, as it’s impossible to detect this failure before it happens, but it does ensure that most cases don’t require a human to remediate and are over before users notice.
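To make the assumption above concrete, here is a minimal sketch of the arithmetic it implies. It assumes, beyond what the text states, that degraded performance levels are whole percentages drawn uniformly from 1% to 89% inclusive; the function name `failure_fraction` and its parameters are hypothetical, chosen only for illustration.

```python
from fractions import Fraction


def failure_fraction(low_pct=1, high_pct=89, tolerated_loss_pct=50):
    """Share of degradation events that drop throughput below what the app tolerates.

    Assumption (not from the article): levels are integer percentages of
    provisioned performance, uniformly distributed over [low_pct, high_pct].
    """
    # The application survives as long as it keeps at least this much throughput.
    min_required_pct = 100 - tolerated_loss_pct
    levels = range(low_pct, high_pct + 1)
    failing = sum(1 for level in levels if level < min_required_pct)
    return Fraction(failing, len(levels))


share = failure_fraction()
print(f"{share} of degradation events cause errors ({float(share):.1%})")
```

Under these assumptions, slightly more than half of all degradation events would push the application past its tolerance. A different distribution of degradation severity would change the figure, so treat it as an illustration of the reasoning, not a measured rate.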