HPC Cloud Updates WE 25 May 2025
Updates to AWS, Azure & GCP in the last week relevant for HPC practitioners. There are only two hard things in Computer Science: cache invalidation and naming things… and Microsoft proves it’s still terrible at one and leaves the other to you.

AWS
This new feature could be handy to swap out grid nodes without too much interruption to workload

New instances: High memory instances (U-1) now available in Ohio
Azure
Want to know how to use Slurm + CycleCloud with a custom image? Here you go
GPU failure rates must be getting really bad 😁
Azure Managed Redis (an alternative to Azure Cache for Redis) is now GA.Wow Microsoft really suck at naming. I guess it just proves how right Phil Karlton was when he said “There are only two hard things in Computer Science: cache invalidation and naming things.” Oh and the cache invalidation is down to you 😉
Github Copilot for Azure is also GA
Google Cloud
Well we had Google Cloud I/O, I won’t even attempt to cover everything and I’m sure you’ve seen it all in other places anyway. Unless I missed it, there doesn’t seem to have been much about HPC or AI infrastructure though.