HPC Cloud Updates WE 29 Mar 2026

Updates to AWS, Azure & GCP in the last week relevant for HPC practitioners. AWS gives us updates to both PCS and the other one (Parallel Cluster). Learn how to use TPUs from Google.

HPC Cloud Updates WE 29 Mar 2026

AWS

I’m all for making HPC easier. Anything that does that is a noble effort. Even if it is the same technology that took down one of the most reliable online retailers in the world

Accelerating HPC Deployment with AWS Parallel Computing Service and Kiro CLI | Amazon Web Services
Research teams moving from on-premises HPC environments often struggle with the complexity of cloud deployment. Traditional approaches require deep expertise in AWS networking, storage architectures, and Slurm configuration management. A typical manual deployment involves weeks of infrastructure provisioning, network topology design, scheduler configuration, and performance tuning. Research teams with limited platform engineering resources find themselves […]

AWS Parallel Cluster adds support for Slurm 25.11 and P6-B300 VMs

AWS ParallelCluster 3.15 with support for P6-B300 and Slurm 25.11 - AWS
Discover more about what’s new at AWS with AWS ParallelCluster 3.15 with support for P6-B300 and Slurm 25.11

AWS Parallel Compute Services also got some updates with the ability to set slurmbd and cgroup settings

AWS Parallel Computing Service supports slurmdbd and cgroups settings - AWS
Discover more about what’s new at AWS with AWS Parallel Computing Service supports slurmdbd and cgroups settings

New Regional Instances: R8gd in California, Seoul, Hong Kong, Jakarta, Cape Town and Calgary, M8a in Ireland


Azure

Must be tired from all the hype and recovery of GTC. Nothing going on here


Google Cloud

A guide to using TPUs for your AI training workloads instead of your usual CUDA

Training large models on Ironwood TPUs | Google Cloud Blog
Learn about the key techniques and tools within the JAX and MaxText ecosystem to improve training efficiency on Ironwood TPUs.