HPC Cloud Updates WE 24 Nov 2024

Updates to AWS, Azure & GCP in the last week relevant for HPC practitioners

AWS

Need to comply with NIST SP 800-223 for your HPC workload? AWS have got a reference architecture for you

Building a secure and compliant HPC environment on AWS following NIST SP 800-223 | Amazon Web Services
Check out our latest blog post to learn how AWS enables building secure, compliant high performance computing (HPC) environments aligned with NIST SP 800-223 guidelines. We walk through the key components, security considerations, and steps for deploying a zone-based HPC architecture on AWS.

A few small things from AWS that contribute to an easier life managing your AWS compute for HPC. It’s funny they don’t link these releases together and sell them a bit more…

Faster, more accurate EC2 scaling? Yes please

Amazon EC2 Auto Scaling introduces highly responsive scaling policies - AWS

Couple it with CPU-performance-based attribute selection

Amazon EC2 added New CPU-Performance Attribute for Instance Type Selection - AWS
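
For anyone wondering what that looks like in practice, here is a minimal Python sketch of the request fragment. It assumes the new attribute is exposed as `BaselinePerformanceFactors` inside `InstanceRequirements`, keyed on a reference instance family. The helper name `cpu_baseline_requirements` and the `c6i` baseline are purely illustrative, so check the current EC2 API reference before relying on the field names.

```python
def cpu_baseline_requirements(reference_family, min_vcpus, min_memory_mib):
    """Build an InstanceRequirements-style dict for attribute-based selection.

    Assumption: the CPU-performance attribute is expressed as
    BaselinePerformanceFactors -> Cpu -> References -> InstanceFamily.
    """
    return {
        # vCPU and memory minimums for attribute-based selection
        "VCpuCount": {"Min": min_vcpus},
        "MemoryMiB": {"Min": min_memory_mib},
        # Ask for instance types that match or beat the reference family's CPU
        "BaselinePerformanceFactors": {
            "Cpu": {"References": [{"InstanceFamily": reference_family}]}
        },
    }


# Illustrative values only: a c6i baseline with 16 vCPUs and 32 GiB minimum
requirements = cpu_baseline_requirements("c6i", min_vcpus=16, min_memory_mib=32768)
print(requirements)
```

The resulting dict would then be passed as the `InstanceRequirements` argument wherever attribute-based selection is accepted, for example boto3's `get_instance_types_from_instance_requirements` call on the EC2 client.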

make better use of your capacity reservations too

Amazon EC2 introduces provisioning control to launch instances on On-Demand Capacity - AWS

and see the lineage of your AMIs

Amazon EC2 now provides lineage information for your AMIs - AWS

Individually those are small but good improvements. They all landed in the same week, during SC24 no less, and no one spun them into an HPC-flavoured press release? Maybe they’re all too busy writing AI content for re:Invent.

c7i-flex and m7i-flex now available in Malaysia

Amazon EC2 C7i-flex and M7i-flex instances are now available in AWS Asia Pacific (Malaysia) Region - AWS

and r8g in Stockholm

Amazon EC2 R8g instances now available in AWS Europe (Stockholm) - AWS

and c6a and r6a in Hyderabad

Amazon EC2 C6a and R6a instances now available in additional AWS region - AWS

Next gen FSx for Lustre now in California. That should give the AI bros something to be happy about.

The next generation of Amazon FSx for Lustre file systems is now available in US West (N. California) - AWS

Azure

I think Azure were the only cloud provider with a major announcement at SC24 this week. Introducing the HBv5

Announcing Azure HBv5 Virtual Machines: A Breakthrough in Memory Bandwidth for HPC | Microsoft Community Hub
Discover the new Azure HBv5 Virtual Machines, unveiled at Microsoft Ignite, designed for high-performance computing applications. With up to 7 TB/s of memory…

Also released this week, and quite possibly just as interesting for a lot of HTC and single-node HPC workloads: the Da/Ea/Fa v6 family

New Da/Ea/Fav6 VMs with increased performance and Azure Boost are now generally available | Microsoft Community Hub
By Sasha Melamed, Senior Product Manager, Azure Compute. We are excited to announce General Availability of new Dalsv6, Dasv6, Easv6, Falsv6, Fasv6,…
Azure updates | Microsoft Azure
Subscribe to Microsoft Azure today for service updates, all in one place. Check out the new Cloud Platform roadmap to see our latest product plans.

Can’t believe this hasn’t had more fanfare around it. I’ve been waiting for this for a while! Azure Spot Placement Score. If you need spot capacity, this will help you figure out how much you might be able to get hold of before you request it

Azure Compute Fleet will now support multiple regions. Nice.

and attribute-based instance selection

If you’re still using Scale Sets, you can now emit custom metrics to determine the order in which machines are upgraded

and also Zonal expansion

Azure seem to be taking a different path to AWS and Google in the Redis/Valkey split with their new managed Redis offering

Google Cloud

Google had pre-empted SC24 with news last week of all the HPC-related updates they had released this year, but announced the arrival of NVIDIA GB200s on Google Cloud