<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0" xmlns:media="http://search.yahoo.com/mrss/"><channel><title><![CDATA[Flux by HMx Labs]]></title><description><![CDATA[High Performance Computing | Data]]></description><link>https://cloudhpc.news/</link><image><url>https://cloudhpc.news/favicon.png</url><title>Flux by HMx Labs</title><link>https://cloudhpc.news/</link></image><generator>Ghost 5.80</generator><lastBuildDate>Sat, 02 May 2026 04:09:02 GMT</lastBuildDate><atom:link href="https://cloudhpc.news/rss/" rel="self" type="application/rss+xml"/><ttl>60</ttl><item><title><![CDATA[Kubernetes’ Place in a HPC World]]></title><description><![CDATA[<p><em>Does using Kubernetes to run HPC workloads really make sense?</em></p><p>I have an opinion that I don&#x2019;t usually share online for fear of being ridiculed. But yesterday I shared it with a stranger and lived. Plus, its Friday so even if I&#x2019;m totally wrong on this</p>]]></description><link>https://cloudhpc.news/kubernetes-place-in-a-hpc-world/</link><guid isPermaLink="false">69f43c2c1b3880bdc6b823ff</guid><dc:creator><![CDATA[Hamza]]></dc:creator><pubDate>Fri, 01 May 2026 07:00:55 GMT</pubDate><media:content url="https://cloudhpc.news/content/images/2026/05/HPC-doesn-t-belong-on-k8-change-my-mind.jpg" medium="image"/><content:encoded><![CDATA[<img src="https://cloudhpc.news/content/images/2026/05/HPC-doesn-t-belong-on-k8-change-my-mind.jpg" alt="Kubernetes&#x2019; Place in a HPC World"><p><em>Does using Kubernetes to run HPC workloads really make sense?</em></p><p>I have an opinion that I don&#x2019;t usually share online for fear of being ridiculed. But yesterday I shared it with a stranger and lived. Plus, its Friday so even if I&#x2019;m totally wrong on this everyone will have forgotten by Monday anyway. I don&#x2019;t think Kubernetes should be used to run HPC workloads.</p><p>I&#x2019;ll admit, it has been a number of years now since I last did this in anger at any kind of scale and I&#x2019;m certain that K8 has evolved considerably since then. We&#x2019;ve definitely had things like Slinky released to make it easier too. Even so, I don&#x2019;t see a compelling reason for the additional complexity a K8 cluster brings when all you really care about is access to compute. It only gets worse when you have to pass through any kind of specialised hardware.</p><p>I think at some point in time, it made sense because K8 became a defacto way to access cloud capacity in something close to a vendor agnostic way. I get that. I don&#x2019;t think it&#x2019;s a good reason anymore. While the clouds may not have a common API there is more than one commercial product and open source project that let you work around those limitations. Even without those, having to deal with two or three APIs to manage resources is really not that much work. Especially in an era where your favourite genie will whip that up in the blink of an eye.</p><p>Conversely, I think from a cloud provider&#x2019;s point of view, I wonder if it makes it easier to then pull you along the path to additional cloud native products. And we all know cloud native was just a great marketing term for cloud locked right? &#x1F601;</p><p>Change my mind.</p>]]></content:encoded></item><item><title><![CDATA[HPC Release Notes Automation]]></title><description><![CDATA[Automating the weekly HPC release notes process which means it can get more comprehensive!]]></description><link>https://cloudhpc.news/hpc-release-notes-automation/</link><guid isPermaLink="false">69f1984e1b3880bdc6b82375</guid><category><![CDATA[Random Bytes]]></category><dc:creator><![CDATA[Hamza]]></dc:creator><pubDate>Wed, 29 Apr 2026 07:00:21 GMT</pubDate><media:content url="https://cloudhpc.news/content/images/2026/04/tinder-for-hpc.jpg" medium="image"/><content:encoded><![CDATA[<img src="https://cloudhpc.news/content/images/2026/04/tinder-for-hpc.jpg" alt="HPC Release Notes Automation"><p>For roughly two years I&#x2019;ve been putting together release notes from the major cloud providers that are relevant to people that care about HPC. By Hand. Like some kind of caveman. Every week I&#x2019;d read the release notes and pick out the bits that are HPC relevant.</p><p>I did try and automate it a couple of times, for some definition of try. Yea ok I just told Claude and ChatGPT to do it in a web chat window. I didn&#x2019;t try very hard. Anyway, that never seemed to work very well. Mostly I seem to struggle getting it to correctly identify what might be HPC relevant.</p><p>Last weekend though I tried a bit harder. I AI first engineered, nah who am I kidding, I straight up vibe coded a web app that reads the RSS feeds from the cloud providers, attempts to classify them, mostly fails and gives me a way to swipe left or right on each release note so you still get a nice, curated list.</p><p>It needs a little (maybe a lot?) more work before I put it live and I&#x2019;m sort of playing with writing my own ML classifier based on the last two years of data rather than using an LLM.</p><p>Don&#x2019;t worry, one of the items on the to-do list is a way for me to add the odd snarky comment against the releases like you get in todays version &#x1F606;</p><p>Now that this process is a bit more scale-able, not only can I add a more clouds and possibly neo clouds to the RSS list, but other vendors too that produce HPC relevant products. Well so long as they have an RSS feed, I guess. So, who else should I add?&#xA0;</p><p>More importantly, it will need a name. Suggestions?</p>]]></content:encoded></item><item><title><![CDATA[Quantum #109]]></title><description><![CDATA[Issue #109 of the weekly HPC newsletter by HMx Labs. Cerberas files for an IPO, Google gives us TPU v8, Meta adopts Graviton, Bolt Graphics tapes out their first GPU and we try and make workloads a tiny bit more portable.]]></description><link>https://cloudhpc.news/quantum-109/</link><guid isPermaLink="false">69ee78621b3880bdc6b8229a</guid><category><![CDATA[Quantum Weekly]]></category><dc:creator><![CDATA[Hamza]]></dc:creator><pubDate>Mon, 27 Apr 2026 09:00:44 GMT</pubDate><media:content url="https://cloudhpc.news/content/images/2026/04/quantum-109.jpg" medium="image"/><content:encoded><![CDATA[<img src="https://cloudhpc.news/content/images/2026/04/quantum-109.jpg" alt="Quantum #109"><p>Cerberas decided to attempt another IPO last week while Bolt Graphics taped out its first GPU in an attempt to reclaim some of the vacant space in the FP64 space left by Nvidia&#x2019;s all while Google also announced its 8<sup>th</sup>generation TPUs. The market for accelerated computing certainly seems to be a lot diverse than it ever has been in the past and it will be interesting to see how many, if any, of these alternatives to Nvidia&#x2019;s CUDA ecosystem manage to gain a significant foothold.</p><p>&#xA0;I would say the choice is nice to have, but the reality is we can barely even swap across CPU architectures in many domains (more on this in a bit) let alone across the significantly more difficult accelerated computing domains. For now the choice may possibly give us a more competitive market (and even then I&#x2019;m not so sure) but for most people it will still mean picking a direction and sticking with it for a number of years if not decades.</p><p>The timing of Cerberas&#x2019; IPO is also interesting, being as it would, the first major AI IPO this year. Are they trying to get out ahead of OpenAI and Anthropic who leave about as much liquidity as there is in the Sahara?</p><p>Meta announced a partnership with AWS and adoption of Graviton CPUs amidst increasing chatter that we&#x2019;re running short of CPU capacity too now. Allegedly this is because agentic workflows demand more CPU capacity and are seeing GPU to CPU ratios collapse to close to 1:1 instead of the current 2:1 or even 4:1. I don&#x2019;t buy it. At least not for CPUs collocated with GPUs. It simply makes no sense to burden a GPU machine with anything other GPU workloads. You&#x2019;d architect any system at scale to offload CPU only compute elsewhere. And quite frankly I don&#x2019;t buy that it&#x2019;s even CPU bound at all. It&#x2019;s more than likely IO bound. I&#x2019;m sure Intel/AMD/ARM would love to make more money on their CPUs, and this is a useful narrative, but I don&#x2019;t for a minute think it&#x2019;s true. At least not for the reasons being proposed.</p><p>In our own news, we&#x2019;ve moved the needle a little in making compute workloads more portable both across clouds and CPU architectures last week. Well technically it happened over a longer period, but we talked about it last week! Checkout FINOS ORB and our work on automating resolving numerical stability challenges when moving CPU architectures.</p><hr><h2 id="in-the-news">In The News</h2><p>Updates from the big three clouds on all things HPC.</p><figure class="kg-card kg-bookmark-card"><a class="kg-bookmark-container" href="https://cloudhpc.news/hpc-cloud-updates-we-26-apr-2026/"><div class="kg-bookmark-content"><div class="kg-bookmark-title">HPC Cloud Updates WE 26 Apr 2026</div><div class="kg-bookmark-description">Updates to AWS, Azure &amp; GCP in the last week relevant for HPC practitioners. PCS supports Slurm 25.11, Azure gives you NetApp ransomware scanning and Google Cloud Next Updates.</div><div class="kg-bookmark-metadata"><img class="kg-bookmark-icon" src="https://cloudhpc.news/favicon.ico" alt="Quantum #109"><span class="kg-bookmark-author">Flux by HMx Labs</span><span class="kg-bookmark-publisher">Hamza</span></div></div><div class="kg-bookmark-thumbnail"><img src="https://cloudhpc.news/content/images/2026/04/CloudHPCNews-2026-04-26.jpg" alt="Quantum #109"></div></a></figure><p>Bolt Graphics provides an FP64 alternative to Nvidia GPUs</p><p><a href="https://www.hpcwire.com/2026/04/22/bolt-graphics-targets-fp64-hpc-workloads-with-zeus-gpu/?ref=cloudhpc.news">https://www.hpcwire.com/2026/04/22/bolt-graphics-targets-fp64-hpc-workloads-with-zeus-gpu/</a></p><p>Cerberas tries for IPO again. First of the AI IPOs this year?</p><figure class="kg-card kg-bookmark-card"><a class="kg-bookmark-container" href="https://www.nextplatform.com/compute/2026/04/22/the-second-time-will-be-the-ipo-charm-for-cerebras/5218651?ref=cloudhpc.news"><div class="kg-bookmark-content"><div class="kg-bookmark-title">The Second Time Will Be The IPO Charm For Cerebras</div><div class="kg-bookmark-description">Waferscale
chip pioneer and AI systems maker Cerebras Systems filed
to go public back in September 2&#x2026;</div><div class="kg-bookmark-metadata"><img class="kg-bookmark-icon" src="https://www.nextplatform.com/view-resources/dachser2/public/nextplatform/favicons/apple-touch-icon.png" alt="Quantum #109"><span class="kg-bookmark-author">nextplatform</span><span class="kg-bookmark-publisher">Timothy Prickett Morgan</span></div></div><div class="kg-bookmark-thumbnail"><img src="https://image.nextplatform.com/5218653.jpg?imageId=5218653&amp;x=0&amp;y=0&amp;cropw=100&amp;croph=100&amp;panox=0&amp;panoy=0&amp;panow=100&amp;panoh=100&amp;width=1200&amp;height=683" alt="Quantum #109"></div></a></figure><p>We need more CPUs for AI? I don&#x2019;t buy it. This is a lazy analysis</p><figure class="kg-card kg-bookmark-card"><a class="kg-bookmark-container" href="https://www.tomshardware.com/pc-components/cpus/shifting-need-for-cpus-in-ai-workloads-drives-intensifying-shortages-price-hikes?utm_source=chatgpt.com"><div class="kg-bookmark-content"><div class="kg-bookmark-title">CPU requirements for AI workloads are multiplying, driving intensifying shortages and price hikes &#x2014; Intel already shifting production from consumer chips to Xeon as inference workloads drive server CPU ratios back toward parity with GPUs</div><div class="kg-bookmark-description">Intel says CPU-to-GPU deployment ratios have tightened from 1:8 to 1:4, and could reach 1:1.</div><div class="kg-bookmark-metadata"><img class="kg-bookmark-icon" src="https://vanilla.futurecdn.net/tomshardware/1614069/apple-touch-icon.png" alt="Quantum #109"><span class="kg-bookmark-author">Tom&apos;s Hardware</span><span class="kg-bookmark-publisher">Luke James</span></div></div><div class="kg-bookmark-thumbnail"><img src="https://cdn.mos.cms.futurecdn.net/B22Y35c6WAhjUoKp7cjvsd-2560-80.jpg" alt="Quantum #109"></div></a></figure><p>Meta still on a spending spree for hardware, this time Graviton CPUs</p><figure class="kg-card kg-bookmark-card"><a class="kg-bookmark-container" href="https://about.fb.com/news/2026/04/meta-partners-with-aws-on-graviton-chips-to-power-agentic-ai/?ref=cloudhpc.news"><div class="kg-bookmark-content"><div class="kg-bookmark-title">Meta Partners With AWS on Graviton Chips to Power Agentic AI</div><div class="kg-bookmark-description">We&#x2019;re announcing an agreement with AWS to bring tens of millions of AWS Graviton cores into Meta&#x2019;s compute portfolio to support agentic AI workloads.</div><div class="kg-bookmark-metadata"><img class="kg-bookmark-icon" src="https://about.fb.com/wp-content/uploads/2021/10/meta-favicon.png?fit=16%2C16" alt="Quantum #109"><span class="kg-bookmark-author">Meta Newsroom</span><span class="kg-bookmark-publisher">heathera</span></div></div><div class="kg-bookmark-thumbnail"><img src="https://about.fb.com/wp-content/uploads/2026/04/Partnership-Announcement_Header.jpg?w=1200" alt="Quantum #109"></div></a></figure><hr><h2 id="from-hmx-labs">From HMx Labs</h2><p>The end game is to make HPC workloads more portable, so we had a couple of small wins in that space last week. Progress in running across different clouds and CPU architectures.</p><figure class="kg-card kg-bookmark-card"><a class="kg-bookmark-container" href="https://cloudhpc.news/finos-orb-one-cloud-api-to-rule-them-all/"><div class="kg-bookmark-content"><div class="kg-bookmark-title">FINOS ORB: One Cloud API to Rule Them All?</div><div class="kg-bookmark-description">FINOS announced the Open Resource Broker yesterday, an adaptive layer to allow HPC schedulers to control cloud capacity designed to work across multiple clouds and schedulers. We&#x2019;re all on in and are helping to create it.</div><div class="kg-bookmark-metadata"><img class="kg-bookmark-icon" src="https://cloudhpc.news/favicon.ico" alt="Quantum #109"><span class="kg-bookmark-author">Flux by HMx Labs</span><span class="kg-bookmark-publisher">Hamza</span></div></div><div class="kg-bookmark-thumbnail"><img src="https://cloudhpc.news/content/images/2026/04/ORB.jpg" alt="Quantum #109"></div></a></figure><figure class="kg-card kg-bookmark-card"><a class="kg-bookmark-container" href="https://cloudhpc.news/testing-for-numerical-stability-and-can-ai-help-fix-it/"><div class="kg-bookmark-content"><div class="kg-bookmark-title">Testing for Numerical Stability and Can AI Help Fix It</div><div class="kg-bookmark-description">Can LLMs fix numerical stability problems in your code. Let&#x2019;s design a little experiment to find out and how much would you just like a way to just validate your numerical stability across CPU generations and architectures.</div><div class="kg-bookmark-metadata"><img class="kg-bookmark-icon" src="https://cloudhpc.news/favicon.ico" alt="Quantum #109"><span class="kg-bookmark-author">Flux by HMx Labs</span><span class="kg-bookmark-publisher">Hamza</span></div></div><div class="kg-bookmark-thumbnail"><img src="https://cloudhpc.news/content/images/2026/04/numerical-stablity-scan.jpg" alt="Quantum #109"></div></a></figure><hr><p>Know someone else who might like to read this newsletter? Forward this on to them or even better, ask them to sign up here:&#xA0;<a href="https://cloudhpc.news/">https://cloudhpc.news</a></p>]]></content:encoded></item><item><title><![CDATA[HPC Cloud Updates WE 26 Apr 2026]]></title><description><![CDATA[Updates to AWS, Azure & GCP in the last week relevant for HPC practitioners. PCS supports Slurm 25.11, Azure gives you NetApp ransomware scanning and Google Cloud Next Updates.]]></description><link>https://cloudhpc.news/hpc-cloud-updates-we-26-apr-2026/</link><guid isPermaLink="false">69ee13201b3880bdc6b8227a</guid><category><![CDATA[HPC Cloud Release Notes]]></category><dc:creator><![CDATA[Hamza]]></dc:creator><pubDate>Sun, 26 Apr 2026 13:32:00 GMT</pubDate><media:content url="https://cloudhpc.news/content/images/2026/04/CloudHPCNews-2026-04-26.jpg" medium="image"/><content:encoded><![CDATA[<h2 id="aws">AWS</h2><img src="https://cloudhpc.news/content/images/2026/04/CloudHPCNews-2026-04-26.jpg" alt="HPC Cloud Updates WE 26 Apr 2026"><p>Sagemaker optimises your Slurm topology automatically now</p><figure class="kg-card kg-bookmark-card"><a class="kg-bookmark-container" href="https://aws.amazon.com/about-aws/whats-new/2026/04/amazon-sagemaker-hyperpod-automatic-slurm-topology/?ref=cloudhpc.news"><div class="kg-bookmark-content"><div class="kg-bookmark-title">Amazon SageMaker HyperPod now supports automatic Slurm topology management - AWS</div><div class="kg-bookmark-description">Discover more about what&#x2019;s new at AWS with Amazon SageMaker HyperPod now supports automatic Slurm topology management</div><div class="kg-bookmark-metadata"><img class="kg-bookmark-icon" src="https://a0.awsstatic.com/libra-css/images/site/touch-icon-ipad-144-smile.png" alt="HPC Cloud Updates WE 26 Apr 2026"><span class="kg-bookmark-author">Amazon Web Services, Inc.</span></div></div></a></figure><p>PCS Support for Slurm 25.11</p><figure class="kg-card kg-bookmark-card"><a class="kg-bookmark-container" href="https://aws.amazon.com/about-aws/whats-new/2026/04/aws-pcs-slurm-25-11/?ref=cloudhpc.news"><div class="kg-bookmark-content"><div class="kg-bookmark-title">AWS Parallel Computing Service now supports Slurm 25.11 - AWS</div><div class="kg-bookmark-description">Discover more about what&#x2019;s new at AWS with AWS Parallel Computing Service now supports Slurm 25.11</div><div class="kg-bookmark-metadata"><img class="kg-bookmark-icon" src="https://a0.awsstatic.com/libra-css/images/site/touch-icon-ipad-144-smile.png" alt="HPC Cloud Updates WE 26 Apr 2026"><span class="kg-bookmark-author">Amazon Web Services, Inc.</span></div></div></a></figure><p>Relying on AWS Compute Optimizer to do the heavy work figuring out if you&#x2019;re using the right kind of EC2 instance? It now supports a lot ore instance types&#x2026; but I have an even better answer for you</p><figure class="kg-card kg-bookmark-card"><a class="kg-bookmark-container" href="https://aws.amazon.com/about-aws/whats-new/2026/04/aws-compute-optimizer-ec2-rds/?ref=cloudhpc.news"><div class="kg-bookmark-content"><div class="kg-bookmark-title">AWS Compute Optimizer supports 162 new EC2 instance types and 32 new RDS DB instance classes - AWS</div><div class="kg-bookmark-description">Discover more about what&#x2019;s new at AWS with AWS Compute Optimizer supports 162 new EC2 instance types and 32 new RDS DB instance classes</div><div class="kg-bookmark-metadata"><img class="kg-bookmark-icon" src="https://a0.awsstatic.com/libra-css/images/site/touch-icon-ipad-144-smile.png" alt="HPC Cloud Updates WE 26 Apr 2026"><span class="kg-bookmark-author">Amazon Web Services, Inc.</span></div></div></a></figure><p>This sounds really promising but I do wonder how effective it is in reality.&#xA0;</p><figure class="kg-card kg-bookmark-card"><a class="kg-bookmark-container" href="https://aws.amazon.com/about-aws/whats-new/2026/04/sagemaker-ai-inference-rec/?ref=cloudhpc.news"><div class="kg-bookmark-content"><div class="kg-bookmark-title">Amazon SageMaker AI launches optimized generative AI inference recommendations - AWS</div><div class="kg-bookmark-description">Discover more about what&#x2019;s new at AWS with Amazon SageMaker AI launches optimized generative AI inference recommendations</div><div class="kg-bookmark-metadata"><img class="kg-bookmark-icon" src="https://a0.awsstatic.com/libra-css/images/site/touch-icon-ipad-144-smile.png" alt="HPC Cloud Updates WE 26 Apr 2026"><span class="kg-bookmark-author">Amazon Web Services, Inc.</span></div></div></a></figure><p>New Regional Instances:&#xA0;<a href="https://aws.amazon.com/about-aws/whats-new/2026/02/amazon-ec2-c8i-flex-instances-europe-ireland-europe-london-asia-pacific-new-zealand-regions/?ref=cloudhpc.news" rel="noreferrer">C8i-flex now in Ireland, London and New Zealand</a>, <a href="https://aws.amazon.com/about-aws/whats-new/2026/02/amazon-ec2-c8i-instances-europe-ireland-asia-pacific-new-zealand-regions/?ref=cloudhpc.news" rel="noreferrer">C8i in Ireland and New Zealand</a>, <a href="https://aws.amazon.com/about-aws/whats-new/2026/04/ec2-g7e-instances-local-zones/?ref=cloudhpc.news" rel="noreferrer">G7e in Los Angeles</a></p><hr><h2 id="azure">Azure</h2><p>If you&#x2019;re using NetApp for your HPC file system you can now scan for ransomware to slow down your compute!</p><figure class="kg-card kg-bookmark-card"><a class="kg-bookmark-container" href="https://azure.microsoft.com/en-gb/updates/?id=560188&amp;ref=cloudhpc.news"><div class="kg-bookmark-content"><div class="kg-bookmark-title">Azure updates | Microsoft Azure</div><div class="kg-bookmark-description">Subscribe to Microsoft Azure today for service updates, all in one place. Check out the new Cloud Platform roadmap to see our latest product plans.</div><div class="kg-bookmark-metadata"><img class="kg-bookmark-icon" src="https://azure.microsoft.com/favicon.ico?v2" alt="HPC Cloud Updates WE 26 Apr 2026"><span class="kg-bookmark-author">Microsoft Azure</span></div></div><div class="kg-bookmark-thumbnail"><img src="https://cdn-dynmedia-1.microsoft.com/is/image/microsoftcorp/ACOM-Roadmap-hero?resMode=sharp2&amp;op_usm=1.5,0.65,15,0&amp;wid=1600&amp;hei=635&amp;qlt=85&amp;fit=constrain" alt="HPC Cloud Updates WE 26 Apr 2026"></div></a></figure><hr><h2 id="google-cloud">Google Cloud</h2><p>8<sup>th</sup>&#xA0;Gen TPUs for your AI workloads</p><figure class="kg-card kg-bookmark-card"><a class="kg-bookmark-container" href="https://cloud.google.com/blog/products/compute/tpu-8t-and-tpu-8i-technical-deep-dive?ref=cloudhpc.news"><div class="kg-bookmark-content"><div class="kg-bookmark-title">TPU 8t and TPU 8i technical deep dive | Google Cloud Blog</div><div class="kg-bookmark-description">The 8th generation TPUs are engineered with system-level co-design to accelerate the AI lifecycle. TPU 8t is built for frontier-model training and TPU 8i is built for large-scale inference and reinforcement learning.</div><div class="kg-bookmark-metadata"><img class="kg-bookmark-icon" src="https://www.gstatic.com/cloud/images/icons/favicon.ico" alt="HPC Cloud Updates WE 26 Apr 2026"><span class="kg-bookmark-author">Google Cloud</span></div></div><div class="kg-bookmark-thumbnail"><img src="https://storage.googleapis.com/gweb-cloudblog-publish/images/eighth-generation_TPU.max-2000x2000.jpg" alt="HPC Cloud Updates WE 26 Apr 2026"></div></a></figure><p>Details on scaling compute (CPU and GPU)</p><figure class="kg-card kg-bookmark-card"><a class="kg-bookmark-container" href="https://cloud.google.com/blog/products/compute/whats-new-in-compute-at-next26?ref=cloudhpc.news"><div class="kg-bookmark-content"><div class="kg-bookmark-title">What&#x2019;s new in compute at Next &#x2019;26 | Google Cloud Blog</div><div class="kg-bookmark-description">Learn how to scale web servers, databases, and enterprise applications to fuel your AI agents with the latest offers on Google Compute Engine.</div><div class="kg-bookmark-metadata"><img class="kg-bookmark-icon" src="https://www.gstatic.com/cloud/images/icons/favicon.ico" alt="HPC Cloud Updates WE 26 Apr 2026"><span class="kg-bookmark-author">Google Cloud</span></div></div><div class="kg-bookmark-thumbnail"><img src="https://storage.googleapis.com/gweb-cloudblog-publish/images/GCN26_102_BlogHeader_2436x1200_Opt_12_Dark.max-2500x2500.jpg" alt="HPC Cloud Updates WE 26 Apr 2026"></div></a></figure><p>Google&#x2019;s tips on cross cloud</p><figure class="kg-card kg-bookmark-card"><a class="kg-bookmark-container" href="https://cloud.google.com/blog/products/compute/cross-cloud-infrastructure-at-next26?ref=cloudhpc.news"><div class="kg-bookmark-content"><div class="kg-bookmark-title">Cross-cloud infrastructure at Next &#x2019;26 | Google Cloud Blog</div><div class="kg-bookmark-description">Next &#x2019;26 cross-cloud infrastructure enhancements in four areas: fluid compute, connectivity, a unified data layer, and digital sovereignty.</div><div class="kg-bookmark-metadata"><img class="kg-bookmark-icon" src="https://www.gstatic.com/cloud/images/icons/favicon.ico" alt="HPC Cloud Updates WE 26 Apr 2026"><span class="kg-bookmark-author">Google Cloud</span></div></div><div class="kg-bookmark-thumbnail"><img src="https://storage.googleapis.com/gweb-cloudblog-publish/images/GCN26_102_BlogHeader_2436x1200_Opt_4_Light.max-2500x2500.jpg" alt="HPC Cloud Updates WE 26 Apr 2026"></div></a></figure><p>A recap of everything from Cloud Next. I had a scan through and there are a few that are HPC relevant but not new from last week.</p><figure class="kg-card kg-bookmark-card"><a class="kg-bookmark-container" href="https://cloud.google.com/blog/topics/google-cloud-next/google-cloud-next-2026-wrap-up?ref=cloudhpc.news"><div class="kg-bookmark-content"><div class="kg-bookmark-title">Google Cloud Next 2026 Wrap Up | Google Cloud Blog</div><div class="kg-bookmark-description">A whirlwind recap of Google Cloud Next 26, including a synopsis of over 250 product, customer and ecosystem announcements.</div><div class="kg-bookmark-metadata"><img class="kg-bookmark-icon" src="https://www.gstatic.com/cloud/images/icons/favicon.ico" alt="HPC Cloud Updates WE 26 Apr 2026"><span class="kg-bookmark-author">Google Cloud</span></div></div><div class="kg-bookmark-thumbnail"><img src="https://storage.googleapis.com/gweb-cloudblog-publish/images/GCNEXT2026_0422_090309-3826_ALIVE.max-2600x2600.jpg" alt="HPC Cloud Updates WE 26 Apr 2026"></div></a></figure>]]></content:encoded></item><item><title><![CDATA[Testing for Numerical Stability and Can AI Help Fix It]]></title><description><![CDATA[Can LLMs fix numerical stability problems in your code. Let’s design a little experiment to find out and how much would you just like a way to just validate your numerical stability across CPU generations and architectures.]]></description><link>https://cloudhpc.news/testing-for-numerical-stability-and-can-ai-help-fix-it/</link><guid isPermaLink="false">69eb0bfd1b3880bdc6b82212</guid><category><![CDATA[Random Bytes]]></category><category><![CDATA[AI]]></category><category><![CDATA[Financial Services]]></category><dc:creator><![CDATA[Hamza]]></dc:creator><pubDate>Fri, 24 Apr 2026 07:00:54 GMT</pubDate><media:content url="https://cloudhpc.news/content/images/2026/04/numerical-stablity-scan.jpg" medium="image"/><content:encoded><![CDATA[<img src="https://cloudhpc.news/content/images/2026/04/numerical-stablity-scan.jpg" alt="Testing for Numerical Stability and Can AI Help Fix It"><p>I think I&#x2019;m bored with trying to vibe code a HPC scheduler. I do have another, probably more useful and potentially more feasible use case for your little AI friend though. Numerical Instability.</p><p>I don&#x2019;t think it&#x2019;s possibly to specify in sufficient detail the test cases (even if you vibe code the tests themselves) when a system becomes as complex as the HPC compute and data scheduling system I was attempting. Without those test cases it&#x2019;s impossible to have the validation step in the AI generation loop. Without that validation step you&#x2019;ll never have output that works. Plus, I just&#xA0;<strong>keep&#xA0;</strong>running out of tokens!&#xA0;</p><p>Skill issue? Maybe. Mythos required? We&#x2019;ll see. I don&#x2019;t think I&#x2019;m on Anthropic&#x2019;s special list though. Oh yea, all I need to do is change the model name in the API call anyway &#x1F923;</p><p>I haven&#x2019;t totally given up; I think I will come back to this when I have a little more time and perhaps money to blow on the experiment. Till then, I have something that might actually work.</p><p>If your little LLM pet can scan your code for security vulnerabilities, I wonder if it can also scan your code for numerical stability issues.</p><p>I&#x2019;ve created a test case using QuantLib to price a digital barrier equity option using a Monte Carlo engine. It&#x2019;s stable across multiple runs on the same machine (i.e. MC stable) but it does give me small but significant differences when run across different CPU architectures.</p><p>The next step is to create that all important automated verification. I have a couple of choices here and I&#x2019;d like a little input from you. I could just have a couple of machines that I let run Claude run wild on with SSH access and see what it does and just compare numbers coming out the end. Cheap to do. Not very useful once it&#x2019;s done. Or I could take the engine behind FLOPx that runs all our cloud benchmarks and change it every so slightly so that it can it runs a numerical stability test instead. That would give me a reusable, way to plug in any code to be run across multiple cloud VMs and then validate the answers from each are the same.</p><p>In the same way that we can now confidently and consistently benchmark cloud VMs at scale, this would allow us to validate numerical stability across multiple CPU generations and architectures. For the cost of running a few spot VMs.</p><p>In either case, I can then create a harness that attempts to find (and fix) the numerical stability problem &#x2013; perhaps I can create a skill that allows it to use our Intel PIN tool extensions to help &#x2013; using the aforementioned validation in a Karpathy style auto research loop and let it get on with it while I sleep. Might still need mountain of tokens though.</p><p>My question to you is: Do you care about having a numerical stability verification system? Or shall just write a bash script comparing two numbers and let Claude run rampant on two or three sandboxes.</p>]]></content:encoded></item><item><title><![CDATA[FINOS ORB: One Cloud API to Rule Them All?]]></title><description><![CDATA[FINOS announced the Open Resource Broker yesterday, an adaptive layer to allow HPC schedulers to control cloud capacity designed to work across multiple clouds and schedulers. We’re all on in and are helping to create it.]]></description><link>https://cloudhpc.news/finos-orb-one-cloud-api-to-rule-them-all/</link><guid isPermaLink="false">69e863501b3880bdc6b821a2</guid><category><![CDATA[Random Bytes]]></category><category><![CDATA[HPC]]></category><category><![CDATA[Financial Services]]></category><dc:creator><![CDATA[Hamza]]></dc:creator><pubDate>Wed, 22 Apr 2026 07:00:46 GMT</pubDate><media:content url="https://cloudhpc.news/content/images/2026/04/ORB.jpg" medium="image"/><content:encoded><![CDATA[<img src="https://cloudhpc.news/content/images/2026/04/ORB.jpg" alt="FINOS ORB: One Cloud API to Rule Them All?"><p>Ever wanted a single unified interface to all the clouds? One API to rule them all? Yea me too. We just got a small step closer to that being reality.</p><p>Yesterday FINOS (part of the Linux Foundation, focused on financial services) <a href="https://www.finos.org/press/banks-and-tech-providers-collaborate-on-open-fabric-for-high-performance-computing?utm_campaign=Press%20Release%20MAIN&amp;utm_content=376045894&amp;utm_medium=social&amp;utm_source=linkedin&amp;hss_channel=lcp-18473937" rel="noreferrer">announced the Open Resource Broker</a>. An adapter designed to allow HPC schedulers (or really anything) to provision compute capacity on clouds. It has been contributed jointly by Morgan Stanley and AWS, thank you kindly both!</p><p>Before you roll over and conclude that this is just another AWS ploy to get you to use their cloud or even that this is only for Wall Street types, I want to say something that I hope will change your mind.</p><p>HMx Labs are all in on this too. In fact, we&#x2019;ve already written most of the code needed so that ORB will not only work with AWS but Azure and Google Cloud too. From near enough day one.&#xA0;</p><p>I also hope I&#x2019;ve built enough credibility with you dear reader that despite having a finance background you&#x2019;ll believe me when I say that there is nothing in here that prevents anyone, anywhere from using it with whichever HPC scheduler you want. Or None. Just ask for cloud capacity without a scheduler if you like.</p><p>This is the internet, so I know you&#x2019;re going to ask for receipts. Give me shout. I&#x2019;m happy to talk.</p><figure class="kg-card kg-bookmark-card"><a class="kg-bookmark-container" href="https://www.finos.org/press/banks-and-tech-providers-collaborate-on-open-fabric-for-high-performance-computing?utm_campaign=Press%20Release%20MAIN&amp;utm_content=376045894&amp;utm_medium=social&amp;utm_source=linkedin&amp;hss_channel=lcp-18473937"><div class="kg-bookmark-content"><div class="kg-bookmark-title">Major Banks, Cloud Providers, and Technology Leaders Announce Unified Open Source Fabric for Next-generation High-performance Computing (HPC) to Enable Faster Decisions and Breakthrough Efficiency</div><div class="kg-bookmark-description">The FINOS community, including members Citi, Morgan Stanley, RBC, AWS, and Oracle, is advancing open HPC initiatives that deliver faster, smarter, more accessible, and drastically more efficient compute technologies for the financial services industry, through a unified, hybrid, and AI-ready fabric.</div><div class="kg-bookmark-metadata"><img class="kg-bookmark-icon" src="https://www.finos.org/hubfs/FINOS/finos-logo/favicon.ico" alt="FINOS ORB: One Cloud API to Rule Them All?"><span class="kg-bookmark-author">FINOS</span><span class="kg-bookmark-publisher">FINOS Team</span></div></div><div class="kg-bookmark-thumbnail"><img src="https://www.finos.org/hubfs/Press%20Releases/Major%20banks%2c%20cloud%20providers%2c%20and%20technology%20leaders%20announce%20unified%20open%20source%20fabric%20for%20next-generation%20high-performance%20computing%20(HPC)%20to%20enable%20faster%20decisions%20and%20breakthrough%20efficiency.png" alt="FINOS ORB: One Cloud API to Rule Them All?"></div></a></figure>]]></content:encoded></item><item><title><![CDATA[Quantum #108]]></title><description><![CDATA[Issue #108 of the weekly HPC newsletter by HMx Labs. Slurm is still in the limelight with US Gov finally wondering Nvidia’s acquisition will affect national supercomputers. Data for AI might need another look beyond just scaping the internet and sovereign is still a thing it seems.]]></description><link>https://cloudhpc.news/quantum-108/</link><guid isPermaLink="false">69e50af01b3880bdc6b82106</guid><category><![CDATA[Quantum Weekly]]></category><dc:creator><![CDATA[Hamza]]></dc:creator><pubDate>Mon, 20 Apr 2026 09:00:03 GMT</pubDate><media:content url="https://cloudhpc.news/content/images/2026/04/quantum-108.jpg" medium="image"/><content:encoded><![CDATA[<img src="https://cloudhpc.news/content/images/2026/04/quantum-108.jpg" alt="Quantum #108"><p>It appears the US Government suddenly woke up about 4 months late and realised that Nvidia drinking all the Slurm might affect their national supercomputers. Senator Warren has written to the DOE about his but might not get the alarmed response she&#x2019;s hoping for given that El-Capitan and most of the other largest supercomputers under the DOE now run on Flux Framework. Just as well really as they&#x2019;re also based on AMD silicon. Coincidentally (or maybe not!?) Todd Gamblin from the Lawrence Livermore National Laboratory was kind enough to speak at the Google Cloud Advanced Computing Community last week too.</p><p>A lot of the news last week was around using AI to accelerate quantum computing. Nothing like powering your AI hype train with some quantum fuel right? And I think I&#x2019;ll stop there before I say something that upsets people.</p><p>The slightly more interesting bit of news that caught my eye though was a piece from HPC Wire on data lakes for scientific data. The idea being to retain a lot more experimental output data than was traditionally ever done so that it can be used in AI models. I think the idea is actually broader than just the scientific community. In a world where potentially almost anything could form useful training data, retaining, categorising and cataloguing that data is suddenly useful. Important even.</p><p>This applies to so much that we traditionally ignored. Multiple categories of enterprise data, intermediate calculation results that were often used for debugging or compute optimisation but never kept long term (I&#x2019;m thinking financial risk analytics here). Even telemetry data from production systems.</p><p>I think this is a space that hasn&#x2019;t really been explored. HPC and AI data planes are pretty dumb still. They might be bigger and faster but they&#x2019;re still not really any smarter. Getting the right data, from the right, authoritative source, from the nearest possible location. Yea that&#x2019;s still not happening. Not even close. Agentic access to data is going to be pretty meaningless if the human still has to figure out where to get the data from. That&#x2019;s 80% of the problem most of the time!&#xA0;</p><p>Lastly the idea of sovereign HPC (funny how much more normal it seems when you don&#x2019;t call it AI) hasn&#x2019;t really gone anywhere. Both Canada and Euro HPC made a bit of noise and advances in this space last week.</p><hr><h2 id="in-the-news">In The News</h2><p>Updates from the big three clouds on all things HPC.</p><figure class="kg-card kg-bookmark-card"><a class="kg-bookmark-container" href="https://cloudhpc.news/hpc-cloud-updates-we-19-apr-2026/"><div class="kg-bookmark-content"><div class="kg-bookmark-title">HPC Cloud Updates WE 19 Apr 2026</div><div class="kg-bookmark-description">Updates to AWS, Azure &amp; GCP in the last week relevant for HPC practitioners. Todd Gamblin of LLNL talks AI integration with the folk from Google. AWS embraces multi cloud. Azure lets you bring your own keys.</div><div class="kg-bookmark-metadata"><img class="kg-bookmark-icon" src="https://cloudhpc.news/favicon.ico" alt="Quantum #108"><span class="kg-bookmark-author">Flux by HMx Labs</span><span class="kg-bookmark-publisher">Hamza</span></div></div><div class="kg-bookmark-thumbnail"><img src="https://cloudhpc.news/content/images/2026/04/CloudHPCNews-2026-04-19.jpg" alt="Quantum #108"></div></a></figure><p>Did US Gov just notice Nvidia drinking all the Slurm?</p><p><a href="https://www.reuters.com/technology/us-senator-warren-voices-concern-over-nvidias-acquisition-slurm-2026-04-15/?ref=cloudhpc.news">https://www.reuters.com/technology/us-senator-warren-voices-concern-over-nvidias-acquisition-slurm-2026-04-15/</a></p><p>What&#x2019;s the best thing to do to keep the AI hype train running, fuel it with quantum of course</p><figure class="kg-card kg-bookmark-card"><a class="kg-bookmark-container" href="https://www.nextplatform.com/compute/2026/04/14/nvidia-brings-the-power-of-open-source-ai-models-to-quantum-computing/5217571?ref=cloudhpc.news"><div class="kg-bookmark-content"><div class="kg-bookmark-title">Nvidia Brings The Power Of Open Source AI Models To Quantum Computing</div><div class="kg-bookmark-description">For quantum
computing to reach the point where it is fault-tolerant, scalable, and
commercially viab&#x2026;</div><div class="kg-bookmark-metadata"><img class="kg-bookmark-icon" src="https://www.nextplatform.com/view-resources/dachser2/public/nextplatform/favicons/apple-touch-icon.png" alt="Quantum #108"><span class="kg-bookmark-author">nextplatform</span><span class="kg-bookmark-publisher">Jeff Burt</span></div></div><div class="kg-bookmark-thumbnail"><img src="https://image.nextplatform.com/5217573.jpg?imageId=5217573&amp;x=0&amp;y=0&amp;cropw=100&amp;croph=100&amp;panox=0&amp;panoy=0&amp;panow=100&amp;panoh=100&amp;width=1200&amp;height=683" alt="Quantum #108"></div></a></figure><p>What does data mean in an AI age? How much more do we need to keep? How much more meta data do we need for it to make sense? And how do we catalogue, categorise and manage all of it?</p><p><a href="https://www.hpcwire.com/2026/04/17/the-rise-of-experimental-data-lakes/?ref=cloudhpc.news">https://www.hpcwire.com/2026/04/17/the-rise-of-experimental-data-lakes/</a></p><p>Sovereign compute capacity hasn&#x2019;t gone away</p><p><a href="https://www.hpcwire.com/aiwire/2026/04/16/canada-opens-applications-for-ai-supercomputing-infrastructure-program/?ref=cloudhpc.news">https://www.hpcwire.com/aiwire/2026/04/16/canada-opens-applications-for-ai-supercomputing-infrastructure-program/</a></p><p><a href="https://www.hpcwire.com/2026/04/17/the-rise-of-experimental-data-lakes/?ref=cloudhpc.news">https://www.hpcwire.com/2026/04/17/the-rise-of-experimental-data-lakes/</a></p><hr><h1 id="from-hmx-labs">From HMx Labs</h1><p>I&#x2019;ve been a little too busy to write anything interesting and am also trying to focus on a couple of longer articles including on using AI within HPC and financial services.&#xA0;</p><p>Here&#x2019;s a little amusement to keep you occupied instead</p><figure class="kg-card kg-image-card"><img src="https://cloudhpc.news/content/images/2026/04/HPC-I-am-your-father.jpg" class="kg-image" alt="Quantum #108" loading="lazy" width="1000" height="816" srcset="https://cloudhpc.news/content/images/size/w600/2026/04/HPC-I-am-your-father.jpg 600w, https://cloudhpc.news/content/images/2026/04/HPC-I-am-your-father.jpg 1000w" sizes="(min-width: 720px) 720px"></figure><hr><p>Know someone else who might like to read this newsletter? Forward this on to them or even better, ask them to sign up here:&#xA0;<a href="https://cloudhpc.news/">https://cloudhpc.news</a></p>]]></content:encoded></item><item><title><![CDATA[HPC Cloud Updates WE 19 Apr 2026]]></title><description><![CDATA[Updates to AWS, Azure & GCP in the last week relevant for HPC practitioners. Todd Gamblin of LLNL talks AI integration with the folk from Google. AWS embraces multi cloud. Azure lets you bring your own keys.]]></description><link>https://cloudhpc.news/hpc-cloud-updates-we-19-apr-2026/</link><guid isPermaLink="false">69e4f51d1b3880bdc6b820ed</guid><category><![CDATA[HPC Cloud Release Notes]]></category><dc:creator><![CDATA[Hamza]]></dc:creator><pubDate>Sun, 19 Apr 2026 15:34:08 GMT</pubDate><media:content url="https://cloudhpc.news/content/images/2026/04/CloudHPCNews-2026-04-19.jpg" medium="image"/><content:encoded><![CDATA[<h2 id="aws">AWS</h2><img src="https://cloudhpc.news/content/images/2026/04/CloudHPCNews-2026-04-19.jpg" alt="HPC Cloud Updates WE 19 Apr 2026"><p>Want multi cloud HPC? One step closer with AWS Interconnect now GA</p><figure class="kg-card kg-bookmark-card"><a class="kg-bookmark-container" href="https://aws.amazon.com/blogs/aws/aws-interconnect-is-now-generally-available-with-a-new-option-to-simplify-last-mile-connectivity/?ref=cloudhpc.news"><div class="kg-bookmark-content"><div class="kg-bookmark-title">AWS Interconnect is now generally available, with a new option to simplify last-mile connectivity | Amazon Web Services</div><div class="kg-bookmark-description">Today, we&#x2019;re announcing the general availability of AWS Interconnect &#x2013; multicloud, a managed private connectivity service that connects your Amazon Virtual Private Cloud (Amazon VPC) directly to VPCs on other cloud providers. We&#x2019;re also introducing AWS Interconnect &#x2013; last mile, a new capability that simplifies how you establish high-speed, private connections to AWS from your [&#x2026;]</div><div class="kg-bookmark-metadata"><img class="kg-bookmark-icon" src="https://a0.awsstatic.com/main/images/site/touch-icon-ipad-144-smile.png" alt="HPC Cloud Updates WE 19 Apr 2026"><span class="kg-bookmark-author">Amazon Web Services</span><span class="kg-bookmark-publisher">S&#xE9;bastien Stormacq</span></div></div><div class="kg-bookmark-thumbnail"><img src="https://d2908q01vomqb2.cloudfront.net/da4b9237bacccdf19c0760cab7aec4a8359010b0/2026/04/14/Option-2-2.png" alt="HPC Cloud Updates WE 19 Apr 2026"></div></a></figure><p>New network and block storage optimised cousins of the C8i are now available</p><figure class="kg-card kg-bookmark-card"><a class="kg-bookmark-container" href="https://aws.amazon.com/about-aws/whats-new/2026/04/amazon-ec2-c8in-c8ib-instances-ga/?ref=cloudhpc.news"><div class="kg-bookmark-content"><div class="kg-bookmark-title">Introducing Amazon EC2 C8in and C8ib instances - AWS</div><div class="kg-bookmark-description">Discover more about what&#x2019;s new at AWS with Introducing Amazon EC2 C8in and C8ib instances</div><div class="kg-bookmark-metadata"><img class="kg-bookmark-icon" src="https://a0.awsstatic.com/libra-css/images/site/touch-icon-ipad-144-smile.png" alt="HPC Cloud Updates WE 19 Apr 2026"><span class="kg-bookmark-author">Amazon Web Services, Inc.</span></div></div></a></figure><p>Of course if you&#x2019;re going to get that, then the Graviton versions need a performance boost to keep you interested</p><figure class="kg-card kg-bookmark-card"><a class="kg-bookmark-container" href="https://aws.amazon.com/about-aws/whats-new/2026/04/ec2-c8gn-m8gn-r8gn-ebs/?ref=cloudhpc.news"><div class="kg-bookmark-content"><div class="kg-bookmark-title">Amazon EC2 C8gn, M8gn, and R8gn instances now support higher Amazon EBS-optimized performance - AWS</div><div class="kg-bookmark-description">Discover more about what&#x2019;s new at AWS with Amazon EC2 C8gn, M8gn, and R8gn instances now support higher Amazon EBS-optimized performance</div><div class="kg-bookmark-metadata"><img class="kg-bookmark-icon" src="https://a0.awsstatic.com/libra-css/images/site/touch-icon-ipad-144-smile.png" alt="HPC Cloud Updates WE 19 Apr 2026"><span class="kg-bookmark-author">Amazon Web Services, Inc.</span></div></div></a></figure><hr><h2 id="azure">Azure</h2><p>Users filling up their NetApp shares again? At least now you know who to blame</p><figure class="kg-card kg-bookmark-card"><a class="kg-bookmark-container" href="https://azure.microsoft.com/en-gb/updates/?id=558483&amp;ref=cloudhpc.news"><div class="kg-bookmark-content"><div class="kg-bookmark-title">Azure updates | Microsoft Azure</div><div class="kg-bookmark-description">Subscribe to Microsoft Azure today for service updates, all in one place. Check out the new Cloud Platform roadmap to see our latest product plans.</div><div class="kg-bookmark-metadata"><img class="kg-bookmark-icon" src="https://azure.microsoft.com/favicon.ico?v2" alt="HPC Cloud Updates WE 19 Apr 2026"><span class="kg-bookmark-author">Microsoft Azure</span></div></div><div class="kg-bookmark-thumbnail"><img src="https://cdn-dynmedia-1.microsoft.com/is/image/microsoftcorp/ACOM-Roadmap-hero?resMode=sharp2&amp;op_usm=1.5,0.65,15,0&amp;wid=1600&amp;hei=635&amp;qlt=85&amp;fit=constrain" alt="HPC Cloud Updates WE 19 Apr 2026"></div></a></figure><p>HBv2 is oooold. You really shouldn&#x2019;t be using it anymore and know Azure won&#x2019;t let you include them in Azure Batch Pools either</p><figure class="kg-card kg-bookmark-card"><a class="kg-bookmark-container" href="https://azure.microsoft.com/en-gb/updates/?id=559751&amp;ref=cloudhpc.news"><div class="kg-bookmark-content"><div class="kg-bookmark-title">Azure updates | Microsoft Azure</div><div class="kg-bookmark-description">Subscribe to Microsoft Azure today for service updates, all in one place. Check out the new Cloud Platform roadmap to see our latest product plans.</div><div class="kg-bookmark-metadata"><img class="kg-bookmark-icon" src="https://azure.microsoft.com/favicon.ico?v2" alt="HPC Cloud Updates WE 19 Apr 2026"><span class="kg-bookmark-author">Microsoft Azure</span></div></div><div class="kg-bookmark-thumbnail"><img src="https://cdn-dynmedia-1.microsoft.com/is/image/microsoftcorp/ACOM-Roadmap-hero?resMode=sharp2&amp;op_usm=1.5,0.65,15,0&amp;wid=1600&amp;hei=635&amp;qlt=85&amp;fit=constrain" alt="HPC Cloud Updates WE 19 Apr 2026"></div></a></figure><p>Bring your own key now works for Premium SSD v2 and Ultra Disks</p><figure class="kg-card kg-bookmark-card"><a class="kg-bookmark-container" href="https://azure.microsoft.com/en-gb/updates/?ref=cloudhpc.news"><div class="kg-bookmark-content"><div class="kg-bookmark-title">Azure updates | Microsoft Azure</div><div class="kg-bookmark-description">Subscribe to Microsoft Azure today for service updates, all in one place. Check out the new Cloud Platform roadmap to see our latest product plans.</div><div class="kg-bookmark-metadata"><img class="kg-bookmark-icon" src="https://azure.microsoft.com/favicon.ico?v2" alt="HPC Cloud Updates WE 19 Apr 2026"><span class="kg-bookmark-author">Microsoft Azure</span></div></div><div class="kg-bookmark-thumbnail"><img src="https://cdn-dynmedia-1.microsoft.com/is/image/microsoftcorp/ACOM-Roadmap-hero?resMode=sharp2&amp;op_usm=1.5,0.65,15,0&amp;wid=1600&amp;hei=635&amp;qlt=85&amp;fit=constrain" alt="HPC Cloud Updates WE 19 Apr 2026"></div></a></figure><p>If you&#x2019;re trying to burst to cloud and using Azure Files then Azure File Sync might be handy and is available in a few more places now</p><figure class="kg-card kg-bookmark-card"><a class="kg-bookmark-container" href="https://azure.microsoft.com/en-gb/updates/?id=557828&amp;ref=cloudhpc.news"><div class="kg-bookmark-content"><div class="kg-bookmark-title">Azure updates | Microsoft Azure</div><div class="kg-bookmark-description">Subscribe to Microsoft Azure today for service updates, all in one place. Check out the new Cloud Platform roadmap to see our latest product plans.</div><div class="kg-bookmark-metadata"><img class="kg-bookmark-icon" src="https://azure.microsoft.com/favicon.ico?v2" alt="HPC Cloud Updates WE 19 Apr 2026"><span class="kg-bookmark-author">Microsoft Azure</span></div></div><div class="kg-bookmark-thumbnail"><img src="https://cdn-dynmedia-1.microsoft.com/is/image/microsoftcorp/ACOM-Roadmap-hero?resMode=sharp2&amp;op_usm=1.5,0.65,15,0&amp;wid=1600&amp;hei=635&amp;qlt=85&amp;fit=constrain" alt="HPC Cloud Updates WE 19 Apr 2026"></div></a></figure><hr><h2 id="google-cloud">Google Cloud</h2><p>Not much by way of updates but the Google Cloud Advanced Computing Community did manage to get Todd Gamblin (LLNL) over to talk about hardware diversification, AI integration and converged computing.</p><figure class="kg-card kg-embed-card"><iframe width="200" height="113" src="https://www.youtube.com/embed/0DirRTrevMM?list=PLr9wQzdybOFk3f_hrjOYlL73r2s0Q8xDy" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share" referrerpolicy="strict-origin-when-cross-origin" allowfullscreen></iframe></figure>]]></content:encoded></item><item><title><![CDATA[Quantum #107]]></title><description><![CDATA[Issue #107 of the weekly HPC newsletter by HMx Labs. Slurm got a bit of attention this week from Nvidia and the analysts but I think the bigger picture here is being missed. AWS gave us another way to abuse S3 and the DOE should probably talk to CERN.]]></description><link>https://cloudhpc.news/quantum-107/</link><guid isPermaLink="false">69dbfe831b3880bdc6b82010</guid><category><![CDATA[Quantum Weekly]]></category><dc:creator><![CDATA[Hamza]]></dc:creator><pubDate>Mon, 13 Apr 2026 09:00:32 GMT</pubDate><media:content url="https://cloudhpc.news/content/images/2026/04/quantum-107.jpg" medium="image"/><content:encoded><![CDATA[<img src="https://cloudhpc.news/content/images/2026/04/quantum-107.jpg" alt="Quantum #107"><p>Nvidia dropped a technical paper last week delving into topology aware scheduling on rack scale GPU clusters. Subsequent to their acquisition of Slurm, which now seems truly integrated into Mission Control, I think this is the first in depth guide from Nvidia on how to do this. A sign of further investment to come into the Slurm ecosystem?</p><p>It came though, in the same week as a piece from Reuters, contributed to by Intersect360 on the hazards of Nvidia&#x2019;s Slurm acquisition. Aside from highlighting that the concerns around this have not dissipated I don&#x2019;t think it really said anything new or wasn&#x2019;t said back in December last year at the time of the acquisition. In the intervening time though something else substantial has happened and I think worrying about the impact of Nvidia&#x2019;s acquisition of Slurm is almost missing the wood for the trees. We are now in an environment where the use of AI in coding is having a massive impact on open source software and communities around it. Everything from maintainers drowning in AI Slop PRs to AI washing source code (to change its license) to simply more easily forking and maintaining your own versions of popular open source projects. (Not a fork, but extensions like <a href="https://s9s.dev/?ref=cloudhpc.news" rel="noreferrer">S9S</a> are a good example for Slurm).</p><p>While our own work confirms (at least to me) we aren&#x2019;t going to see a vibe coded Slurm replacement any time soon, it also shows that forking Slurm and adapting it to your own specific use cases isn&#x2019;t the challenge it once was either. My concerns around how Slurm evolves under Nvidia&#x2019;s ownership remain, but honestly, I have other bigger concerns around FOSS more generally. <a href="https://scheduler-select.hmxlabs.io/?ref=cloudhpc.news" rel="noreferrer">There also isn&#x2019;t a shortage of alternatives to Slurm</a>.</p><p>In the classical HPC space the US DOE released SYNAPSE-I to handle real time data processing at HPC scale. On reading this I did wonder if they had spoken to CERN who face similar challenges, but I guess with geopolitics being the way they are right now that probably isn&#x2019;t going to happen.</p><p>Our own news from last week: If you haven&#x2019;t already told us, let me know what benchmarks you&#x2019;d like to see in our data to make your infrastructure selection decisions. Also, the piece on LLM inference as Monte Carlo paths seems to have resonated with a few of you and is worth a read if you haven&#x2019;t already.</p><p>Oh, and AWS gave us a real filesystem on top of S3. Given the number of HPC workloads already using S3 I give it all of 4 femtoseconds before the first HPC workload attempts to that too.</p><hr><h2 id="in-the-news">In The News</h2><p>Updates from the big three clouds on all things HPC.</p><figure class="kg-card kg-bookmark-card"><a class="kg-bookmark-container" href="https://cloudhpc.news/hpc-cloud-updates-we-12-apr-2026/"><div class="kg-bookmark-content"><div class="kg-bookmark-title">HPC Cloud Updates WE 12 Apr 2026</div><div class="kg-bookmark-description">Updates to AWS, Azure &amp; GCP in the last week relevant for HPC practitioners. A new Azure region, a filesystem over AWS S3 and running GPUs at scale from Google.</div><div class="kg-bookmark-metadata"><img class="kg-bookmark-icon" src="https://cloudhpc.news/favicon.ico" alt="Quantum #107"><span class="kg-bookmark-author">Flux by HMx Labs</span><span class="kg-bookmark-publisher">Hamza</span></div></div><div class="kg-bookmark-thumbnail"><img src="https://cloudhpc.news/content/images/2026/04/CloudHPCNews-2026-04-12.jpg" alt="Quantum #107"></div></a></figure><figure class="kg-card kg-bookmark-card"><a class="kg-bookmark-container" href="https://developer.nvidia.com/blog/running-ai-workloads-on-rack-scale-supercomputers-from-hardware-to-topology-aware-scheduling/?utm_source=chatgpt.com"><div class="kg-bookmark-content"><div class="kg-bookmark-title">Running AI Workloads on Rack-Scale Supercomputers: From Hardware to Topology-Aware Scheduling | NVIDIA Technical Blog</div><div class="kg-bookmark-description">The NVIDIA GB200 NVL72 and NVIDIA GB300 NVL72 systems, featuring NVIDIA Blackwell architecture, are rack-scale supercomputers. They&#x2019;re designed with 18 tightly coupled compute trays&#x2026;</div><div class="kg-bookmark-metadata"><img class="kg-bookmark-icon" src="https://developer-blogs.nvidia.com/wp-content/themes/nvidia/dist/images/favicon_300a1064.ico" alt="Quantum #107"><span class="kg-bookmark-author">NVIDIA Technical Blog</span><span class="kg-bookmark-publisher">Ryan Prout</span></div></div><div class="kg-bookmark-thumbnail"><img src="https://developer-blogs.nvidia.com/wp-content/uploads/2026/04/gtc25-tech-blog-dgx-gb300-1920x1080-1.webp" alt="Quantum #107"></div></a></figure><p>Reuters and Intersect360 worry about Nvdia&apos;s Slurm acquisition.</p><p><a href="https://www.reuters.com/technology/nvidia-acquisition-schedmd-sparks-worry-among-ai-specialists-about-software-2026-04-06/?ref=cloudhpc.news">https://www.reuters.com/technology/nvidia-acquisition-schedmd-sparks-worry-among-ai-specialists-about-software-2026-04-06/</a></p><p>and our original written at the time:</p><figure class="kg-card kg-bookmark-card"><a class="kg-bookmark-container" href="https://cloudhpc.news/nvidia-drinks-acquires-slurm/"><div class="kg-bookmark-content"><div class="kg-bookmark-title">Nvidia Drinks (Acquires) Slurm</div><div class="kg-bookmark-description">Some thoughts and questions on what Nvidia&#x2019;s acquisition of Slurm means for both AI and traditional HPC.</div><div class="kg-bookmark-metadata"><img class="kg-bookmark-icon" src="https://cloudhpc.news/favicon.ico" alt="Quantum #107"><span class="kg-bookmark-author">Flux by HMx Labs</span><span class="kg-bookmark-publisher">Hamza</span></div></div><div class="kg-bookmark-thumbnail"><img src="https://cloudhpc.news/content/images/2025/12/futurama-nvidia-drinking-slurm.jpg" alt="Quantum #107"></div></a></figure><p>DOE goes real time with data analysis at HPC scale</p><p><a href="https://www.hpcwire.com/2026/04/09/new-real-time-ai-system-closes-the-gap-between-data-and-discovery-at-doe-labs/?ref=cloudhpc.news">https://www.hpcwire.com/2026/04/09/new-real-time-ai-system-closes-the-gap-between-data-and-discovery-at-doe-labs/</a></p><hr><h1 id="from-hmx-labs">From HMx Labs</h1><p>If using AI reliably means gating its output on automated validation, are we just aiming for LLM inference as Monte Carlo paths?</p><figure class="kg-card kg-bookmark-card"><a class="kg-bookmark-container" href="https://cloudhpc.news/is-ai-actually-intelligent-or-just-a-smarter-dice-roll/"><div class="kg-bookmark-content"><div class="kg-bookmark-title">Is AI actually intelligent &#x2014; or just a smarter dice roll?</div><div class="kg-bookmark-description">I first heard the idea of using LLM inference as paths in a Monte Carlo simulation at the QuantMinds conference (November 2025). The same idea has now been suggested by people from the leading AI labs including Andrej Karparthy. Just in not so many words. Last month Andrej Karpathy release</div><div class="kg-bookmark-metadata"><img class="kg-bookmark-icon" src="https://cloudhpc.news/favicon.ico" alt="Quantum #107"><span class="kg-bookmark-author">Flux by HMx Labs</span><span class="kg-bookmark-publisher">Hamza</span></div></div><div class="kg-bookmark-thumbnail"><img src="https://cloudhpc.news/content/images/2026/04/Is-AI-just-a-smart-dice-roll.jpg" alt="Quantum #107"></div></a></figure><p>Want to help drive the direction of FLOPx and make sure we have benchmark data that you care about?&#xA0;</p><figure class="kg-card kg-bookmark-card"><a class="kg-bookmark-container" href="https://cloudhpc.news/hardware-selection-benchmarks/"><div class="kg-bookmark-content"><div class="kg-bookmark-title">Hardware Selection Benchmarks</div><div class="kg-bookmark-description">What benchmarks should we add next to FLOPx?</div><div class="kg-bookmark-metadata"><img class="kg-bookmark-icon" src="https://cloudhpc.news/favicon.ico" alt="Quantum #107"><span class="kg-bookmark-author">Flux by HMx Labs</span><span class="kg-bookmark-publisher">Hamza</span></div></div><div class="kg-bookmark-thumbnail"><img src="https://cloudhpc.news/content/images/2026/04/server-shopping.jpg" alt="Quantum #107"></div></a></figure><hr><p>Know someone else who might like to read this newsletter? Forward this on to them or even better, ask them to sign up here:&#xA0;<a href="https://cloudhpc.news/">https://cloudhpc.news</a></p>]]></content:encoded></item><item><title><![CDATA[HPC Cloud Updates WE 12 Apr 2026]]></title><description><![CDATA[Updates to AWS, Azure & GCP in the last week relevant for HPC practitioners. A new Azure region, a filesystem over AWS S3 and running GPUs at scale from Google.]]></description><link>https://cloudhpc.news/hpc-cloud-updates-we-12-apr-2026/</link><guid isPermaLink="false">69db9dcf1b3880bdc6b81fee</guid><category><![CDATA[HPC Cloud Release Notes]]></category><dc:creator><![CDATA[Hamza]]></dc:creator><pubDate>Sun, 12 Apr 2026 13:33:05 GMT</pubDate><media:content url="https://cloudhpc.news/content/images/2026/04/CloudHPCNews-2026-04-12.jpg" medium="image"/><content:encoded><![CDATA[<h2 id="aws">AWS</h2><img src="https://cloudhpc.news/content/images/2026/04/CloudHPCNews-2026-04-12.jpg" alt="HPC Cloud Updates WE 12 Apr 2026"><p>Perhaps not strictly HPC but I&#x2019;m already aware of people using S3 (and other BLOB storage exposing an S3 API) in HPC workloads that I&#x2019;m fairly sure that won&#x2019;t be long before we see HPC running on top of this too</p><figure class="kg-card kg-bookmark-card"><a class="kg-bookmark-container" href="https://aws.amazon.com/blogs/aws/launching-s3-files-making-s3-buckets-accessible-as-file-systems/?ref=cloudhpc.news"><div class="kg-bookmark-content"><div class="kg-bookmark-title">Launching S3 Files, making S3 buckets accessible as file systems | Amazon Web Services</div><div class="kg-bookmark-description">Amazon S3 Files makes S3 buckets accessible as high-performance file systems on AWS compute resources, eliminating the tradeoff between object storage benefits and interactive file capabilities while enabling seamless data sharing with ~1ms latencies.</div><div class="kg-bookmark-metadata"><img class="kg-bookmark-icon" src="https://a0.awsstatic.com/main/images/site/touch-icon-ipad-144-smile.png" alt="HPC Cloud Updates WE 12 Apr 2026"><span class="kg-bookmark-author">Amazon Web Services</span><span class="kg-bookmark-publisher">S&#xE9;bastien Stormacq</span></div></div><div class="kg-bookmark-thumbnail"><img src="https://d2908q01vomqb2.cloudfront.net/da4b9237bacccdf19c0760cab7aec4a8359010b0/2026/04/07/Screenshot-2026-04-06-at-3.50.49%E2%80%AFPM.png" alt="HPC Cloud Updates WE 12 Apr 2026"></div></a></figure><p>Maybe a little more HPC. NetApp is available in London, Hyderabad and Sao Paolo now</p><figure class="kg-card kg-bookmark-card"><a class="kg-bookmark-container" href="https://aws.amazon.com/about-aws/whats-new/2026/04/second-gen-amazon-fsx-ontap-regions/?ref=cloudhpc.news"><div class="kg-bookmark-content"><div class="kg-bookmark-title">Second-generation Amazon FSx for NetApp ONTAP is now available in four additional AWS commercial and AWS GovCloud (US) Regions - AWS</div><div class="kg-bookmark-description">Discover more about what&#x2019;s new at AWS with Second-generation Amazon FSx for NetApp ONTAP is now available in four additional AWS commercial and AWS GovCloud (US) Regions</div><div class="kg-bookmark-metadata"><img class="kg-bookmark-icon" src="https://a0.awsstatic.com/libra-css/images/site/touch-icon-ipad-144-smile.png" alt="HPC Cloud Updates WE 12 Apr 2026"><span class="kg-bookmark-author">Amazon Web Services, Inc.</span></div></div></a></figure><p>This might help allocate costs to your HPC clients if you&#x2019;re running a platform with multiple users</p><figure class="kg-card kg-bookmark-card"><a class="kg-bookmark-container" href="https://aws.amazon.com/about-aws/whats-new/2026/04/ec2-capacity-manager-tag-based-dimensions/?ref=cloudhpc.news"><div class="kg-bookmark-content"><div class="kg-bookmark-title">Amazon EC2 Capacity Manager now supports tag-based dimensions - AWS</div><div class="kg-bookmark-description">Discover more about what&#x2019;s new at AWS with Amazon EC2 Capacity Manager now supports tag-based dimensions</div><div class="kg-bookmark-metadata"><img class="kg-bookmark-icon" src="https://a0.awsstatic.com/libra-css/images/site/touch-icon-ipad-144-smile.png" alt="HPC Cloud Updates WE 12 Apr 2026"><span class="kg-bookmark-author">Amazon Web Services, Inc.</span></div></div></a></figure><p>New Regional Instances:&#xA0; <a href="https://aws.amazon.com/about-aws/whats-new/2026/04/amazon-ec2-x8i-instances-CDG-region/?ref=cloudhpc.news" rel="noreferrer">X8i in Paris</a></p><hr><h2 id="azure">Azure</h2><p>New Azure region in Denmark</p><figure class="kg-card kg-bookmark-card"><a class="kg-bookmark-container" href="https://azure.microsoft.com/en-gb/updates/?id=559406&amp;ref=cloudhpc.news"><div class="kg-bookmark-content"><div class="kg-bookmark-title">Azure updates | Microsoft Azure</div><div class="kg-bookmark-description">Subscribe to Microsoft Azure today for service updates, all in one place. Check out the new Cloud Platform roadmap to see our latest product plans.</div><div class="kg-bookmark-metadata"><img class="kg-bookmark-icon" src="https://azure.microsoft.com/favicon.ico?v2" alt="HPC Cloud Updates WE 12 Apr 2026"><span class="kg-bookmark-author">Microsoft Azure</span></div></div><div class="kg-bookmark-thumbnail"><img src="https://cdn-dynmedia-1.microsoft.com/is/image/microsoftcorp/ACOM-Roadmap-hero?resMode=sharp2&amp;op_usm=1.5,0.65,15,0&amp;wid=1600&amp;hei=635&amp;qlt=85&amp;fit=constrain" alt="HPC Cloud Updates WE 12 Apr 2026"></div></a></figure><p>I guess if you&#x2019;re trying to run LLM inference using K8 this could be relevant.&#xA0;</p><figure class="kg-card kg-bookmark-card"><a class="kg-bookmark-container" href="https://azure.microsoft.com/en-gb/updates/?id=559547&amp;ref=cloudhpc.news"><div class="kg-bookmark-content"><div class="kg-bookmark-title">Azure updates | Microsoft Azure</div><div class="kg-bookmark-description">Subscribe to Microsoft Azure today for service updates, all in one place. Check out the new Cloud Platform roadmap to see our latest product plans.</div><div class="kg-bookmark-metadata"><img class="kg-bookmark-icon" src="https://azure.microsoft.com/favicon.ico?v2" alt="HPC Cloud Updates WE 12 Apr 2026"><span class="kg-bookmark-author">Microsoft Azure</span></div></div><div class="kg-bookmark-thumbnail"><img src="https://cdn-dynmedia-1.microsoft.com/is/image/microsoftcorp/ACOM-Roadmap-hero?resMode=sharp2&amp;op_usm=1.5,0.65,15,0&amp;wid=1600&amp;hei=635&amp;qlt=85&amp;fit=constrain" alt="HPC Cloud Updates WE 12 Apr 2026"></div></a></figure><hr><h2 id="google-cloud">Google Cloud</h2><p>Google gave us a guide on building GPU infrastructure</p><figure class="kg-card kg-bookmark-card"><a class="kg-bookmark-container" href="https://cloud.google.com/blog/products/compute/a-guide-to-architecting-reliable-gpu-infrastructure?ref=cloudhpc.news"><div class="kg-bookmark-content"><div class="kg-bookmark-title">A guide to architecting reliable GPU infrastructure | Google Cloud Blog</div><div class="kg-bookmark-description">Understand the core challenges behind reliable GPU-based infrastructure on Google Cloud, and find resources to dive deeper.</div><div class="kg-bookmark-metadata"><img class="kg-bookmark-icon" src="https://www.gstatic.com/cloud/images/icons/favicon.ico" alt="HPC Cloud Updates WE 12 Apr 2026"><span class="kg-bookmark-author">Google Cloud</span></div></div><div class="kg-bookmark-thumbnail"><img src="https://storage.googleapis.com/gweb-cloudblog-publish/images/05_-_Compute.max-2600x2600.jpg" alt="HPC Cloud Updates WE 12 Apr 2026"></div></a></figure><p>but also claims of better carbon efficient of its Ironwood TPUs</p><figure class="kg-card kg-bookmark-card"><a class="kg-bookmark-container" href="https://cloud.google.com/blog/topics/systems/ironwood-tpus-deliver-37x-carbon-efficiency-gains?ref=cloudhpc.news"><div class="kg-bookmark-content"><div class="kg-bookmark-title">Ironwood TPUs deliver 3.7x carbon efficiency gains | Google Cloud Blog</div><div class="kg-bookmark-description">The seventh-generation Ironwood TPU achieved an approximately 3.7x improvement in Compute Carbon Intensity (CCI) compared to TPU v5p architecture.</div><div class="kg-bookmark-metadata"><img class="kg-bookmark-icon" src="https://www.gstatic.com/cloud/images/icons/favicon.ico" alt="HPC Cloud Updates WE 12 Apr 2026"><span class="kg-bookmark-author">Google Cloud</span></div></div><div class="kg-bookmark-thumbnail"><img src="https://storage.googleapis.com/gweb-cloudblog-publish/images/39_-_Systems_ugpuEWX.max-2600x2600.jpg" alt="HPC Cloud Updates WE 12 Apr 2026"></div></a></figure>]]></content:encoded></item><item><title><![CDATA[Hardware Selection Benchmarks]]></title><description><![CDATA[What benchmarks should we add next to FLOPx?]]></description><link>https://cloudhpc.news/hardware-selection-benchmarks/</link><guid isPermaLink="false">69d89c131b3880bdc6b81f78</guid><category><![CDATA[Random Bytes]]></category><category><![CDATA[HPC]]></category><dc:creator><![CDATA[Hamza]]></dc:creator><pubDate>Fri, 10 Apr 2026 07:00:42 GMT</pubDate><media:content url="https://cloudhpc.news/content/images/2026/04/server-shopping.jpg" medium="image"/><content:encoded><![CDATA[<img src="https://cloudhpc.news/content/images/2026/04/server-shopping.jpg" alt="Hardware Selection Benchmarks"><p>What are your go to benchmarks when selecting hardware? Both for GPU and CPU workloads?</p><p>We have a few changes we&#x2019;d still like to make to FLOPx, our benchmark data platform, but I&#x2019;d like to start extending it to include other benchmarks too. To have data that goes beyond what we have used already to serve our existing clients (which is very financial services focused).</p><p>This probably means open source benchmarks but I&#x2019;m open to including proprietary/ closed source benchmarks provided they can be licensed and meet our criteria on transparency and repeatability.</p><p>What else would you like to see? The QuantLib test suite? Something based on OpenFOAM (is there a standardised repeatable benchmark already out there rather than just running one of the samples?).</p><p>I&#x2019;ve been largely thinking of HPC related use cases, but I guess this is more widely applicable? Should we include some basics like CoreMark?</p><p>Friday afternoon musings on a post card please &#x1F604;</p>]]></content:encoded></item><item><title><![CDATA[Is AI actually intelligent — or just a smarter dice roll?]]></title><description><![CDATA[<p>I first heard the idea of using LLM inference as paths in a Monte Carlo simulation at the QuantMinds conference (November 2025). The same idea has now been suggested by people from the leading AI labs including Andrej Karparthy. Just in not so many words.</p><p>Last month Andrej Karpathy release</p>]]></description><link>https://cloudhpc.news/is-ai-actually-intelligent-or-just-a-smarter-dice-roll/</link><guid isPermaLink="false">69d5f6af1b3880bdc6b81f12</guid><dc:creator><![CDATA[Hamza]]></dc:creator><pubDate>Wed, 08 Apr 2026 07:00:40 GMT</pubDate><media:content url="https://cloudhpc.news/content/images/2026/04/Is-AI-just-a-smart-dice-roll.jpg" medium="image"/><content:encoded><![CDATA[<img src="https://cloudhpc.news/content/images/2026/04/Is-AI-just-a-smart-dice-roll.jpg" alt="Is AI actually intelligent &#x2014; or just a smarter dice roll?"><p>I first heard the idea of using LLM inference as paths in a Monte Carlo simulation at the QuantMinds conference (November 2025). The same idea has now been suggested by people from the leading AI labs including Andrej Karparthy. Just in not so many words.</p><p>Last month Andrej Karpathy release Auto Research. The idea being promoted is that an LLM, given tight validation criteria, can take a task and iteratively improve on the results and throw away anything that is wrong or produces a worse result. Not long ago I was at a private event with Anthropic presenting on the best ways to use Claude Code. One of the central points (of four) here was the importance of automated validation of the output of Claude Code.</p><p>So far nothing new here. I think those of use that have been using LLMs regularly independently came to the same conclusion too. So, we&#x2019;re all on the same page, right? Almost. There&#x2019;s a subtlety I missed here that I think the quants saw a year ago. I was just dismissive of it because of the cost of inference.</p><p>What no one is saying, is that if you have to gate the output of your &#x201C;AI&#x201D; on a validation step, is your &#x201C;AI&#x201D; actually intelligent? Or do you just have a better random number generator for your Monte Carlo model?</p><p>Let me try illustrating the point, probably poorly, with an example. It is possible to calculate the value of Pi just by defining a circle and generating random numbers. Draw a circle on a square piece of paper and throw 10,000 darts at it. You can now calculate the ratios of the areas (and therefore the value of Pi) by counting the number of darts inside and outside the circle.&#xA0;</p><figure class="kg-card kg-image-card"><img src="https://cloudhpc.news/content/images/2026/04/monte-carlo-pi.png" class="kg-image" alt="Is AI actually intelligent &#x2014; or just a smarter dice roll?" loading="lazy" width="1396" height="1396" srcset="https://cloudhpc.news/content/images/size/w600/2026/04/monte-carlo-pi.png 600w, https://cloudhpc.news/content/images/size/w1000/2026/04/monte-carlo-pi.png 1000w, https://cloudhpc.news/content/images/2026/04/monte-carlo-pi.png 1396w" sizes="(min-width: 720px) 720px"></figure><p>Imagine now if instead of a single circle we had two, concentric circles, one slightly smaller than the other. Like a car tyre. We can adjust the math to calculate Pi based on the number of darts that hit the tyre. The result is actually worse (less accurate) than the first method. But what if now we change the random number generator so it&#x2019;s not random. So, it throws the darts mostly around the tyre. The accuracy improves (for the same number of darts or Monte Carlo paths). I have some half-baked ideas of how this&#xA0;<em>might</em>&#xA0;apply to equity option pricing but I&#x2019;m not a quant so they&#x2019;re probably wrong. Give me a shout if you are a quant though and want to talk.</p><figure class="kg-card kg-image-card"><img src="https://cloudhpc.news/content/images/2026/04/ML-vs-MC-Pi.png" class="kg-image" alt="Is AI actually intelligent &#x2014; or just a smarter dice roll?" loading="lazy" width="2000" height="999" srcset="https://cloudhpc.news/content/images/size/w600/2026/04/ML-vs-MC-Pi.png 600w, https://cloudhpc.news/content/images/size/w1000/2026/04/ML-vs-MC-Pi.png 1000w, https://cloudhpc.news/content/images/size/w1600/2026/04/ML-vs-MC-Pi.png 1600w, https://cloudhpc.news/content/images/size/w2400/2026/04/ML-vs-MC-Pi.png 2400w" sizes="(min-width: 720px) 720px"></figure><p>This analogy though is what we have with LLMs. A better random number generator. That&#x2019;s not really random. It might even be mostly correct. But not always. So you need less throws of the dart (or MC paths).</p><p>I think the finance quants just saw this way before I did. Before most of us did. Before the AI Labs will say it out loud but will quietly point to it. I thought it was mad because the cost of one LLM inference is many thousands of times that of running a single MC path with traditional models. The cost of compute wouldn&#x2019;t make sense. But Vera Rubin drastically drops inferences costs I&#x2019;m told. Taalas has an ASIC that can run inference that is 100 to 1000 times faster (read cheaper) than current generation GPUs. Maybe we don&#x2019;t need 10,000 paths, maybe 100 will do. Maybe this isn&#x2019;t so mad. Even if it&#x2019;s just for a subset of limited used cases like real time risk.</p><p>The next problem of course becomes that not everything has a simple automatable verification. For some reason this isn&#x2019;t getting talked about much and when I do bring it up, people have this perplexed look on their face. Like I&#x2019;m the one that&#x2019;s mad. Some complex derivatives can take literal hours of expensive compute to price (though admittedly we seem to trade less and less of these weird things). Validating each fanciful idea from an LLM to see if it improved things would cost a fortune even if the inference is cheap.</p><p>Peter Higgs proposed the existence of the Higgs Boson in 1964. It took 40 years and the construction of the Large Hadron Collider at CERN to validate that idea.&#xA0;</p>]]></content:encoded></item><item><title><![CDATA[Quantum #106]]></title><description><![CDATA[Issue #106 of the weekly HPC newsletter by HMx Labs. How are we going to power all the GPUs? And which ones are worth powering up? France goes Exascale.]]></description><link>https://cloudhpc.news/quantum-106/</link><guid isPermaLink="false">69d363961b3880bdc6b81e52</guid><category><![CDATA[Quantum Weekly]]></category><dc:creator><![CDATA[Hamza]]></dc:creator><pubDate>Mon, 06 Apr 2026 09:00:40 GMT</pubDate><media:content url="https://cloudhpc.news/content/images/2026/04/quantum-106.jpg" medium="image"/><content:encoded><![CDATA[<img src="https://cloudhpc.news/content/images/2026/04/quantum-106.jpg" alt="Quantum #106"><p>MLCommons dropped version 6 of their benchmarks for AI inference last week amongst increasing concern for how we&#x2019;re going to power all the GPUs that Nvidia is cranking out.&#xA0;&#xA0;That&#x2019;s probably just as well, given we&#x2019;ll have to figure out which GPUs are going to be worth powering up and which ones we just leave sat in their boxes. Though as any seasoned HPC professionals will know, we probably should have started with the benchmarking and optimisation rather than the grandiose build plans.&#xA0;</p><p>Whilst optimisation doesn&#x2019;t give you the numbers you need to win the billionaire *cough* rocket *cough* measuring contest it might just give you a path to usable AI inference at scale. Might. If you&#x2019;re running converged classical and AI models the latest MLPerf Inference v6.0 benchmarks are probably quite handy. More details and the results down below. Also HPC Wire took this moment to remind us that its system performance that will count.&#xA0;</p><p>Most readers of this were already aware that we&#x2019;re never going to be able to power all the data centres that have been provisioned, planned or promised. I guess current affairs means that the rest of the news cycle is catching up to that reality. Microsoft signed a deal with Chevron and that&#x2019;s not something a younger version of me ever thought I&#x2019;d hear myself say. And Google has decided that natural gas is the way forward to power AI. Maybe they didn&#x2019;t get the memo about what&#x2019;s going on in the world.&#xA0;</p><p>Against this backdrop of news, France decided it was time to join the Exascale compute club. Not sure that means anything anymore with the HPC Top 500 being less and less relevant, missing as it is anything from the hyperscalers. I guess it still means something in the scientific community. Or in the measuring contests for politicians rather than billionaires.&#xA0;</p><p>That was a rather cynical take this week. I better go outside and get some sun. And some Easter chocolate.</p><hr><h2 id="in-the-news">In The News</h2><p>Updates from the big three clouds on all things HPC.</p><figure class="kg-card kg-bookmark-card"><a class="kg-bookmark-container" href="https://cloudhpc.news/hpc-cloud-updates-we-05-apr-2026/"><div class="kg-bookmark-content"><div class="kg-bookmark-title">HPC Cloud Updates WE 05 Apr 2026</div><div class="kg-bookmark-description">Updates to AWS, Azure &amp; GCP in the last week relevant for HPC practitioners. Log aggregation for Slurm on Azure and faster VM startup</div><div class="kg-bookmark-metadata"><img class="kg-bookmark-icon" src="https://cloudhpc.news/favicon.ico" alt="Quantum #106"><span class="kg-bookmark-author">Flux by HMx Labs</span><span class="kg-bookmark-publisher">Hamza</span></div></div><div class="kg-bookmark-thumbnail"><img src="https://cloudhpc.news/content/images/2026/04/CloudHPCNews-2026-04-05.jpg" alt="Quantum #106"></div></a></figure><p>MLCommons releases v6 of their inference benchmark. <a href="https://mlcommons.org/visualizer?ref=cloudhpc.news" rel="noreferrer">Results here</a></p><figure class="kg-card kg-bookmark-card"><a class="kg-bookmark-container" href="https://mlcommons.org/2026/04/mlperf-inference-v6-0-results/?ref=cloudhpc.news"><div class="kg-bookmark-content"><div class="kg-bookmark-title">MLCommons Releases New MLPerf Inference v6.0 Benchmark Results - MLCommons</div><div class="kg-bookmark-description">MLCommons releases MLPerf Inference v6.0 results &#x2014; the most significant benchmark update to date, with new tests for text-to-video, GPT-OSS 120B, DLRMv3, vision-language models, and YOLOv11</div><div class="kg-bookmark-metadata"><img class="kg-bookmark-icon" src="https://mlcommons.org/wp-content/uploads/2024/10/cropped-favicon-270x270.png" alt="Quantum #106"><span class="kg-bookmark-author">MLCommons</span><span class="kg-bookmark-publisher">MLCommons</span></div></div><div class="kg-bookmark-thumbnail"><img src="https://secure.gravatar.com/avatar/b4e843c84ed301440f20939d7c695f473d45ec1b4196944d4a6b09c5b23b6ce3?s=96&amp;d=mm&amp;r=g" alt="Quantum #106"></div></a></figure><p>Powering Data centres becomes critical</p><p><a href="https://www.axios.com/2026/04/02/google-natural-gas-ai-power-energy?ref=cloudhpc.news">https://www.axios.com/2026/04/02/google-natural-gas-ai-power-energy</a></p><p><a href="https://www.reuters.com/business/energy/microsoft-chevron-engine-no-1-sign-exclusive-deal-power-supply-2026-03-31/?ref=cloudhpc.news">https://www.reuters.com/business/energy/microsoft-chevron-engine-no-1-sign-exclusive-deal-power-supply-2026-03-31/</a></p><p>We might not get (or need?) all the compute that was planned</p><figure class="kg-card kg-bookmark-card"><a class="kg-bookmark-container" href="https://futurism.com/science-energy/data-centers-construction-supply?ref=cloudhpc.news"><div class="kg-bookmark-content"><div class="kg-bookmark-title">Almost Half of US Data Centers That Were Supposed to Open This Year Slated to Be Canceled or Delayed</div><div class="kg-bookmark-description">The vast majority of data centers scheduled for completion over the next few years have yet to even break ground.</div><div class="kg-bookmark-metadata"><img class="kg-bookmark-icon" src="https://futurism.com/wp-content/uploads/2025/09/favicon.png?quality=85&amp;w=192" alt="Quantum #106"><span class="kg-bookmark-author">Futurism</span><span class="kg-bookmark-publisher">Joe Wilkins</span></div></div><div class="kg-bookmark-thumbnail"><img src="https://futurism.com/wp-content/uploads/2026/04/data-centers-construction-supply.jpg?w=1200" alt="Quantum #106"></div></a></figure><p>France set to join the Exascale club</p><figure class="kg-card kg-bookmark-card"><a class="kg-bookmark-container" href="https://www.eurohpc-ju.europa.eu/contract-signed-alice-recoque-europes-new-exascale-supercomputer-2025-11-18_en?ref=cloudhpc.news"><div class="kg-bookmark-content"><div class="kg-bookmark-title">Contract Signed for Alice Recoque, Europe&#x2019;s New Exascale Supercomputer</div><div class="kg-bookmark-description">The procurement contract for Alice Recoque, the new European exascale supercomputer to be located in France, has been signed by the European High Performance Computing Joint Undertaking (EuroHPC JU) and the selected vendor, Eviden.</div><div class="kg-bookmark-metadata"><img class="kg-bookmark-icon" src="https://www.eurohpc-ju.europa.eu/modules/custom/ewcms_extended/assets/images/hcp_favicon.ico" alt="Quantum #106"><span class="kg-bookmark-author">The European High Performance Computing Joint Undertaking (EuroHPC JU)</span></div></div><div class="kg-bookmark-thumbnail"><img src="https://www.eurohpc-ju.europa.eu/sites/default/files/styles/ewcms_metatag_image/public/2025-11/Logo%20AR.png?h=134c66c7&amp;itok=o08zyu-Y" alt="Quantum #106"></div></a></figure><p>System performance is what counts</p><p><a href="https://www.hpcwire.com/2026/04/02/forget-about-chips-its-the-system-that-matters-for-ai/?ref=cloudhpc.news">https://www.hpcwire.com/2026/04/02/forget-about-chips-its-the-system-that-matters-for-ai/</a></p><hr><h2 id="from-hmx-labs">From HMx Labs</h2><p>&#xA0;This was just an April Fool&#x2019;s joke (though Corey Quinn says this entire category for faux series A pitch decks as a joke isn&#x2019;t funny anymore).</p><figure class="kg-card kg-bookmark-card"><a class="kg-bookmark-container" href="https://cloudhpc.news/hmx-labs-announces-continuum/"><div class="kg-bookmark-content"><div class="kg-bookmark-title">HMx Labs Announces Continuum&#x2122;</div><div class="kg-bookmark-description">90% cheaper compute that feels like On Demand VMs but is built on unused spot capacity across multiple providers.</div><div class="kg-bookmark-metadata"><img class="kg-bookmark-icon" src="https://cloudhpc.news/favicon.ico" alt="Quantum #106"><span class="kg-bookmark-author">Flux by HMx Labs</span><span class="kg-bookmark-publisher">Hamza</span></div></div><div class="kg-bookmark-thumbnail"><img src="https://cloudhpc.news/content/images/2026/04/Continuum-Header.jpg" alt="Quantum #106"></div></a></figure><p>I hope you&#x2019;re enjoying your Easter chocolate!</p><figure class="kg-card kg-image-card"><img src="https://cloudhpc.news/content/images/2026/04/chocolate-easter-egg-rack-server.jpg" class="kg-image" alt="Quantum #106" loading="lazy" width="1408" height="768" srcset="https://cloudhpc.news/content/images/size/w600/2026/04/chocolate-easter-egg-rack-server.jpg 600w, https://cloudhpc.news/content/images/size/w1000/2026/04/chocolate-easter-egg-rack-server.jpg 1000w, https://cloudhpc.news/content/images/2026/04/chocolate-easter-egg-rack-server.jpg 1408w" sizes="(min-width: 720px) 720px"></figure><hr><p>Know someone else who might like to read this newsletter? Forward this on to them or even better, ask them to sign up here:&#xA0;<a href="https://cloudhpc.news/">https://cloudhpc.news</a></p>]]></content:encoded></item><item><title><![CDATA[HPC Cloud Updates WE 05 Apr 2026]]></title><description><![CDATA[Updates to AWS, Azure & GCP in the last week relevant for HPC practitioners. Log aggregation for Slurm on Azure and faster VM startup]]></description><link>https://cloudhpc.news/hpc-cloud-updates-we-05-apr-2026/</link><guid isPermaLink="false">69d347091b3880bdc6b81e41</guid><category><![CDATA[HPC Cloud Release Notes]]></category><dc:creator><![CDATA[Hamza]]></dc:creator><pubDate>Mon, 06 Apr 2026 05:41:26 GMT</pubDate><media:content url="https://cloudhpc.news/content/images/2026/04/CloudHPCNews-2026-04-05.jpg" medium="image"/><content:encoded><![CDATA[<h2 id="aws">AWS</h2><img src="https://cloudhpc.news/content/images/2026/04/CloudHPCNews-2026-04-05.jpg" alt="HPC Cloud Updates WE 05 Apr 2026"><p>Neither I nor my AI bot could find anything from AWS this past week. Not even a new regional instance going live. I guess they&#x2019;re too busy battling the fires Kiro lit.</p><hr><h2 id="azure">Azure</h2><p>Need a centralised way to look at your HPC logs? Want a turnkey solution for Cycle Cloud Workspace for Slurm? Azure has some ideas</p><figure class="kg-card kg-bookmark-card"><a class="kg-bookmark-container" href="https://techcommunity.microsoft.com/blog/azurehighperformancecomputingblog/simplify-troubleshooting-at-scale---centralized-log-management-for-cyclecloud-wo/4470658?ref=cloudhpc.news"><div class="kg-bookmark-content"><div class="kg-bookmark-title">Simplify troubleshooting at scale - Centralized Log Management for CycleCloud Workspace for Slurm | Microsoft Community Hub</div><div class="kg-bookmark-description">Training large AI models on hundreds or thousands of nodes introduces a critical operational challenge: when a distributed job fails, quickly identifying the&#x2026;</div><div class="kg-bookmark-metadata"><img class="kg-bookmark-icon" src="https://techcommunity.microsoft.com/t5/s/gxcuf89792/m_assets/themes/customTheme1/favicon-1730836283320.png?time=1730836286415&amp;image-dimensions=180x180" alt="HPC Cloud Updates WE 05 Apr 2026"><span class="kg-bookmark-author">TECHCOMMUNITY.MICROSOFT.COM</span><span class="kg-bookmark-publisher">jesselopez</span></div></div><div class="kg-bookmark-thumbnail"><img src="https://techcommunity.microsoft.com/t5/s/gxcuf89792/images/bS00NDcwNjU4LWNVVXhSbA?revision=2" alt="HPC Cloud Updates WE 05 Apr 2026"></div></a></figure><p>This could be handy, especially if building HPC capacity using spot compute. It&apos;s still in preview but we&#x2019;ll definitely be playing with this to see the potential impact.</p><figure class="kg-card kg-bookmark-card"><a class="kg-bookmark-container" href="https://azure.microsoft.com/en-gb/updates/?id=559322&amp;ref=cloudhpc.news"><div class="kg-bookmark-content"><div class="kg-bookmark-title">Azure updates | Microsoft Azure</div><div class="kg-bookmark-description">Subscribe to Microsoft Azure today for service updates, all in one place. Check out the new Cloud Platform roadmap to see our latest product plans.</div><div class="kg-bookmark-metadata"><img class="kg-bookmark-icon" src="https://azure.microsoft.com/favicon.ico?v2" alt="HPC Cloud Updates WE 05 Apr 2026"><span class="kg-bookmark-author">Microsoft Azure</span></div></div><div class="kg-bookmark-thumbnail"><img src="https://cdn-dynmedia-1.microsoft.com/is/image/microsoftcorp/ACOM-Roadmap-hero?resMode=sharp2&amp;op_usm=1.5,0.65,15,0&amp;wid=1600&amp;hei=635&amp;qlt=85&amp;fit=constrain" alt="HPC Cloud Updates WE 05 Apr 2026"></div></a></figure><hr><h2 id="google-cloud">Google Cloud</h2><p>Nope. Nothing here either.</p>]]></content:encoded></item><item><title><![CDATA[HMx Labs Announces Continuum™]]></title><description><![CDATA[90% cheaper compute that feels like On Demand VMs but is built on unused spot capacity across multiple providers.]]></description><link>https://cloudhpc.news/hmx-labs-announces-continuum/</link><guid isPermaLink="false">69ccbbb81b3880bdc6b81dd2</guid><category><![CDATA[Random Bytes]]></category><dc:creator><![CDATA[Hamza]]></dc:creator><pubDate>Wed, 01 Apr 2026 07:00:39 GMT</pubDate><media:content url="https://cloudhpc.news/content/images/2026/04/Continuum-Header.jpg" medium="image"/><content:encoded><![CDATA[<img src="https://cloudhpc.news/content/images/2026/04/Continuum-Header.jpg" alt="HMx Labs Announces Continuum&#x2122;"><p>Spot compute, but it just&#x2026; doesn&#x2019;t stop.</p><p>Continuum is a deterministic execution layer built entirely on top of spare capacity across&#xA0;Amazon Web Services,&#xA0;Microsoft Azure,&#xA0;Google Cloud,&#xA0;Oracle Cloud,&#xA0;IBM Cloud,&#xA0;CoreWeave,&#xA0;Nebius&#xA0;and&#xA0;Nscale.</p><p>Yes, we know how that sounds.</p><p>Under the hood, workloads are continuously fragmented, moved, and rehydrated across a global pool of spot instances. You no longer need to care about which or how many CPUs or GPUs you need.&#xA0;</p><p>No retries. No checkpointing ceremonies. No &#x201C;this is fine&#x201D; dashboards.</p><p>Think your workloads won&#x2019;t work with this? Our Agentic Adaptation Tools say otherwise.</p><p>Just point the&#xA0;Continuum Adaptation Agent (CAA)&#xA0;at your repo.</p><p>It will:</p><ul><li>Refactor your code to run on Continuum</li><li>Optimise for fragmentation, migration, and rehydration</li><li>Quietly remove anything that assumes infrastructure behaves predictably</li></ul><p>Not only that but you can also connect your on-prem cluster and&#xA0;sell spare capacity back into Continuum, turning that &#x201C;idle HPC estate&#x201D; into something your finance team suddenly cares about. And still use it in preference to renting someone else&#x2019;s capacity.</p><p>Still stuck on CUDA and Nvidia? The CAA will refactor your code to use the Continuum GPU Rosetta Layer so you can use any available GPU or accelerator.</p><p>Continuum. Making CPU and GPU compute as easy to use as electricity.</p><figure class="kg-card kg-image-card"><img src="https://cloudhpc.news/content/images/2026/04/continuum-explainer.jpg" class="kg-image" alt="HMx Labs Announces Continuum&#x2122;" loading="lazy" width="1408" height="768" srcset="https://cloudhpc.news/content/images/size/w600/2026/04/continuum-explainer.jpg 600w, https://cloudhpc.news/content/images/size/w1000/2026/04/continuum-explainer.jpg 1000w, https://cloudhpc.news/content/images/2026/04/continuum-explainer.jpg 1408w" sizes="(min-width: 720px) 720px"></figure><hr><p><strong>For the avoidance of any doubt: This was an April Fool&apos;s joke. </strong></p><p>I had a few conversations where I had to clarify that. I know it sounds real and we do some work in parts of this space but we don&apos;t have Continuum as a product I&apos;m afraid!</p>]]></content:encoded></item></channel></rss>