Quantum #114

Issue #114 of the weekly HPC newsletter by HMx Labs. More silicon news this week with a benchmarks for Nividia’s Vera CPU, ByteDance planning their own CPU and yet another AI inference option.

Quantum #114

Nothing for decades and suddenly tens of them come along at once. I think this must be the third week in a row that the interesting HPC/ supercomputing news is dominated by developments in silicon.

AWS has had Graviton for about 8 years, Microsoft Azure and Google Cloud both also decided to offer their own ARM based CPUs and even Meta has developed its own designs. Now ByteDance becomes the latest big tech company to plan to tape out its own custom chips. Though they are currently hedging their bets between it being an ARM or RISC-V architecture.

Meanwhile Phoronix has managed to get their hands on Nvidia’s Vera CPU. The selection of benchmarks results they’ve chosen is, shall we say, interesting. Not sure we’re likely to get our hands on any Vera kit any time soon to benchmark ourselves but we do currently have our mitts on some Intel CPUs and are considering if we include some of the Phoronix benchmarks in our analysis going forward. Specifically, the HPC related benchmarks and probably quanlib too. Maybe we should include SPEC too? All available for cloud or physical selection decisions via FLOPx.

A new startup, FuriosaAI, seems to have an alternative approach to accelerating inference, choosing to focus on high bandwidth data movement and tensor operations rather than a multitude of small cores. Sounds promising but modern GPUs sound awfully similar to that these days, no?

To address my concern from the previous two weeks about the difficultly in targeting multiple accelerator architectures it seems there are some people out there trying to solve this. I mean I kind hoped and assumed this would be case but hadn’t seen anything till last week in the form of ZML for one. I guess there may also be others, so if you know of any please do point me to them. I sincerely hope someone cracks this, not so much from a technical perspective, but more from a general adoption point of view.

TSMC seems to think efficiency might be the most critical metric its customers are now asking for. I’m actually surprised it took this long, given that the most bottlenecked resource seems to be the energy to power enough of these things. 

Lastly, we’re still hiring. We have open roles for both a HPC software engineer and marketing. Oh, and expect this to be the last issue of this newsletter branded as Quantum. A new look (but with the same content) coming next week!


In The News

Updates from the big three clouds on all things HPC. If you want the latest on AI too, just hit the buttons in the menu

Noteworthy by HMX Labs

 Silicon related shenanigans

https://www.reuters.com/world/china/bytedance-developing-custom-cpu-chips-support-ai-rollout-sources-say-2026-05-28/?utm_source=chatgpt.com

https://www.phoronix.com/review/nvidia-vera-benchmarks

https://www.hpcwire.com/2026/05/28/furiosaai-and-broadcom-team-up-to-build-rack-scale-inference-clusters/

An alternative to CUDA that works everywhere?

I think we broke the CUDA moat for good. Software built with ZML now runs transparently at max speed on: - CPU - NVIDIA GPUs - AMD GPUs - Google TPUs - AWS Trainium - Intel GPUs - Tenstorrent NPUs -… | Steeve Morin | 31 comments
I think we broke the CUDA moat for good. Software built with ZML now runs transparently at max speed on: - CPU - NVIDIA GPUs - AMD GPUs - Google TPUs - AWS Trainium - Intel GPUs - Tenstorrent NPUs - Apple GPUs (very experimental) That’s 8 different architectures. For instance, this screenshot is from 2 Intel GPUs working in Tensor Parallel mode, something Intel themselves only shipped in a very old vLLM fork. | 31 comments on LinkedIn
ZML - Model to Metal
ZML is a production inference stack, purpose-built to decouple AI workloads from proprietary hardware.

Is the focus finally going to change to efficiency too?

https://www.datacenterdynamics.com/en/news/energy-efficient-compute-is-most-important-attribute-for-customers-tsmc-claims/?utm_source=chatgpt.com


From HMx Labs

 New logo for this weekly newsletter incoming soon

Same Weekly Newsletter. New Name and Logo
I first started writing a weekly newsletter about HPC back in 2024. I figured it needed a name of some kind, but I didn’t really put much thought into it. I think I just Googled/ asked ChatGPT for some names that sounded cool. Wasn’t a great move as

Also we’re hiring not only for our marketing role but also an HPC software engineer now

Hiring: HPC Engineer
We’re looking for another HPC software engineer! Know someone that might be interested? Fancy pointing them my way please? This video was brought to you by the letters H, M and X and the number 42. Have a good weekend everyone. 0:00 /0:03 1× Apply on LinkedIn
Hiring: Marketing & Business Development
We’re looking for someone awesome to come help us shape the future of HMx Labs

Know someone else who might like to read this newsletter? Forward this on to them or even better, ask them to sign up here: https://cloudhpc.news