Inside MemKV, MinIO’s 3.5G Solution for KV Cache Acceleration
MinIO rolled out its second major product earlier this month. Dubbed MemKV, the software expands the KV cache layer in AI inference clusters, thereby enabling...
It’s time once again for HPC Career Notes, our monthly feature that’s designed to keep you up-to-date on the latest career developments for individuals in the HPC community, including promotion,…
FuriosaAI and Broadcom are teaming up to develop a next-generation AI inference cluster that combines hundreds of FuriosaAI’s third-generation chips with Broadcom’s high-bandwidth, low-latency Ethernet interconnect. The as-yet unnamed system…
As we head into summer, data centers around the world are facing a challenging weather pattern similar to what we saw in 2025. Last July, the average global temperature hit…
U.S. sanctions were designed to limit China’s access to the most advanced semiconductor technologies. The sanctions include limited access to advanced chips from companies like NVIDIA and AMD. What this…
Artificial intelligence is poised to give scientists a leg up with abstract components of scientific reasoning. That’s the target for the...
AI has entered an industrial phase, no longer confined to isolated models or experimental deployments. AI now operates as always-on AI...
This annual, all-hands conference, keynoted by Dario Gil, Under Secretary for Science at the US DOE, convenes AI leaders from industry, academia, and national labs to develop best practices for utilizing AI for scientific discovery and engineering at scale.
MinIO rolled out its second major product earlier this month. Dubbed MemKV, the software expands the KV cache layer in AI inference clusters, thereby enabling...
There is a certain rhythm to the technology industry that I’ve observed over several decades: We find a shiny new tool, we promise it will...
Yesterday the U.S. Department of Commerce announced letters of intent (LOIs) with nine quantum computing and manufacturing companies for more than $2 billion in proposed...
Something about building GenAI LLMs bugs me. Before I begin, let me be clear: I am a supporter of AI technologies, particularly in science. Lately,...
For years, hyperscalers built infrastructure largely for their own cloud ecosystems. However, AI is starting to break that model. The sheer cost of accelerators, data...
Quantum computing is a field of computer science that utilizes quantum mechanics to process information at very high speeds and solve problems that may be...
In order to fit today’s neural networks onto hardware, some practitioners utilize some type of weight-pruning method to compress the…
Google is launching a series of new tools to help scientists leverage AI technology as a force multiplier to accelerate…
AI infrastructure and software company Scale AI has signed an MOU with the Department of Energy to support the Genesis…
The Spectra supercomputer built by Penguin Solutions using Maverick-2 chips from NextSilicon received fully system acceptance by Sandia National Laboratories,…
The 2026 State of AI Infrastructure Report, based on a survey of 600 U.S. IT and business leaders, reveals that infrastructure—more than AI models or accelerators—is now the critical determinant of enterprise AI success. As organizations transition from experimentation to production, four key pressures have emerged: rising infrastructure complexity, strained...
After a blowout year in 2024, HPC spending slowed a bit in 2025, posting a more modest–but still very impressive–gain…
Which scheduler should you use for AI workloads, Slurm or Kubernetes? It’s a debate that generates passionate arguments on both…
For most of the AI boom, the general assumption regarding AI hardware was to keep scaling GPU clusters and the…
The GenAI boom has made hardware hot, both literally and figuratively. Unfortunately, the huge demand for infrastructure has completely disrupted…