ZeroPoint's nanosecond-scale memory compression could tame power-hungry AI infrastructure

Devin Coldewey

Updated 23 May 2024 at 12:30 pm·5-min read

AI is only the latest and hungriest market for high-performance computing, and system architects are working around the clock to wring every drop of performance out of every watt. Swedish startup ZeroPoint, armed with €5 million ($5.5 million USD) in new funding, wants to help them out with a novel memory compression technique at the nanosecond scale — and yes, it's exactly as complicated as it sounds.

The concept is this: losslessly compress data just before it enters RAM, and decompress it afterwards, effectively widening the memory channel by 50% or more just by adding one small piece to the chip.

Compression is, of course, a foundational technology in computing; as ZeroPoint CEO Klas Moreau (left in the image above, with co-founders Per Stenström and Angelos Arelakis) pointed out, "We wouldn't store data on the hard drive today without compressing it. Research suggests 70% of data in memory is unnecessary. So why don't we compress in memory?"

The answer is we don't have the time. Compressing a large file for storage (or encoding it, as we say when it's video or audio) is a task that can take seconds, minutes or hours depending on your needs. But data passes through memory in a tiny fraction of a second, shifted in and out as fast as the CPU can do it. A single microsecond's delay, to remove the "unnecessary" bits in a parcel of data going into the memory system would be catastrophic to performance.

Memory doesn't necessarily advance at the same rate as CPU speeds, though the two (along with lots of other chip components) are inextricably connected. If the processor is too slow, data backs up in memory — and if memory is too slow, the processor wastes cycles waiting on the next pile of bits. It all works in concert, as you might expect.

While super-fast memory compression has been demonstrated, it results in a second problem: Essentially, you have to decompress the data just as fast as you compressed it, returning it to its original state, or the system won't have any idea how to handle it. So unless you convert your whole architecture over to this new compressed-memory mode, it's pointless.

ZeroPoint claims to have solved both of these problems with hyper-fast, low-level memory compression that requires no real changes to the rest of the computing system. You add their tech onto your chip, and it's as if you've doubled your memory.

Although the nitty-gritty details will likely only be intelligible to people in this field, the basics are easy enough for the uninitiated to grasp, as Moreau proved when he explained it to me.

"What we do is take a very small amount of data — a cache line, sometimes 512 bits — and identify patterns in it," he said. "It's the nature of data, that's it's populated with not so efficient information, information that is sparsely located. It depends on the data: The more random it is, the less compressible it is. But when we look at most data loads, we see that we are in the range of two-four times [more data throughput than before]."

This isn't how memory actually looks. But you get the idea.

It's no secret that memory can be compressed. Moreau said that everyone in large-scale computing knows about the possibility (he showed me a paper from 2012 demonstrating it), but has more or less written it off as academic, impossible to implement at scale. But ZeroPoint, he said, has solved the problems of compaction — reorganizing the compressed data to be more efficient still — and transparency, so the tech not only works but works quite seamlessly in existing systems. And it all happens in a handful of nanoseconds.

"Most compression technologies, both software and hardware, are on the order of thousands of nanoseconds. CXL [compute express link, a high-speed interconnect standard] can take that down to hundreds," Moreau said. "We can take it down to three or four."

Here's CTO Angelos Arelakis explaining it his way:

https://youtu.be/_IdNOQwqXGc

ZeroPoint's debut is certainly timely, with companies around the globe in quest of faster and cheaper compute with which to train yet another generation of AI models. Most hyperscalers (if we must call them that) are keen on any technology that can give them more power per watt or let them lower the power bill a little.

The primary caveat to all this is simply that, as mentioned, this needs to be included on the chip and integrated from the ground up — you can't just pop a ZeroPoint dongle into the rack. To that end, the company is working with chipmakers and system integrators to license the technique and hardware design to standard chips for high-performance computing.

Of course that is your Nvidias and your Intels, but increasingly also companies like Meta, Google and Apple, which have designed custom hardware to run their AI and other high-cost tasks internally. ZeroPoint is positioning its tech as a cost savings, though, not a premium: Conceivably, by effectively doubling the memory, the tech pays for itself before long.

The €5 million A round just closed was led by Matterwave Ventures, with Industrifonden acting as the local Nordic lead, and existing investors Climentum Capital and Chalmers Ventures chipping in as well.

Moreau said that the money should allow them to expand into U.S. markets, as well as double down on the Swedish ones they are already pursuing.

Reuters
Apple's China smartphone shipments drop 6.7% as Huawei surges, data shows
BEIJING (Reuters) -Apple's smartphone shipments in China fell by 6.7% in the second quarter of 2024, as the tech giant faced intensifying competition from rivals like Huawei, according to data from market research firm Canalys. Apple's total shipments for the quarter ending in June stood at 9.7 million units, down from 10.4 million units in the same quarter last year, Canalys data shows. In contrast, Huawei's smartphone shipments surged 41% year-on-year to 10.6 milion in the quarter, bolstered by the launch of its new Pura 70 series in April.
Reuters
Amazon racing to develop AI chips cheaper, faster than Nvidia's, executives say
Inside Amazon.com's chip lab in Austin, Texas, half a dozen engineers on a Friday afternoon put a closely guarded new server design through its paces. The server was packed with Amazon's artificial intelligence chips that compete with those from market leader Nvidia, Amazon executive Rami Sinno said on Friday, during a visit to the lab. Amazon is developing its own processors to limit its reliance on costly Nvidia chips - the so-called Nvidia tax - that power some of the artificial intelligence cloud business at its Amazon Web Services, the main growth driver.
Fortune
Startup with ‘radical’ concept for AI chips emerges from stealth with $15 million to try to challenge Nvidia
Fractile thinks it can deliver 100 times faster performance 10 times cheaper than GPUs.
Reuters
CrowdStrike says over 97% of Windows sensors back online
The outage happened because the advanced platform contained a fault that forced computers running Microsoft's Windows operating system to crash and show the so-called blue screen of death. Microsoft said on Saturday about 8.5 million Windows devices had been affected in the outage that had left flights grounded, forced broadcasters off air and left customers without access to services such as healthcare or banking."Our recovery efforts have been enhanced thanks to the development of automatic recovery techniques and by mobilizing all our resources to support our customers," Kurtz said in a post on LinkedIn.
Digital Spy
My long-term iPhone 15 Pro Max review
I’ve been using the iPhone 15 Pro Max for over six months to come to my long-term verdict on the big-screen flagship with excellent cameras and battery life.
MediaOutReach
Southco Introduces A Flush-Mount E6-73 Constant Torque Hinge
HONG KONG SAR - Media OutReach Newswire - 26 July 2024 - Southco Asia Ltd., a subsidiary of Southco Inc., a leading global provider of engineered access solutions such as locks, latches, captive fasteners, electronic access solutions, and hinges/positioning technology, has developed a flush-mount version of its popular E6 constant torque hinge. The E6-73 Stainless Steel Constant Torque Hinge provides all the benefits of a torque hinge in a low-profile, corrosion-resistant package, making it an i
Reuters
Epic Games says Fortnite returning to iOS in EU, leaving Samsung app store
Epic has been attempting to expand the distribution of its games beyond smartphone companies' official app stores, opposing steep commissions on in-app payments and users being limited to downloading applications through dedicated stores. The company also said its videogames will be leaving the Samsung Galaxy Store in protest of the phone maker's decision to block default side-loading - the installation of applications on a mobile device without using its dedicated app store - on Android devices, calling it "anticompetitive". Along the same lines, Epic said its mobile games will come to AltStore on iOS in the EU.
Evening Standard
SearchGPT unveiled: OpenAI is launching a search engine to compete with Google
Here’s how you can sign up to be among the first to try the new AI-powered search tool
Engadget
Samsung Galaxy Buds 3 and Galaxy Buds 3 Pro review: AirPods clones that actually deliver
The Galaxy Buds 3 series offers two sets of great earbuds, but Samsung borrowed heavily from Apple to design them.
Yahoo Finance Video
OpenAI announces testing of new search engine, SearchGPT
OpenAI — which Microsoft (MSFT) owns a 49% stake in — is stepping into the search engine game. In a move that could rival Alphabet's (GOOG, GOOGL) Google search, the artificial intelligence developer unveiled its SearchGPT engine. The Market Domination team analyzes OpenAI's blog post outlining what it hopes to achieve in this search platform. For more expert insight and the latest market action, click here to watch this full episode of Market Domination. This post was written by Luke Carberry Mogan.
The Guardian
North Korea-backed cyber espionage campaign targets UK military
National Cyber Security Centre warns of global hacking effort to obtain nuclear and defence intelligence
Engadget
OpenAI unveils SearchGPT, an AI-powered search engine
The launch of SearchGPT comes amid growing competition in AI-powered search.
South China Morning Post
Chinese AI start-up Baichuan raises US$700 million from Alibaba, Tencent, Xiaomi
Baichuan AI, one of China's four so-called artificial intelligence (AI) tigers, raised about 5 billion yuan (US$687.6 million) in a new funding round that valued the start-up at more than 20 billion yuan, the company said on Thursday. The Beijing-based firm's latest round was backed by some of the biggest names in Chinese technology, including Alibaba Group Holding, Tencent Holdings and Xiaomi, along with some state-backed funds. Alibaba owns the South China Morning Post. China International Cap
Sky News
£7.7 million bounty offered in hunt for members of North Korea-backed hacking group
The UK, US and South Korea have accused a North Korea-backed cyber group of carrying out an online espionage campaign to steal military and nuclear secrets. The "Andariel" group has been compromising organisations around the globe as it attempts to get hold of sensitive and classified technical information and intellectual property data, according to the UK's National Cyber Security Centre (NCSC). The centre, along with the FBI in the US and South Korea's national intelligence service, have issued a joint warning and advisory note about Andariel's actions.
USA TODAY
Get an Apple AirTag tracking device for the lowest price we've seen in months
Keep a watchful eye on your keys, wallet, luggage, and more with an Apple AirTag. Get the tracker on sale at Amazon for just $24, the lowest price we've seen in months.
Cosmo
Rosalía goes braless and *almost* frees the nip in a lace naked dress
Rosalía stepped out wearing a breathtaking naked dress at the Prelude to the Olympics in Paris. The design was a nude coloured see-through lace gown by Dior.
HuffPost
‘I Approve This Message’: Kamala Harris Instantly Uses Trump’s Own Words Against Him
That didn’t take long.
SETHLUI.COM
Ru Yi Yuan: Rude, stingy & unhygenic auntie has hour-long queue for vegetarian bee hoon
The post Ru Yi Yuan: Rude, stingy & unhygenic auntie has hour-long queue for vegetarian bee hoon appeared first on SETHLUI.com.
Evening Standard
Arne Slot hails double new Liverpool addition with vital Premier League experience secured
New Reds boss hails latest Anfield arrivals as plans take shape
The Independent
Stranded Boeing astronauts are stuck on International Space Station, Nasa says in urgent update
The astronauts stranded on the International Space Station are still not able to come home, Nasa has said. Two astronauts went to the space station almost 50 days ago as part of a test of Boeing’s Starliner capsule. Test pilots Butch Wilmore and Suni Williams were supposed to visit the orbiting lab for about a week and return in mid-June, but thruster failures and helium leaks on Boeing‘s new Starliner capsule prompted Nasa and Boeing to keep them up longer.

ZeroPoint's nanosecond-scale memory compression could tame power-hungry AI infrastructure

Latest stories

Apple's China smartphone shipments drop 6.7% as Huawei surges, data shows

Amazon racing to develop AI chips cheaper, faster than Nvidia's, executives say

Startup with ‘radical’ concept for AI chips emerges from stealth with $15 million to try to challenge Nvidia

CrowdStrike says over 97% of Windows sensors back online

My long-term iPhone 15 Pro Max review

Southco Introduces A Flush-Mount E6-73 Constant Torque Hinge

Epic Games says Fortnite returning to iOS in EU, leaving Samsung app store

SearchGPT unveiled: OpenAI is launching a search engine to compete with Google

Samsung Galaxy Buds 3 and Galaxy Buds 3 Pro review: AirPods clones that actually deliver

OpenAI announces testing of new search engine, SearchGPT

North Korea-backed cyber espionage campaign targets UK military

OpenAI unveils SearchGPT, an AI-powered search engine

Chinese AI start-up Baichuan raises US$700 million from Alibaba, Tencent, Xiaomi

£7.7 million bounty offered in hunt for members of North Korea-backed hacking group

Get an Apple AirTag tracking device for the lowest price we've seen in months

Rosalía goes braless and almost frees the nip in a lace naked dress

‘I Approve This Message’: Kamala Harris Instantly Uses Trump’s Own Words Against Him

Ru Yi Yuan: Rude, stingy & unhygenic auntie has hour-long queue for vegetarian bee hoon

Arne Slot hails double new Liverpool addition with vital Premier League experience secured

Stranded Boeing astronauts are stuck on International Space Station, Nasa says in urgent update