Chinese AI DeepSeek speeds up Nvidia H800 by 8x using FlashMLA to bypass sanctions

Chinese company DeepSeek has introduced FlashMLA, a technology that, according to its developers, significantly increases the performance of Nvidia Hopper H800 chips.

 

What is FlashMLA?

 

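FlashMLA is a software optimization, a decoding kernel for Multi-head Latent Attention (MLA), that improves the performance of Nvidia Hopper GPUs without hardware changes. According to DeepSeek, it reaches up to 3000 GB/s of memory bandwidth on the H800 in memory-bound workloads, reportedly almost twice the standard maximum.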
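
To see why memory throughput matters, here is a rough back-of-the-envelope sketch. In memory-bound work, the time for one step cannot be lower than the bytes read divided by the achieved throughput. The 3000 GB/s value is the figure quoted above; the 6 GB cache size and the 1500 GB/s comparison point are invented purely for illustration and are not measurements.

```python
def min_read_time_ms(bytes_to_read: float, throughput_gb_s: float) -> float:
    """Lower bound, in milliseconds, on the time to stream the data once."""
    bytes_per_second = throughput_gb_s * 1e9  # GB/s -> bytes/s
    return bytes_to_read / bytes_per_second * 1e3


# Hypothetical amount of cached data read on every decoding step.
CACHE_BYTES = 6e9

for throughput in (1500.0, 3000.0):  # 1500 is an assumed baseline
    ms = min_read_time_ms(CACHE_BYTES, throughput)
    print(f"{throughput:.0f} GB/s -> at least {ms:.2f} ms per step")
```

Under these assumptions, doubling the achieved throughput halves the lower bound on step time, which is why a kernel that uses memory more efficiently can matter as much as faster hardware.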

 

  • Low-rank key-value compression: keys and values are stored in a compact low-rank form rather than at full size, so less data has to be read at each step.
  • Optimized memory usage: reduces memory consumption by 40–60%.
  • Dynamic resource allocation: a paged memory system allocates memory according to the task, which speeds up the processing of variable-length sequences.
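
The ideas in this list can be illustrated with a small sketch. It is not DeepSeek's code: the head count, head size, latent size, block size and sequence lengths are all hypothetical values chosen for the example, and the result is not meant to reproduce the 40–60% figure. The sketch compares how much is cached per token with and without a shared low-rank latent, and how many cache slots are reserved when sequences of different lengths are stored in fixed-size pages instead of being padded to the longest one.

```python
import math


def full_kv_elems_per_token(num_heads: int, head_dim: int) -> int:
    """Elements cached per token when full keys and values are kept."""
    return 2 * num_heads * head_dim  # one key and one value vector per head


def latent_elems_per_token(latent_dim: int) -> int:
    """Elements cached per token when only a low-rank latent is kept."""
    return latent_dim


def blocks_needed(seq_len: int, block_size: int) -> int:
    """Fixed-size cache blocks (pages) needed to hold one sequence."""
    return math.ceil(seq_len / block_size)


def paged_slots(seq_lens: list[int], block_size: int) -> int:
    """Token slots reserved when each sequence gets only the pages it needs."""
    return sum(blocks_needed(n, block_size) for n in seq_lens) * block_size


def padded_slots(seq_lens: list[int]) -> int:
    """Token slots reserved when every sequence is padded to the longest."""
    return max(seq_lens) * len(seq_lens)


# All values below are hypothetical, for illustration only.
NUM_HEADS, HEAD_DIM, LATENT_DIM, BLOCK_SIZE = 32, 128, 512, 64
SEQ_LENS = [100, 1000, 37]

print("full K/V per token:", full_kv_elems_per_token(NUM_HEADS, HEAD_DIM))
print("latent per token:  ", latent_elems_per_token(LATENT_DIM))
print("padded slots:      ", padded_slots(SEQ_LENS))
print("paged slots:       ", paged_slots(SEQ_LENS, BLOCK_SIZE))
```

With these made-up numbers, the latent cache is much smaller per token than full keys and values, and paging reserves far fewer slots than padding when sequence lengths vary widely.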

 

Bypassing US sanctions?

 

DeepSeek's FlashMLA demonstrates the potential of software optimizations for the Chinese AI industry. In effect, it lets the H800 run at an efficiency close to that of the more powerful H100, whose supply to China is restricted by US sanctions.

 

So far, FlashMLA only works with the H800, but a possible expansion to other models could significantly impact the AI computing market.

 

In addition to DeepSeek, Chinese researchers continue to develop methods to increase the power of available GPUs. Recently, scientists from Shenzhen University and Beijing Institute of Technology increased the performance of the Nvidia RTX 4070 in peridynamics tasks by 800 times. However, this project has military-industrial implications, since it was developed in collaboration with Russian specialists.

