Thanks @satyanadella ๐
Great work with @Azure on one of the largest MLPerf Training submission to-date on NVIDIA Blackwell: 8,192 GPUs on NVIDIA GB200 NVL72 systems, Llama 3.1 405B training target met in only 7.07 minutes.
More to come!
New Azure milestone. The fastest time to train yet at the largest reported scale for this leading AI training benchmark.
A great example of what is possible when we bring together full-stack innovation across silicon, systems, networking, and software, along with our deep
