VOOZH about

URL: https://www.phoronix.com/review/amd-epyc-9755-avx512/7

⇱ AVX-512 Performance With 256-bit vs. 512-bit Data Path For AMD EPYC 9005 CPUs Review - Phoronix


👁 Phoronix

AVX-512 Performance With 256-bit vs. 512-bit Data Path For AMD EPYC 9005 CPUs

Written by Michael Larabel in Processors on 11 October 2024 at 10:40 AM EDT. Page 7 of 7. 21 Comments.

Running with the AVX-512 FP512 default mode didn't show any hit to the CPU peak frequency being achieved nor did it when operating in the 256-bit data path configuration.

There was also no significant difference in the CPU power consumption with either the AVX-512 FP256 or FP512 modes. The AMD EPYC 9755 across the assortment of benchmarks was consuming 292~305 Watts on average with a peak of 502~505 Watts, right in line with the default TDP rating.

There was also no real difference in the AMD EPYC 9755 core temperature across the different AVX-512 modes... Much better than the early days of AVX-512 on Intel Xeon where there was a significant impact to power/thermals from leveraging Advanced Vector Extensions 512. The AMD EPYC 9755 with the liquid cooling within the AMD Volcano reference server allowed the CPU to operate at 49~50 degrees (C) on average during these AVX-512 benchmarks and peaked at 63~67 degrees during the most harsh workloads.

When taking the geometric mean of all the raw AVX-512 performance benchmarks, AVX-512 in the default FP512 configuration yielded 1.45x the performance compared to disabling AVX-512 outright. Having the 512-bit data path allowed for 1.12x the performance compared to running the EPYC 9755 processor in the 256-bit data path mode, similar to how AVX-512 operates with Zen 4.

So all-in these latest benchmarks do show that the 512-bit data path mode for AVX-512 on AMD Zen 5 is valuable for servers/HPC and like with the FP256 mode there isn't any measurable impact to the CPU power consumption / clock frequencies / thermals. Similar to what I've seen with the Ryzen 9000 series desktop processors, AVX-512 with AMD Zen 5 appears to be very performant and robust. With AMD EPYC 9005 series for HPC servers this 512-bit data path is likely to be quite rewarding and a good design choice by AMD. The timing is important as well with increasing AVX-512 usage outside of HPC to speedy JSON parsing, AVX-512 optimizations for PostgreSQL, and other more conventional workloads.

If you enjoyed this article consider joining Phoronix Premium to view this site ad-free, multi-page articles on a single page, and other benefits. PayPal or Stripe tips are also graciously accepted. Thanks for your support.

Michael Larabel is the principal author of Phoronix.com and founded the site in 2004 with a focus on enriching the Linux hardware experience. Michael has written more than 20,000 articles covering the state of Linux hardware support, Linux performance, graphics drivers, and other topics. Michael is also the lead developer of the Phoronix Test Suite, Phoromatic, and OpenBenchmarking.org automated benchmarking software. He can be followed via Twitter, LinkedIn, or contacted via MichaelLarabel.com.