AMD Ryzen AI Max+ "Strix Halo" Performance With ROCm 7.0
Llama 3.1 also performed much better with Vulkan than with the HIP defaults on Strix Halo when using Llama.cpp 6401.
Mistral also ran fine with ROCm 7.0 on Strix Halo, but the best performance came from simply using Vulkan with the stock RADV Vulkan driver.
Some of these Vulkan leads are much more significant than what we have seen from ROCm HIP vs. Vulkan comparisons for Llama.cpp on discrete Radeon GPUs.
Llama.cpp with ROCm 7.0 installed on Ubuntu 24.04 LTS worked with the AMD Ryzen AI Max+ 395 (Strix Halo) in the Framework Desktop, but when testing in the default configuration the best performance continued to come from the Vulkan back-end.
