AMD ROCm 7.1 vs. RADV Vulkan For Llama.cpp With The Radeon AI PRO R9700
Text generation performance with Llama.cpp was one of the first areas where the Vulkan back-end was performing better than ROCm. Even with the new ROCm 7.1 release, it's still that way with this RDNA4 GPU performing better on the RADV driver with Vulkan back-end.
Prompt processing performance is more mixed between ROCm and Vulkan depending upon the particular large language model and other factors.
With Qwen3 8B the ROCm back-end provided better prompt processing performance on the Radeon AI PRO R9700 graphics card.
With GPT-OSS 20B, the Vulkan back-end was faster for both text generation and prompt processing on this AMD graphics card.
