Llama.cpp AI Performance With The GeForce RTX 5090
When looking at Mistral 7B for its 128 token text generation, it was showing off excellent generational uplift and similar in scope to the Llama 3.1 win... The RTX 5090 managed 1.58x the performance of the RTX 4090.
On a performance-per-Watt basis this $1999 USD graphics card remained comparable to the RTX 4080 / 4090 graphics cards.
The GPU temperatures of this NVIDIA GeForce RTX 5090 Founders Edition graphics card continue to be great for being a dual-slot graphics card and considering its higher power use.
For prompt processing with Mistral 7B, the RTX 5090 was at 1.17x the performance of the RTX 4090.
Let me know by commenting in the forums if interested in seeing more Llama.cpp GPU benchmarks moving forward. Apologies for the brief testing due to only having a NVIDIA RTX 50 Linux driver build for a few days. Thanks to NVIDIA for providing the GeForce RTX 5090 review sample for Linux testing at Phoronix.
If you enjoyed this article consider joining Phoronix Premium to view this site ad-free, multi-page articles on a single page, and other benefits. PayPal or Stripe tips are also graciously accepted. Thanks for your support.
