Compare
Compare local AI hardware with workload-aware output.
NVIDIA H200 PCIe 141GB wins for coding in balanced mode
Based on model fit, speed, and quality across top recommendations.
NVIDIA H200 PCIe 141GB
WinnerSDevstral 2 123B Instruct
llama.cppQ4_K_MRuns well
95.4 GB / 141.0 GB
58.4 tok/s152K ctx
SQwen 3.5 122B A10B
llama.cppQ4_K_MRuns well
91.9 GB / 141.0 GB
162.1 tok/s131K ctx
SMistral Small 4 119B
llama.cppQ4_K_MRuns well
93.0 GB / 141.0 GB
175.8 tok/s159K ctx
Quick comparison
| Metric | NVIDIA H200 PCIe 141GB | NVIDIA H200 141GB |
|---|---|---|
| Models that fit | 3 | 3 |
| Avg decode tok/s | 132.1 | 132.1 |
| Best grade score | 98 | 98 |
