Compare
Compare local AI hardware with workload-aware output.
NVIDIA H200 PCIe 141GB wins for coding in balanced mode
Based on model fit, speed, and quality across top recommendations.
NVIDIA H200 PCIe 141GB
WinnerSQwen 3.5 122B A10B
llama.cppq4-k-mRuns well
91.9 GB / 141.0 GB
162.1 tok/s131K ctx
SDevstral 2 123B Instruct
llama.cppq4-k-mRuns well
95.4 GB / 141.0 GB
58.4 tok/s152K ctx
SQwen3-Coder-Next
llama.cppq8-0Runs well
102.1 GB / 141.0 GB
170.7 tok/s256K ctx
Quick comparison
| Metric | NVIDIA H200 PCIe 141GB | RTX 4090 24GB |
|---|---|---|
| Models that fit | 3 | 3 |
| Avg decode tok/s | 130.4 | 37.9 |
| Best grade score | 98 | 93 |
