Compare
Compare local AI hardware with workload-aware output.
NVIDIA GH200 96GB wins for coding in balanced mode
Based on model fit, speed, and quality across top recommendations.
NVIDIA GH200 96GB
WinnerSQwen3-Coder-Next
llama.cppQ4_K_MRuns well
60.8 GB / 96.0 GB
218.8 tok/s256K ctx
SQwen 3.5 122B A10B
llama.cppQ4_K_MTight fit
87.4 GB / 96.0 GB
130.3 tok/s73K ctx
SMistral Small 4 119B
llama.cppQ4_K_MTight fit
88.5 GB / 96.0 GB
141.2 tok/s38K ctx
Quick comparison
| Metric | NVIDIA GH200 96GB | NVIDIA H20 96GB |
|---|---|---|
| Models that fit | 3 | 3 |
| Avg decode tok/s | 163.4 | 163.4 |
| Best grade score | 96 | 96 |
