Compare
Compare local AI hardware with workload-aware output.
RTX 4090 24GB wins for coding in balanced mode
Based on model fit, speed, and quality across top recommendations.
RTX 4080 Super 16GB
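| Grade | Model | Runtime | Quantization | Status | VRAM used / total | Decode speed | Context |
|---|---|---|---|---|---|---|---|
| S | Qwen 3.5 9B | llama.cpp | q4-k-m | Runs well | 10.2 GB / 16.0 GB | 119.6 tok/s | 58K |
| S | Nemotron Nano 9B v2 | llama.cpp | q4-k-m | Runs well | 10.4 GB / 16.0 GB | 119.6 tok/s | 52K |
| A | CodeGeeX 4 9B | llama.cpp | q6-k | Runs well | 10.5 GB / 16.0 GB | 95.1 tok/s | 131K |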
Quick comparison
| Metric | RTX 4080 Super 16GB | RTX 4090 24GB |
|---|---|---|
| Models that fit | 3 | 3 |
| Avg decode tok/s | 111.4 | 37.9 |
| Best grade score | 97 | 93 |
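
The "Avg decode tok/s" figure for the RTX 4080 Super can be checked against the three model cards listed for that GPU. The sketch below assumes the tool reports a plain arithmetic mean of the per-model decode speeds; the page does not state its method. The names `cards`, `TOTAL_VRAM_GB`, `avg_tok_s`, and `headroom` are illustrative and not part of the tool.

```python
from statistics import mean

# Figures copied from the three RTX 4080 Super 16GB model cards.
cards = [
    {"model": "Qwen 3.5 9B", "vram_gb": 10.2, "tok_s": 119.6},
    {"model": "Nemotron Nano 9B v2", "vram_gb": 10.4, "tok_s": 119.6},
    {"model": "CodeGeeX 4 9B", "vram_gb": 10.5, "tok_s": 95.1},
]
TOTAL_VRAM_GB = 16.0  # card capacity shown on each model card

# Assumption: the summary row is an unweighted mean of decode speeds.
avg_tok_s = round(mean(c["tok_s"] for c in cards), 1)

# Free VRAM left after loading each model at its listed quantization.
headroom = {c["model"]: round(TOTAL_VRAM_GB - c["vram_gb"], 1) for c in cards}

print(f"Average decode speed: {avg_tok_s} tok/s")
for model, free_gb in headroom.items():
    print(f"{model}: {free_gb} GB free")
```

Under that assumption the computed mean agrees with the 111.4 tok/s in the table. The RTX 4090 column cannot be checked the same way because its model cards are not listed here.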
