Compare
Compare local AI hardware with workload-aware output.
Operating mode: Balanced. Tuned for general local use, this mode keeps the ranking neutral across personal and serving workflows.
GTX 1080 Ti 11GB wins for coding in balanced mode
Based on model fit, speed, and quality across top recommendations.
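GTX 1080 Ti 11GB (Winner)

| Grade | Model | Runtime | Quantization | Fit | VRAM used / total | Decode speed | Context |
|---|---|---|---|---|---|---|---|
| S | Qwen 3.5 9B | llama.cpp | Q4_K_M | Tight fit | 9.7 GB / 11.0 GB | 55.9 tok/s | 26K |
| A | Gemma 4 E4B | llama.cpp | Q4_K_M | Runs well | 8.2 GB / 11.0 GB | 47.7 tok/s | 51K |
| A | CodeGeeX 4 9B | llama.cpp | Q4_K_M | Runs well | 8.1 GB / 11.0 GB | 56.9 tok/s | 92K |
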
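RTX 3080 10GB

| Grade | Model | Runtime | Quantization | Fit | VRAM used / total | Decode speed | Context |
|---|---|---|---|---|---|---|---|
| A | Gemma 4 E4B | llama.cpp | Q4_K_M | Runs well | 8.1 GB / 10.0 GB | 81.0 tok/s | 40K |
| A | CodeGeeX 4 9B | llama.cpp | Q4_K_M | Runs well | 8.0 GB / 10.0 GB | 96.7 tok/s | 68K |
| S | Qwen 3.5 9B | llama.cpp | Q4_K_M | Runs with offload | 9.6 GB / 10.0 GB | 91.2 tok/s | 19K |
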
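
The fit labels are easier to read next to how much of each card's VRAM a model occupies. The sketch below computes that share from the VRAM figures listed above. The names `CARDS`, `utilization_pct`, and `utilization_report` are illustrative and not part of the comparison tool, and the thresholds the tool uses to assign "Runs well", "Tight fit", or "Runs with offload" are not stated here, so the code only reports percentages.

```python
# (model, VRAM used in GB, VRAM total in GB), copied from the listings above.
# All names in this block are hypothetical; they do not come from the tool.
CARDS = {
    "GTX 1080 Ti 11GB": [
        ("Qwen 3.5 9B", 9.7, 11.0),
        ("Gemma 4 E4B", 8.2, 11.0),
        ("CodeGeeX 4 9B", 8.1, 11.0),
    ],
    "RTX 3080 10GB": [
        ("Gemma 4 E4B", 8.1, 10.0),
        ("CodeGeeX 4 9B", 8.0, 10.0),
        ("Qwen 3.5 9B", 9.6, 10.0),
    ],
}


def utilization_pct(used_gb: float, total_gb: float) -> float:
    """Share of VRAM the model occupies, as a percentage rounded to 0.1."""
    return round(100 * used_gb / total_gb, 1)


def utilization_report(cards: dict) -> dict:
    """Map each GPU to {model: VRAM utilization in percent}."""
    return {
        gpu: {model: utilization_pct(used, total) for model, used, total in rows}
        for gpu, rows in cards.items()
    }


if __name__ == "__main__":
    for gpu, models in utilization_report(CARDS).items():
        print(gpu)
        for model, pct in models.items():
            print(f"  {model}: {pct}% of VRAM")
```

With these inputs, Qwen 3.5 9B occupies about 88% of the GTX 1080 Ti and 96% of the RTX 3080. That matches the stricter fit label on the 10 GB card, though the exact cutoffs remain an assumption.
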
Quick comparison
| Metric | GTX 1080 Ti 11GB | RTX 3080 10GB |
|---|---|---|
| Models that fit | 3 | 3 |
| Avg decode tok/s | 53.5 | 89.6 |
| Best grade score | 94 | 95 |
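
The "Avg decode tok/s" row can be reproduced from the per-model speeds listed earlier. The sketch below assumes the average is a plain arithmetic mean over the three recommended models per GPU, which matches the table values. The names `DECODE_TOK_S` and `average_decode` are illustrative, not taken from the tool.

```python
from statistics import mean

# Per-model decode speeds in tok/s, copied from the listings above.
# Names here are hypothetical and not part of the comparison tool.
DECODE_TOK_S = {
    "GTX 1080 Ti 11GB": [55.9, 47.7, 56.9],
    "RTX 3080 10GB": [81.0, 96.7, 91.2],
}


def average_decode(speeds_by_gpu: dict) -> dict:
    """Arithmetic mean of decode speed per GPU, rounded to 0.1 tok/s."""
    return {gpu: round(mean(speeds), 1) for gpu, speeds in speeds_by_gpu.items()}


if __name__ == "__main__":
    averages = average_decode(DECODE_TOK_S)
    for gpu, avg in averages.items():
        print(f"{gpu}: {avg} tok/s")

    # Ratio of the unrounded means, to show the raw speed gap between the cards.
    ratio = mean(DECODE_TOK_S["RTX 3080 10GB"]) / mean(DECODE_TOK_S["GTX 1080 Ti 11GB"])
    print(f"RTX 3080 decodes about {ratio:.2f}x faster on average")
```

The RTX 3080 is faster on raw decode, so the GTX 1080 Ti's win in balanced mode presumably rests on the other factors the ranking weighs, such as fit and context length.
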
