Compare
Compare local AI hardware with workload-aware output.
GTX 1060 6GB wins for coding in balanced mode
Based on model fit, speed, and quality across top recommendations.
GTX 1060 6GB
WinnerAGemma 4 E2B
llama.cppQ4_K_MTight fit
5.1 GB / 6.0 GB
30.0 tok/s42K ctx
SQwen 3.5 4B
llama.cppQ4_K_MRuns with offload (needs ~0.1 GB host RAM)
6.1 GB / 6.0 GB
34.6 tok/s15K ctx
AGranite 4.1 3B
llama.cppQ4_K_MRuns well
4.6 GB / 6.0 GB
42.0 tok/s35K ctx
Quick comparison
| Metric | GTX 1060 6GB | Intel Arc Pro A40 6GB |
|---|---|---|
| Models that fit | 3 | 3 |
| Avg decode tok/s | 35.5 | 32.2 |
| Best grade score | 91 | 90 |
