Compare
Compare local AI hardware with workload-aware output.
Operating mode: Balanced. Tuned for general local use; keeps the ranking neutral across personal and serving workflows.
Intel Arc Pro A40 6GB wins for coding in balanced mode
Based on model fit, speed, and quality across top recommendations.
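Intel Arc Pro A40 6GB (Winner)
| Grade | Model | Runtime | Quantization | Fit | Memory | Decode speed | Context |
|---|---|---|---|---|---|---|---|
| A | Gemma 4 E2B | llama.cpp | Q4_K_M | Tight fit | 5.1 GB / 6.0 GB | 24.9 tok/s | 42K |
| S | Qwen 3.5 4B | llama.cpp | Q4_K_M | Runs with offload (needs ~0.1 GB host RAM) | 6.1 GB / 6.0 GB | 29.6 tok/s | 15K |
| A | Granite 4.1 3B | llama.cpp | Q4_K_M | Runs well | 4.6 GB / 6.0 GB | 42.0 tok/s | 35K |
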
Intel Arc A380 6GB
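| Grade | Model | Runtime | Quantization | Fit | Memory | Decode speed | Context |
|---|---|---|---|---|---|---|---|
| A | Gemma 4 E2B | llama.cpp | Q4_K_M | Tight fit | 5.1 GB / 6.0 GB | 24.1 tok/s | 42K |
| S | Qwen 3.5 4B | llama.cpp | Q4_K_M | Runs with offload (needs ~0.1 GB host RAM) | 6.1 GB / 6.0 GB | 28.7 tok/s | 15K |
| A | Granite 4.1 3B | llama.cpp | Q4_K_M | Runs well | 4.6 GB / 6.0 GB | 42.0 tok/s | 35K |
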
Quick comparison
| Metric | Intel Arc Pro A40 6GB | Intel Arc A380 6GB |
|---|---|---|
| Models that fit | 3 | 3 |
| Avg decode tok/s | 32.2 | 31.6 |
| Best grade score | 90 | 90 |
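
The summary figures can be cross-checked against the per-model listings. The sketch below is not the tool's actual implementation. It assumes that "Avg decode tok/s" is a simple mean over the three listed models and that the host RAM needed for offload is the model's memory requirement minus the card's 6.0 GB. Both assumptions reproduce the numbers shown on this page. The names `ROWS`, `avg_decode`, `host_ram_needed`, and `summarize` are hypothetical.

```python
from statistics import mean

VRAM_GB = 6.0  # both cards are listed with 6.0 GB

# (model, memory required in GB, decode tok/s), copied from the listings above
ROWS = {
    "Intel Arc Pro A40 6GB": [
        ("Gemma 4 E2B", 5.1, 24.9),
        ("Qwen 3.5 4B", 6.1, 29.6),
        ("Granite 4.1 3B", 4.6, 42.0),
    ],
    "Intel Arc A380 6GB": [
        ("Gemma 4 E2B", 5.1, 24.1),
        ("Qwen 3.5 4B", 6.1, 28.7),
        ("Granite 4.1 3B", 4.6, 42.0),
    ],
}


def avg_decode(rows):
    """Mean decode speed across the listed models, to one decimal place."""
    return round(mean(speed for _, _, speed in rows), 1)


def host_ram_needed(required_gb, vram_gb=VRAM_GB):
    """Assumed offload rule: whatever exceeds VRAM spills to host RAM."""
    return round(max(0.0, required_gb - vram_gb), 1)


def summarize(rows_by_card):
    """Rebuild the count and average rows of the quick comparison table."""
    return {
        card: {"models": len(rows), "avg_decode_tok_s": avg_decode(rows)}
        for card, rows in rows_by_card.items()
    }


if __name__ == "__main__":
    for card, stats in summarize(ROWS).items():
        print(card, stats)
    print("Qwen 3.5 4B host RAM (GB):", host_ram_needed(6.1))
```

Under these assumptions the averages come out to 32.2 and 31.6 tok/s, matching the table, and the Qwen 3.5 4B overflow is about 0.1 GB, matching the "Runs with offload" note. The difference between the two cards comes entirely from the Gemma and Qwen rows, since Granite 4.1 3B is listed at 42.0 tok/s on both.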
