Find the Best GPU for Your LLM
Real benchmarks across -- GPUs and -- models. Pick a model, speed target, and budget.
GPU (used) + CPU + motherboard + 32 GB RAM. Component recommendations sourced from real benchmark hardware. Excludes PSU, case, storage.
Links below are Amazon affiliate links. Clicking costs you nothing extra.
No GPU found within your filters. Try raising the budget or lowering the minimum speed.
Benchmark Results
Tokens/second (median). Click column headers to sort. GPU names link to Amazon.
| Loading… |
Legend: ⚠ CPU offload — model exceeds VRAM, layers run on system RAM | OOM — tested, failed: not enough free RAM | ~ Partial offload — minor penalty
Model Deep Dive
Compare GPU performance for a specific model. See how different hardware stacks up.
tok/s — raw inference speed
Config Impact
Same GPU, different machine configurations (CPU / RAM). Shows variance from real-world environment differences.