GPU Benchmark
Tokens/sec by Model
Median tokens per second across all benchmark runs on this GPU. CPU offload means the model exceeded VRAM and ran partly on system RAM.
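As a rough illustration of how a per-model figure like this could be derived, the sketch below groups benchmark runs by model and takes the median tokens/sec of each group. The record layout (`model`, `tokens_per_sec`), the function name, and the sample values are all assumptions made for illustration; they are not this page's actual data or schema.

```python
from collections import defaultdict
from statistics import median


def median_tokens_per_sec(runs):
    """Return {model: median tokens/sec} over all runs recorded for that model."""
    by_model = defaultdict(list)
    for run in runs:
        by_model[run["model"]].append(run["tokens_per_sec"])
    return {model: median(values) for model, values in by_model.items()}


# Hypothetical sample runs (made-up numbers, not real benchmark results).
runs = [
    {"model": "model-a", "tokens_per_sec": 40.0},
    {"model": "model-a", "tokens_per_sec": 44.0},
    {"model": "model-a", "tokens_per_sec": 90.0},  # outlier run
    {"model": "model-b", "tokens_per_sec": 10.0},
    {"model": "model-b", "tokens_per_sec": 12.0},
]

print(median_tokens_per_sec(runs))
# → {'model-a': 44.0, 'model-b': 11.0}
```

A median is a reasonable choice here because a single unusually fast or slow run (such as the 90.0 above) does not shift the reported figure the way it would shift a mean.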
Legend:
CPU offload: the model is larger than VRAM and runs on system RAM (2–5× slower).
Partial: some layers spill to system RAM, with a minor speed penalty.
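To make the 2–5× figure concrete, here is a minimal sketch that turns a tokens/sec rate into the range implied by that slowdown. It assumes the slowdown is measured against the speed the same model would reach when it fits entirely in VRAM; the function name and the 50.0 input are hypothetical.

```python
def offload_speed_range(in_vram_tokens_per_sec, min_slowdown=2.0, max_slowdown=5.0):
    """Return (low, high) tokens/sec implied by a min..max slowdown factor."""
    low = in_vram_tokens_per_sec / max_slowdown
    high = in_vram_tokens_per_sec / min_slowdown
    return low, high


# Hypothetical baseline of 50 tokens/sec when the model fits in VRAM.
low, high = offload_speed_range(50.0)
print(f"{low:.1f} to {high:.1f} tokens/sec")
# → 10.0 to 25.0 tokens/sec
```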