12GB VRAM Models for Ollama

Balanced tier for 7B/13B with practical local runs.

Recommended models

Model Fit Expected tok/s
Mistral 7B Q8 good 18-28
Qwen 14B Q4 conditional 12-18
Check your fit Hardware upgrade Cloud GPU fallback