Hardware VRAM Tiers

8GB VRAM

Entry tier for lightweight chat and coding models.

Balanced tier for 7B/13B with practical local runs.

Comfort zone for 13B/14B and selected 32B Q4 setups.

High-value tier for large quantized models and long context tests.

Unified memory planning guide for local LLM inference on M-series devices.

Measured model tags and sustained-load benchmark snapshots on 3090.