Ministral 3 14B Q4

Auto-discovered from measured benchmark results.. Caveat: Auto-generated family metadata; review for taxonomy accuracy..

Hardware Snapshot

Family Ministral 3
Scenario chat
License scope open-source
Quantization Q4
VRAM minimum 10GB
VRAM optimal 20GB
Best local GPU RTX 3090 24GB
Cloud fallback A6000 48GB
Updated 2026-02-24
Data status Verified by Real Hardware
Ollama source Library reference (verified: 2026-02-24)
Ollama tag ministral-3:14b
Category chat

Benchmark Anchors

Hardware Expected tok/s
RTX 3090 24GB 21
RTX 4090 24GB 28.4
A100 80GB 50.4

Real Hardware Benchmark (RTX 3090)

Tokens/s 82.665
Latency 2390 ms
Prompt tokens 575
Eval tokens 128
Test time 2026-04-01T11:53:50Z
GPU model NVIDIA GeForce RTX 3090

Verified by real hardware.

View raw nvidia-smi snapshot

Performance Curve

Reference anchors are baseline estimates. Measured RTX 3090 data is overlaid when available.

Best Hardware for Ministral 3 14B Q4

Local vs Cloud Cost Hint

Mode 40h / month 120h / month
Local power only (3090 baseline) $2.24 $6.72
A6000 48GB $30.4 $91.2
ollama run ministral-3:14b More chat models More 14b-class models Benchmark changelog Submit your test result Check local GPU upgrade

We may earn a commission if you click links on this page.