Nemotron 3 Nano 30B FP16

Auto-discovered from measured benchmark results.. Caveat: Auto-generated family metadata; review for taxonomy accuracy..

Hardware Snapshot

Family	Nemotron 3 Nano
Scenario	chat
License scope	open-source
Quantization	FP16
VRAM minimum	30GB
VRAM optimal	42GB
Best local GPU	RTX 6000 Ada 48GB
Cloud fallback	A100 80GB
Updated	2026-02-24
Data status	Verified by Real Hardware
Ollama source	Library reference (verified: 2026-02-24)
Ollama tag	`nemotron-3-nano:30b`
Category	chat

Benchmark Anchors

Hardware	Expected tok/s
RTX 3090 24GB	6.1
RTX 4090 24GB	8.2
A100 80GB	14.6

Real Hardware Benchmark (RTX 3090)

Tokens/s	57.048
Latency	2468 ms
Prompt tokens	37
Eval tokens	96
Test time	2026-04-01T11:53:50Z
GPU model	NVIDIA GeForce RTX 3090

Verified by real hardware.

View raw nvidia-smi snapshot

Performance Curve

Reference anchors are baseline estimates. Measured RTX 3090 data is overlaid when available.

Best Hardware for Nemotron 3 Nano 30B FP16

Local run: RTX 3090 (24GB) (Check latest deal) for around 57.048 tok/s on this profile.
Cloud run: RunPod A100 80GB , about 0.3x the local 3090 speed anchor.
Alternative cloud: Vast.ai options for flexible spot pricing.

Local vs Cloud Cost Hint

Mode	40h / month	120h / month
Local power only (3090 baseline)	$2.24	$6.72
A100 80GB	$78	$234

Related Model Profiles

Nemotron 3 Nano 30B Q4 18GB min, 28GB optimal
Nemotron 3 Nano 30B Q5 20GB min, 30GB optimal
Nemotron 3 Nano 30B Q8 24GB min, 34GB optimal

ollama run nemotron-3-nano:30b More chat models More 30b-34b models Benchmark changelog Submit your test result Run on RunPod Try Vast.ai

We may earn a commission if you click links on this page.