Qwen3 Coder 30B Q5
Popular Ollama model family: Qwen3 Coder. Caveat: Estimated values are placeholders unless marked measured..
Hardware Snapshot
| Family | Qwen3 Coder |
|---|---|
| Scenario | coding |
| License scope | open-source |
| Quantization | Q5 |
| VRAM minimum | 20GB |
| VRAM optimal | 30GB |
| Best local GPU | RTX 6000 Ada 48GB |
| Cloud fallback | A100 80GB |
| Updated | 2026-02-24 |
| Data status | Verified by Real Hardware |
| Ollama source | Library reference (verified: 2026-02-24) |
| Ollama tag | qwen3-coder:30b |
| Category | coding |
Benchmark Anchors
| Hardware | Expected tok/s |
|---|---|
| RTX 3090 24GB | 9.9 |
| RTX 4090 24GB | 13.4 |
| A100 80GB | 23.8 |
Real Hardware Benchmark (RTX 3090)
| Tokens/s | 140.492 |
|---|---|
| Latency | 935 ms |
| Prompt tokens | 29 |
| Eval tokens | 81 |
| Test time | 2026-06-17T07:31:11Z |
| GPU model | NVIDIA GeForce RTX 3090 |
Verified by real hardware.
Performance Curve
Reference anchors are baseline estimates. Measured RTX 3090 data is overlaid when available.
Best Hardware for Qwen3 Coder 30B Q5
- Local run: RTX 3090 (24GB) (Check latest deal) for around 140.492 tok/s on this profile.
- Cloud run: RunPod A100 80GB , about 0.2x the local 3090 speed anchor.
- Alternative cloud: Vast.ai options for flexible spot pricing.
Local vs Cloud Cost Hint
| Mode | 40h / month | 120h / month |
|---|---|---|
| Local power only (3090 baseline) | $2.24 | $6.72 |
| A100 80GB | $78 | $234 |
Related Model Profiles
- Qwen3 Coder 30B Q4 18GB min, 28GB optimal
- Qwen3 Coder 30B Q8 24GB min, 34GB optimal
- Qwen3 Coder 30B FP16 30GB min, 42GB optimal
- Qwen3 Coder 30B CLOUD 20GB min, 28GB optimal
ollama run qwen3-coder:30b More coding models More 30b-34b models Benchmark changelog Submit your test result Run on RunPod Try Vast.ai We may earn a commission if you click links on this page.