NVIDIA
Desktop GPU
Desktop GPU

NVIDIA Grace Blackwell Ultra GB300

288 GB HBM3e Β· 8000 GB/s

From

$30,000

Estimated street price

VRAM

288 GB

Bandwidth

8000 GB/s

TDP

1200W

Models

64

Tier

Datacenter-Class

The NVIDIA Grace Blackwell Ultra GB300 with 288 GB HBM3e VRAM can handle 64 AI models across embedding, ai_building, coding. Best performance: all-MiniLM-L6-v2 at 3000 tok/s (excellent). For AI coding workflows, it supports the Full AI Builder tier, supporting concurrent coding + reasoning + embeddings. Current price: approximately $30,000.

Source: OwnRig methodology

VRAM

288 GB

Bandwidth

8000 GB/s

Memory Type

HBM3e

TDP

1200W

Form Factor

Integrated module

Builder Capability: Datacenter-Class AI Workstation

Runs very large models at high precision with room for long context windows. Best suited to Linux-first, DGX-style professional deployments rather than a typical consumer PC build.

Software

Inference Backends

The software stacks that matter most for real-world inference on this device.

CUDA

production

Primary datacenter inference backend for NVIDIA's GB300 platform.

What it can run

64 models
all-MiniLM-L6-v2FP163000 tok/sExcellent
Arcee Trinity Large Thinking 400BQ4_K_M41 tok/sExcellent
Arcee Trinity Mini 26BQ8_0332 tok/sExcellent
Arcee Trinity Nano 6BQ8_01411 tok/sExcellent
Code Llama 34B InstructQ5_K_M135 tok/sExcellent
Codestral 22BQ5_K_M180 tok/sExcellent
Command R 35BQ8_0110 tok/sExcellent
DeepSeek Coder V2 Lite 16BQ8_0210 tok/sExcellent
DeepSeek R1Q4_K_M20 tok/sGood
DeepSeek R1 Distill Qwen 32BQ8_0120 tok/sExcellent
DeepSeek R1 Distill Qwen 7BQ8_0360 tok/sExcellent
DeepSeek V3Q4_K_M22 tok/sGood
FLUX.1 DevFP1615 tok/sExcellent
Gemma 2 27B InstructQ5_K_M145 tok/sExcellent
Gemma 2 9B InstructQ8_0320 tok/sExcellent
Gemma 3 12BQ8_0250 tok/sExcellent
Gemma 3 27BQ8_0130 tok/sExcellent
Gemma 3 4BQ8_0500 tok/sExcellent
Gemma 4 26B-A4BQ8_0500 tok/sExcellent
Gemma 4 31BQ8_0183 tok/sExcellent
Gemma 4 E2BQ8_0500 tok/sExcellent
Gemma 4 E4BQ8_0500 tok/sExcellent
GigaChat Lightning 10BQ8_0320 tok/sExcellent
InternLM 2.5 7B ChatQ8_0350 tok/sExcellent
Llama 3.1 70B InstructQ5_K_M65 tok/sExcellent
Llama 3.1 8B InstructQ8_0350 tok/sExcellent
Llama 3.2 11B VisionQ8_0260 tok/sExcellent
Llama 3.2 1B InstructQ8_0800 tok/sExcellent
Llama 3.2 3B InstructQ8_0650 tok/sExcellent
Llama 3.3 70B InstructQ8_055 tok/sExcellent
Llama 4 ScoutQ8_040 tok/sExcellent
LLaVA 1.6 13BQ5_K_M270 tok/sExcellent
Mistral 7B Instruct v0.3Q8_0380 tok/sExcellent
Mistral Large 2 123BQ8_030 tok/sGood
Mistral Small 24B InstructQ8_0150 tok/sExcellent
Mixtral 8x7B InstructQ5_K_M100 tok/sExcellent
nomic-embed-text v1.5FP162000 tok/sExcellent
NVIDIA Nemotron-3-super-120B-A12BQ4_K_M180 tok/sExcellent
Phi-3 Medium 14B InstructQ8_0230 tok/sExcellent
Phi-3 Mini 3.8B InstructQ8_0550 tok/sExcellent
Phi-4 14BQ8_0220 tok/sExcellent
Phi-4 MiniQ8_0580 tok/sExcellent
Qwen 2.5 14B InstructQ8_0220 tok/sExcellent
Qwen 2.5 72B InstructQ4_K_M60 tok/sExcellent
Qwen 2.5 7B InstructQ8_0360 tok/sExcellent
Qwen 2.5 Coder 32B InstructQ5_K_M140 tok/sExcellent
Qwen 2.5 Coder 7B InstructQ8_0360 tok/sExcellent
Qwen3-14B InstructQ8_0230 tok/sExcellent
Qwen3-30B-A3BQ8_0145 tok/sExcellent
Qwen3-32B InstructQ8_0120 tok/sExcellent
Qwen3-8B InstructQ8_0340 tok/sExcellent
Qwen3.5-122B-A10BQ8_0200 tok/sExcellent
Qwen3.5-27BQ8_0150 tok/sExcellent
Qwen3.5-397B (MoE)Q4_K_M120 tok/sExcellent
Qwen3.6-27BQ8_0150 tok/sExcellent
Qwen3.6-35B-A3BQ5_K_M145 tok/sExcellent
QwQ 32B PreviewQ5_K_M140 tok/sExcellent
Stable Diffusion 3 MediumFP1620 tok/sExcellent
Stable Diffusion 3.5 LargeFP1612 tok/sExcellent
Stable Diffusion XL 1.0FP1618 tok/sExcellent
StarCoder 2 15BQ8_0210 tok/sExcellent
Whisper Large V3FP16450 tok/sExcellent
Whisper Large V3 TurboFP16600 tok/sExcellent
Yi 1.5 34B ChatQ8_0110 tok/sExcellent

Showing 64 of 64 entries

Ready to Buy

Available in these Machines

Buy Used

Prices and availability vary. Inspect hardware before purchasing. Some links may be affiliate links.

FAQ

Frequently Asked Questions

What AI models can NVIDIA Grace Blackwell Ultra GB300 run?
The NVIDIA Grace Blackwell Ultra GB300 can run 64 AI models. Top performers include all-MiniLM-L6-v2, nomic-embed-text v1.5, Arcee Trinity Nano 6B. See the full compatibility table above for speeds and quality ratings.
Is NVIDIA Grace Blackwell Ultra GB300 good for AI coding?
Yes. With 288 GB, the NVIDIA Grace Blackwell Ultra GB300 supports the Full AI Builder tier: concurrent coding + reasoning + embeddings.
How much VRAM does NVIDIA Grace Blackwell Ultra GB300 have?
The NVIDIA Grace Blackwell Ultra GB300 has 288 GB of HBM3e VRAM with 8000 GB/s bandwidth.
Can NVIDIA Grace Blackwell Ultra GB300 run 70B models?
Yes. The NVIDIA Grace Blackwell Ultra GB300 can run 70B parameter models in memory at quantized quality.
Is NVIDIA Grace Blackwell Ultra GB300 worth it for AI?
At $30,000, the NVIDIA Grace Blackwell Ultra GB300 offers 288 GB HBM3e VRAM and runs 64 AI models. It handles local AI inference well.

Own this GPU?

See every AI model it supports, expected performance, and how to build around it.

Check my rig