NVIDIA
Desktop GPU

NVIDIA RTX PRO 6000 Blackwell

96 GB GDDR7 · 1800 GB/s

From $7,500 (estimated street price)

VRAM: 96 GB
Bandwidth: 1800 GB/s
TDP: 600 W
Models: 63
Tier: Full

The NVIDIA RTX PRO 6000 Blackwell, with 96 GB of GDDR7 VRAM, is rated against 63 AI models across the embedding, AI building, and coding categories; three of the largest (DeepSeek R1, DeepSeek V3, and Qwen3.5-397B, all at Q2_K) are rated not viable. Best performance: all-MiniLM-L6-v2 at 3000 tok/s (excellent). For AI coding workflows, it reaches the Full AI Builder tier: concurrent coding, reasoning, and embeddings. Current price: approximately $7,500.

Source: OwnRig methodology

VRAM: 96 GB
Bandwidth: 1800 GB/s
Memory Type: GDDR7
TDP: 600 W
Form Factor: 3-slot, 330 mm
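For single-stream text generation, memory bandwidth sets a rough upper bound on speed, because producing each token requires reading approximately all of the model weights once. The sketch below divides the 1800 GB/s figure by an estimated weight size. The function names are hypothetical, and the 8.5 bits-per-weight value for Q8_0 is an assumption based on llama.cpp's block layout (32 int8 weights plus one fp16 scale). It ignores KV-cache reads, compute limits, and mixture-of-experts sparsity, so treat the result as a ceiling, not a prediction.

```python
def weight_size_gb(params_billion: float, bits_per_weight: float) -> float:
    """Approximate size of the model weights in GB (1 GB = 1e9 bytes)."""
    return params_billion * bits_per_weight / 8.0


def decode_ceiling_tok_s(bandwidth_gb_s: float, weights_gb: float) -> float:
    """Bandwidth-bound ceiling: one full pass over the weights per token."""
    return bandwidth_gb_s / weights_gb


BANDWIDTH_GB_S = 1800.0  # from the spec list above
Q8_0_BITS = 8.5          # assumption: llama.cpp Q8_0 block format

weights = weight_size_gb(70, Q8_0_BITS)  # a dense 70B model at Q8_0
ceiling = decode_ceiling_tok_s(BANDWIDTH_GB_S, weights)
print(f"weights ~ {weights:.1f} GB, ceiling ~ {ceiling:.1f} tok/s")
```

For a dense 70B model at Q8_0 this gives roughly 74 GB of weights and a ceiling of about 24 tok/s. The 14 tok/s listed for Llama 3.3 70B Instruct at Q8_0 sits below that bound, as expected for a real runtime.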

Builder Capability: Full AI Builder

Supports concurrent coding + reasoning + embeddings. Can run 70B models at quantized precision.
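Whether a 70B model fits comes down to simple arithmetic: parameter count times bits per weight, compared against the VRAM left after reserving room for the KV cache and activations. The following is a minimal sketch of that check, not the OwnRig methodology. The helper names and the 10% reserve are hypothetical, and the K-quant bits-per-weight values are rough, commonly quoted approximations.

```python
# Approximate bits per weight. FP16 and Q8_0 follow from the formats
# themselves; the K-quant values are rough approximations (assumptions).
BITS_PER_WEIGHT = {
    "FP16": 16.0,
    "Q8_0": 8.5,
    "Q5_K_M": 5.7,
    "Q4_K_M": 4.85,
}


def weights_gb(params_billion: float, quant: str) -> float:
    """Approximate weight size in GB (1 GB = 1e9 bytes)."""
    return params_billion * BITS_PER_WEIGHT[quant] / 8.0


def fits_in_vram(params_billion: float, quant: str,
                 vram_gb: float = 96.0, reserve_fraction: float = 0.10) -> bool:
    """True if the weights fit after reserving a share of VRAM for
    KV cache and activations (the 10% default is a hypothetical margin)."""
    usable_gb = vram_gb * (1.0 - reserve_fraction)
    return weights_gb(params_billion, quant) <= usable_gb


for quant in ("FP16", "Q8_0", "Q5_K_M"):
    size = weights_gb(70, quant)
    print(f"70B at {quant}: ~{size:.1f} GB, fits: {fits_in_vram(70, quant)}")
```

Under these assumptions a 70B model needs about 140 GB at FP16, which exceeds 96 GB, but fits at Q8_0 (about 74 GB) and Q5_K_M (about 50 GB). Long contexts or several models loaded concurrently would need a larger reserve.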

Software

Inference Backends

The software stacks that matter most for real-world inference on this device.

CUDA (production): Primary high-performance backend for NVIDIA workstation inference.

Vulkan (stable): Fallback backend for llama.cpp and related local runtimes.
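Before committing to the CUDA backend, it can help to confirm that the NVIDIA driver actually sees the card. The sketch below shells out to `nvidia-smi` with its CSV query options and parses the result. The function names are hypothetical, and the code assumes GPU names contain no trailing comma-separated fields beyond the memory column. If `nvidia-smi` is missing or fails, it returns an empty list, which may be a hint to fall back to Vulkan.

```python
import shutil
import subprocess

# Query GPU name and total memory (MiB) as header-less, unit-less CSV.
QUERY = ["nvidia-smi", "--query-gpu=name,memory.total",
         "--format=csv,noheader,nounits"]


def parse_gpu_rows(csv_text: str) -> list[tuple[str, int]]:
    """Parse 'name, memory_mib' lines into (name, memory_mib) tuples."""
    rows = []
    for line in csv_text.splitlines():
        if not line.strip():
            continue  # skip blank lines
        # Split on the last comma so commas inside a name do not break parsing.
        name, mem = (field.strip() for field in line.rsplit(",", 1))
        rows.append((name, int(mem)))
    return rows


def detect_gpus() -> list[tuple[str, int]]:
    """Return detected NVIDIA GPUs, or [] if nvidia-smi is unavailable."""
    if shutil.which("nvidia-smi") is None:
        return []
    result = subprocess.run(QUERY, capture_output=True, text=True, check=False)
    if result.returncode != 0:
        return []
    return parse_gpu_rows(result.stdout)


if __name__ == "__main__":
    gpus = detect_gpus()
    if gpus:
        for name, mem_mib in gpus:
            print(f"{name}: {mem_mib / 1024:.1f} GiB")
    else:
        print("No NVIDIA GPU detected via nvidia-smi")
```

The reported memory total is usually slightly below the nominal 96 GB, so a check against the spec should allow some tolerance.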

What it can run

63 models
Model · Quantization · Speed · Rating
all-MiniLM-L6-v2 · FP16 · 3000 tok/s · Excellent
Arcee Trinity Mini 26B · Q8_0 · 75 tok/s · Excellent
Arcee Trinity Nano 6B · Q8_0 · 318 tok/s · Excellent
Code Llama 34B Instruct · Q5_K_M · 48 tok/s · Good
Codestral 22B · Q5_K_M · 73 tok/s · Good
Command R 35B · Q8_0 · 28 tok/s · Acceptable
DeepSeek Coder V2 Lite 16B · Q8_0 · 63 tok/s · Good
DeepSeek R1 · Q2_K · – · Not viable
DeepSeek R1 Distill Qwen 32B · Q8_0 · 30 tok/s · Acceptable
DeepSeek R1 Distill Qwen 7B · Q8_0 · 129 tok/s · Excellent
DeepSeek V3 · Q2_K · – · Not viable
FLUX.1 Dev · FP16 · – · Excellent
Gemma 2 27B Instruct · Q5_K_M · 59 tok/s · Good
Gemma 2 9B Instruct · Q8_0 · 106 tok/s · Excellent
Gemma 3 12B · Q8_0 · 80 tok/s · Good
Gemma 3 27B · Q8_0 · 36 tok/s · Good
Gemma 3 4B · Q8_0 · 228 tok/s · Excellent
Gemma 4 26B-A4B · Q8_0 · 279 tok/s · Excellent
Gemma 4 31B · Q8_0 · 41 tok/s · Good
Gemma 4 E2B · Q8_0 · 271 tok/s · Excellent
Gemma 4 E4B · Q8_0 · 168 tok/s · Excellent
GigaChat Lightning 10B · Q8_0 · 325 tok/s · Excellent
InternLM 2.5 7B Chat · Q8_0 · 127 tok/s · Excellent
Llama 3.1 70B Instruct · Q5_K_M · 23 tok/s · Acceptable
Llama 3.1 8B Instruct · Q8_0 · 122 tok/s · Excellent
Llama 3.2 11B Vision · Q8_0 · 89 tok/s · Excellent
Llama 3.2 1B Instruct · Q8_0 · 471 tok/s · Excellent
Llama 3.2 3B Instruct · Q8_0 · 260 tok/s · Excellent
Llama 3.3 70B Instruct · Q8_0 · 14 tok/s · Acceptable
Llama 4 Scout · Q5_K_M · 95 tok/s · Excellent
LLaVA 1.6 13B · Q5_K_M · 124 tok/s · Excellent
Mistral 7B Instruct v0.3 · Q8_0 · 136 tok/s · Excellent
Mistral Large 2 123B · Q5_K_M · 13 tok/s · Acceptable
Mistral Small 24B Instruct · Q8_0 · 41 tok/s · Good
Mixtral 8x7B Instruct · Q5_K_M · 125 tok/s · Excellent
nomic-embed-text v1.5 · FP16 · 2000 tok/s · Excellent
NVIDIA Nemotron-3-super-120B-A12B · Q4_K_M · 158 tok/s · Excellent
Phi-3 Medium 14B Instruct · Q8_0 · 70 tok/s · Good
Phi-3 Mini 3.8B Instruct · Q8_0 · 257 tok/s · Excellent
Phi-4 14B · Q8_0 · 67 tok/s · Good
Phi-4 Mini · Q8_0 · 257 tok/s · Excellent
Qwen 2.5 14B Instruct · Q8_0 · 66 tok/s · Good
Qwen 2.5 72B Instruct · Q4_K_M · 26 tok/s · Acceptable
Qwen 2.5 7B Instruct · Q8_0 · 129 tok/s · Excellent
Qwen 2.5 Coder 32B Instruct · Q5_K_M · 50 tok/s · Good
Qwen 2.5 Coder 7B Instruct · Q8_0 · 129 tok/s · Excellent
Qwen3-14B Instruct · Q8_0 · 70 tok/s · Good
Qwen3-30B-A3B · Q8_0 · 278 tok/s · Excellent
Qwen3-32B Instruct · Q8_0 · 31 tok/s · Good
Qwen3-8B Instruct · Q8_0 · 120 tok/s · Excellent
Qwen3.5-122B-A10B · Q8_0 · 98 tok/s · Excellent
Qwen3.5-27B · Q8_0 · 36 tok/s · Good
Qwen3.5-397B (MoE) · Q2_K · – · Not viable
Qwen3.6-27B · Q8_0 · 36 tok/s · Good
Qwen3.6-35B-A3B · Q5_K_M · 278 tok/s · Excellent
QwQ 32B Preview · Q5_K_M · 50 tok/s · Good
Stable Diffusion 3 Medium · FP16 · – · Excellent
Stable Diffusion 3.5 Large · FP16 · – · Excellent
Stable Diffusion XL 1.0 · FP16 · – · Excellent
StarCoder 2 15B · Q8_0 · 63 tok/s · Good
Whisper Large V3 · FP16 · – · Excellent
Whisper Large V3 Turbo · FP16 · – · Excellent
Yi 1.5 34B Chat · Q8_0 · 29 tok/s · Acceptable



Frequently Asked Questions

What AI models can the NVIDIA RTX PRO 6000 Blackwell run?
The compatibility table lists 63 AI models for the NVIDIA RTX PRO 6000 Blackwell, 60 of them rated Acceptable or better. The fastest are all-MiniLM-L6-v2, nomic-embed-text v1.5, and Llama 3.2 1B Instruct. See the full compatibility table above for speeds and quality ratings.
Is the NVIDIA RTX PRO 6000 Blackwell good for AI coding?
Yes. With 96 GB of VRAM, the NVIDIA RTX PRO 6000 Blackwell supports the Full AI Builder tier: concurrent coding, reasoning, and embeddings.
How much VRAM does the NVIDIA RTX PRO 6000 Blackwell have?
The NVIDIA RTX PRO 6000 Blackwell has 96 GB of GDDR7 VRAM with 1800 GB/s of bandwidth.
Can the NVIDIA RTX PRO 6000 Blackwell run 70B models?
Yes. The NVIDIA RTX PRO 6000 Blackwell can hold 70B-parameter models in VRAM at quantized precision. The table above lists Llama 3.1 70B Instruct at Q5_K_M (23 tok/s) and Llama 3.3 70B Instruct at Q8_0 (14 tok/s), both rated Acceptable.
Is the NVIDIA RTX PRO 6000 Blackwell worth it for AI?
At an estimated $7,500, the NVIDIA RTX PRO 6000 Blackwell offers 96 GB of GDDR7 VRAM and is rated viable for 60 of the 63 listed AI models, including 70B-class models at quantized precision, which makes it a strong fit for local AI inference.
