NVIDIA
Desktop GPU
Desktop GPU

NVIDIA GeForce RTX 3090

24 GB GDDR6X Β· 936 GB/s

From

$899

Estimated street price

VRAM

24 GB

Bandwidth

936 GB/s

TDP

350W

Models

21

Tier

Power

The NVIDIA GeForce RTX 3090 with 24 GB GDDR6X VRAM can handle 21 AI models across chat, coding, ai_coding. Best performance: Llama 3.2 1B Instruct at 220 tok/s (excellent). For AI coding workflows, it supports the Power AI Coding tier, running 32B coding models at good quality. Current price: approximately $899.

Source: OwnRig methodology

VRAM

24 GB

Bandwidth

936 GB/s

Memory Type

GDDR6X

TDP

350W

Form Factor

3-slot, 313mm

Builder Capability: Power AI Coding

Runs 32B coding models at good quality. Can handle coding model + embeddings concurrently.

Software

Inference Backends

The software stacks that matter most for real-world inference on this device.

CUDA

production

Primary high-performance backend for NVIDIA inference workloads.

Vulkan

stable

Fallback backend for llama.cpp and related local runtimes.

What it can run

21 models
Arcee Trinity Mini 26BQ5_K_M58 tok/sExcellent
Arcee Trinity Nano 6BQ8_0165 tok/sExcellent
DeepSeek V3Q2_K–Not viable
Gemma 3 27BQ4_K_M18 tok/sGood
Gemma 4 26B-A4BQ5_K_M213 tok/sExcellent
Gemma 4 31BQ4_K_M35 tok/sGood
Gemma 4 E2BQ8_0141 tok/sExcellent
Gemma 4 E4BQ8_087 tok/sExcellent
GigaChat Lightning 10BQ8_066 tok/sGood
Llama 3.1 8B InstructQ8_070 tok/sExcellent
Llama 3.2 1B InstructQ8_0220 tok/sExcellent
Llama 3.2 3B InstructQ8_0150 tok/sExcellent
NVIDIA Nemotron-3-super-120B-A12BQ2_K11 tok/sMarginal
Phi-4 MiniQ8_0140 tok/sExcellent
Qwen 2.5 Coder 32B InstructQ4_K_M18 tok/sGood
Qwen3.5-122B-A10BQ3_K_M11 tok/sMarginal
Qwen3.5-27BQ5_K_M24 tok/sGood
Qwen3.5-397B (MoE)Q2_K–Not viable
Qwen3.6-27BQ5_K_M24 tok/sGood
Stable Diffusion 3.5 LargeFP16–Excellent
Whisper Large V3 TurboFP16–Excellent

Showing 21 of 21 entries

Curated Builds

Featured in Builds

Buy Used

Prices and availability vary. Inspect hardware before purchasing. Some links may be affiliate links.

FAQ

Frequently Asked Questions

What AI models can NVIDIA GeForce RTX 3090 run?
The NVIDIA GeForce RTX 3090 can run 21 AI models. Top performers include Llama 3.2 1B Instruct, Gemma 4 26B-A4B, Arcee Trinity Nano 6B. See the full compatibility table above for speeds and quality ratings.
Is NVIDIA GeForce RTX 3090 good for AI coding?
Yes. With 24 GB, the NVIDIA GeForce RTX 3090 supports the Power AI Coding tier: large coding models at good quality.
How much VRAM does NVIDIA GeForce RTX 3090 have?
The NVIDIA GeForce RTX 3090 has 24 GB of GDDR6X VRAM with 936 GB/s bandwidth.
Can NVIDIA GeForce RTX 3090 run 70B models?
70B models can run on the NVIDIA GeForce RTX 3090 with CPU offloading, but performance will be reduced. Consider a GPU with 48GB+ VRAM for full-speed 70B inference.
Is NVIDIA GeForce RTX 3090 worth it for AI?
At $899, the NVIDIA GeForce RTX 3090 offers 24 GB VRAM and runs 21 AI models. It handles local AI inference well.

Own this GPU?

See every AI model it supports, expected performance, and how to build around it.

Check my rig