NVIDIA
Desktop GPUdiscontinued
Desktop GPU

NVIDIA GeForce RTX 3080 10GB

10 GB GDDR6X Β· 760 GB/s

From

$399

Estimated street price

VRAM

10 GB

Bandwidth

760 GB/s

TDP

320W

Models

53

Tier

Limited

The NVIDIA GeForce RTX 3080 10GB with 10 GB GDDR6X VRAM can handle 53 AI models across embedding, ai_building, coding. Best performance: all-MiniLM-L6-v2 at 5000 tok/s (excellent). Current price: approximately $399.

Source: OwnRig methodology

VRAM

10 GB

Bandwidth

760 GB/s

Memory Type

GDDR6X

TDP

320W

Form Factor

3-slot, 285mm

Builder Capability: Limited

Insufficient VRAM for most AI coding workflows.

Software

Inference Backends

The software stacks that matter most for real-world inference on this device.

CUDA

production

Primary high-performance backend for NVIDIA inference workloads.

Vulkan

stable

Fallback backend for llama.cpp and related local runtimes.

What it can run

53 models
all-MiniLM-L6-v2FP165000 tok/sExcellent
Arcee Trinity Mini 26BQ3_K_M11 tok/sNot viable
Arcee Trinity Nano 6BQ8_0134 tok/sExcellent
Code Llama 34B InstructQ2_K–Not viable
Codestral 22BQ3_K_M–Not viable
Command R 35BQ2_K–Not viable
DeepSeek Coder V2 Lite 16BQ4_K_M55 tok/sExcellent
DeepSeek R1 Distill Qwen 32BQ2_K–Not viable
DeepSeek R1 Distill Qwen 7BQ5_K_M48 tok/sExcellent
DeepSeek V3Q2_K–Not viable
FLUX.1 DevQ4_K_M–Not viable
Gemma 2 27B InstructQ3_K_M–Not viable
Gemma 2 9B InstructQ4_K_M45 tok/sExcellent
Gemma 3 12BQ3_K_M28 tok/sAcceptable
Gemma 3 27BQ3_K_M–Not viable
Gemma 3 4BQ5_K_M75 tok/sExcellent
Gemma 4 E2BQ8_0114 tok/sExcellent
Gemma 4 E4BQ8_070 tok/sExcellent
GigaChat Lightning 10BQ4_K_M72 tok/sAcceptable
InternLM 2.5 7B ChatQ5_K_M50 tok/sExcellent
Llama 3.1 70B InstructQ2_K–Not viable
Llama 3.1 8B InstructQ5_K_M50 tok/sExcellent
Llama 3.2 1B InstructQ8_0180 tok/sExcellent
Llama 3.2 3B InstructQ8_0140 tok/sExcellent
Llama 3.3 70B InstructQ2_K–Not viable
LLaVA 1.6 13BQ3_K_M18 tok/sAcceptable
Mistral 7B Instruct v0.3Q5_K_M48 tok/sExcellent
Mistral Small 24B InstructQ2_K–Not viable
Mixtral 8x7B InstructQ2_K–Not viable
nomic-embed-text v1.5Q8_02500 tok/sExcellent
NVIDIA Nemotron-3-super-120B-A12BQ2_K–Not viable
Phi-3 Medium 14B InstructQ3_K_M32 tok/sAcceptable
Phi-3 Mini 3.8B InstructQ8_0130 tok/sExcellent
Phi-4 14BQ3_K_M26 tok/sAcceptable
Phi-4 MiniQ8_0120 tok/sExcellent
Qwen 2.5 14B InstructQ3_K_M24 tok/sAcceptable
Qwen 2.5 72B InstructQ2_K–Not viable
Qwen 2.5 7B InstructQ5_K_M52 tok/sExcellent
Qwen 2.5 Coder 32B InstructQ2_K–Not viable
Qwen 2.5 Coder 7B InstructQ5_K_M50 tok/sExcellent
Qwen3-14B InstructQ4_K_M21 tok/sAcceptable
Qwen3-8B InstructQ8_032 tok/sGood
Qwen3.5-27BQ3_K_M7 tok/sNot viable
Qwen3.5-397B (MoE)Q2_K–Not viable
Qwen3.6-27BQ3_K_M–Not viable
QwQ 32B PreviewQ2_K–Not viable
Stable Diffusion 3 MediumFP16–Excellent
Stable Diffusion 3.5 LargeQ8_0–Acceptable
Stable Diffusion XL 1.0FP16–Excellent
StarCoder 2 15BQ3_K_M22 tok/sAcceptable
Whisper Large V3Q5_K_M–Excellent
Whisper Large V3 TurboFP16–Excellent
Yi 1.5 34B ChatQ2_K–Not viable

Showing 53 of 53 entries

Buy Used

Prices and availability vary. Inspect hardware before purchasing. Some links may be affiliate links.

FAQ

Frequently Asked Questions

What AI models can NVIDIA GeForce RTX 3080 10GB run?
The NVIDIA GeForce RTX 3080 10GB can run 53 AI models. Top performers include all-MiniLM-L6-v2, nomic-embed-text v1.5, Llama 3.2 1B Instruct. See the full compatibility table above for speeds and quality ratings.
Is NVIDIA GeForce RTX 3080 10GB good for AI coding?
With 10 GB, the NVIDIA GeForce RTX 3080 10GB has limited VRAM for AI coding workflows.
How much VRAM does NVIDIA GeForce RTX 3080 10GB have?
The NVIDIA GeForce RTX 3080 10GB has 10 GB of GDDR6X VRAM with 760 GB/s bandwidth.
Can NVIDIA GeForce RTX 3080 10GB run 70B models?
70B models can run on the NVIDIA GeForce RTX 3080 10GB with CPU offloading, but performance will be reduced. Consider a GPU with 48GB+ VRAM for full-speed 70B inference.
Is NVIDIA GeForce RTX 3080 10GB worth it for AI?
At $399, the NVIDIA GeForce RTX 3080 10GB offers 10 GB VRAM and runs 53 AI models. It works for smaller models and experimentation.

Own this GPU?

See every AI model it supports, expected performance, and how to build around it.

Check my rig