AMD
Desktop GPU
Desktop GPU

AMD Radeon RX 9070

16 GB GDDR6 Β· 640 GB/s

From

$549

Estimated street price

VRAM

16 GB

Bandwidth

640 GB/s

TDP

180W

Models

63

Tier

Capable

The AMD Radeon RX 9070 with 16 GB GDDR6 VRAM can handle 63 AI models across embedding, ai_building, coding. Best performance: Gemma 4 26B-A4B at 218 tok/s (excellent). For AI coding workflows, it supports the Capable AI Coding tier, handling single model workflows well. Current price: approximately $549.

Source: OwnRig methodology

VRAM

16 GB

Bandwidth

640 GB/s

Memory Type

GDDR6

TDP

180W

Form Factor

2-slot, 270mm

Builder Capability: Capable AI Coding

Runs 16-22B coding models comfortably, or 32B at reduced quality. Handles single model workflows well.

Software

Inference Backends

The software stacks that matter most for real-world inference on this device.

ROCm

beta

RDNA 4 ROCm support is improving rapidly. Vulkan is the safer default in llama.cpp until ROCm matures further for this architecture.

Vulkan

stable

Most reliable llama.cpp backend for local inference on RDNA 4. Recommended until ROCm 7.x fully stabilises for Navi 48.

What it can run

63 models
all-MiniLM-L6-v2FP16–Excellent
Arcee Trinity Mini 26BQ3_K_M60 tok/sGood
Arcee Trinity Nano 6BQ8_0112 tok/sExcellent
Code Llama 34B InstructQ2_K–Acceptable
Codestral 22BQ3_K_M32 tok/sGood
Command R 35BQ3_K_M4 tok/sMarginal
DeepSeek Coder V2 Lite 16BQ5_K_M88 tok/sGood
DeepSeek R1Q2_K–Not viable
DeepSeek R1 Distill Qwen 32BQ3_K_M8 tok/sMarginal
DeepSeek R1 Distill Qwen 7BQ8_0150 tok/sGood
DeepSeek V3Q2_K–Not viable
FLUX.1 DevQ4_K_M–Acceptable
Gemma 2 27B InstructQ3_K_M22 tok/sAcceptable
Gemma 2 9B InstructQ8_0–Acceptable
Gemma 3 12BQ5_K_M74 tok/sGood
Gemma 3 27BQ3_K_M10 tok/sAcceptable
Gemma 3 4BQ5_K_M34 tok/sGood
Gemma 4 26B-A4BQ3_K_M218 tok/sExcellent
Gemma 4 31BQ3_K_M14 tok/sAcceptable
Gemma 4 E2BQ8_096 tok/sGood
Gemma 4 E4BQ8_058 tok/sGood
GigaChat Lightning 10BQ8_096 tok/sGood
InternLM 2.5 7B ChatQ8_0–Acceptable
Llama 3.1 70B InstructQ2_K–Not viable
Llama 3.1 8B InstructQ8_096 tok/sGood
Llama 3.2 11B VisionQ6_K66 tok/sGood
Llama 3.2 1B InstructQ8_0212 tok/sGood
Llama 3.2 3B InstructQ8_0132 tok/sGood
Llama 3.3 70B InstructQ3_K_M–Not viable
Llama 4 ScoutQ3_K_M–Not viable
LLaVA 1.6 13BQ4_K_M38 tok/sGood
Mistral 7B Instruct v0.3Q8_0–Acceptable
Mistral Large 2 123BQ2_K–Not viable
Mistral Small 24B InstructQ3_K_M32 tok/sGood
Mixtral 8x7B InstructQ2_K4 tok/sMarginal
nomic-embed-text v1.5FP16–Acceptable
NVIDIA Nemotron-3-super-120B-A12BQ2_K–Not viable
Phi-3 Medium 14B InstructQ5_K_M50 tok/sGood
Phi-3 Mini 3.8B InstructQ8_0–Acceptable
Phi-4 14BQ4_K_M50 tok/sGood
Phi-4 MiniQ8_0120 tok/sGood
Qwen 2.5 14B InstructQ4_K_M52 tok/sGood
Qwen 2.5 72B InstructQ2_K–Not viable
Qwen 2.5 7B InstructQ8_0–Acceptable
Qwen 2.5 Coder 32B InstructQ2_K18 tok/sAcceptable
Qwen 2.5 Coder 7B InstructQ5_K_M92 tok/sGood
Qwen3-14B InstructQ5_K_M28 tok/sGood
Qwen3-30B-A3BQ3_K_M–Marginal
Qwen3-32B InstructQ3_K_M4 tok/sMarginal
Qwen3-8B InstructQ8_0–Acceptable
Qwen3.5-122B-A10BQ3_K_M–Not viable
Qwen3.5-27BQ3_K_M44 tok/sGood
Qwen3.5-397B (MoE)Q2_K–Not viable
Qwen3.6-27BQ3_K_M44 tok/sGood
Qwen3.6-35B-A3BQ3_K_M–Marginal
QwQ 32B PreviewQ2_K–Acceptable
Stable Diffusion 3 MediumFP16–Acceptable
Stable Diffusion 3.5 LargeFP16–Acceptable
Stable Diffusion XL 1.0FP16–Excellent
StarCoder 2 15BQ5_K_M44 tok/sGood
Whisper Large V3FP16–Excellent
Whisper Large V3 TurboFP16–Good
Yi 1.5 34B ChatQ3_K_M4 tok/sMarginal

Showing 63 of 63 entries

Buy Used

Prices and availability vary. Inspect hardware before purchasing. Some links may be affiliate links.

FAQ

Frequently Asked Questions

What AI models can AMD Radeon RX 9070 run?
The AMD Radeon RX 9070 can run 63 AI models. Top performers include Gemma 4 26B-A4B, Llama 3.2 1B Instruct, DeepSeek R1 Distill Qwen 7B. See the full compatibility table above for speeds and quality ratings.
Is AMD Radeon RX 9070 good for AI coding?
Yes. With 16 GB, the AMD Radeon RX 9070 handles single-model coding workflows well at the Capable tier.
How much VRAM does AMD Radeon RX 9070 have?
The AMD Radeon RX 9070 has 16 GB of GDDR6 VRAM with 640 GB/s bandwidth.
Can AMD Radeon RX 9070 run 70B models?
70B models can run on the AMD Radeon RX 9070 with CPU offloading, but performance will be reduced. Consider a GPU with 48GB+ VRAM for full-speed 70B inference.
Is AMD Radeon RX 9070 worth it for AI?
At $549, the AMD Radeon RX 9070 offers 16 GB VRAM and runs 63 AI models. It works for smaller models and experimentation.

Own this GPU?

See every AI model it supports, expected performance, and how to build around it.

Check my rig