Apple
Apple Silicon
Apple Silicon

Apple M3 Pro (18GB Unified)

18 GB Unified Β· 150 GB/s

From

$1,799

Estimated street price

VRAM

18 GB

Bandwidth

150 GB/s

TDP

30W

Models

59

Tier

Capable

The Apple M3 Pro (18GB Unified) with 18 GB unified memory can handle 59 AI models across embedding, ai_building, coding. Best performance: all-MiniLM-L6-v2 at 1200 tok/s (good). For AI coding workflows, it supports the Capable AI Coding tier, handling single model workflows well. Current price: approximately $1,799.

Source: OwnRig methodology

VRAM

18 GB

Bandwidth

150 GB/s

Memory Type

Unified

TDP

30W

GPU Cores

14

Host Devices

MacBook Pro 14", MacBook Pro 16"

Builder Capability: Capable AI Coding

Runs 16-22B coding models comfortably, or 32B at reduced quality. Handles single model workflows well.

Software

Inference Backends

The software stacks that matter most for real-world inference on this device.

Metal

production

Primary Apple Silicon backend across MLX and llama.cpp workloads.

What it can run

59 models
all-MiniLM-L6-v2FP161200 tok/sGood
Arcee Trinity Mini 26BQ4_K_M8 tok/sNot viable
Arcee Trinity Nano 6BQ8_032 tok/sGood
Code Llama 34B InstructQ3_K_M–Not viable
Codestral 22BQ3_K_M–Not viable
Command R 35BQ3_K_M–Not viable
DeepSeek Coder V2 Lite 16BQ4_K_M20 tok/sGood
DeepSeek R1 Distill Qwen 32BQ3_K_M–Not viable
DeepSeek R1 Distill Qwen 7BQ4_K_M14 tok/sAcceptable
DeepSeek V3Q2_K–Not viable
FLUX.1 DevQ4_K_M–Not viable
Gemma 2 27B InstructQ4_K_M–Not viable
Gemma 2 9B InstructQ4_K_M13 tok/sAcceptable
Gemma 3 12BQ3_K_M5 tok/sMarginal
Gemma 3 27BQ3_K_M3 tok/sMarginal
Gemma 3 4BQ4_K_M22 tok/sAcceptable
Gemma 4 26B-A4BQ3_K_M51 tok/sGood
Gemma 4 31BQ3_K_M7 tok/sMarginal
Gemma 4 E2BQ8_022 tok/sAcceptable
Gemma 4 E4BQ8_014 tok/sMarginal
GigaChat Lightning 10BQ8_038 tok/sAcceptable
InternLM 2.5 7B ChatQ4_K_M15 tok/sAcceptable
Llama 3.1 70B InstructQ2_K–Not viable
Llama 3.1 8B InstructQ4_K_M15 tok/sAcceptable
Llama 3.2 1B InstructQ8_045 tok/sGood
Llama 3.2 3B InstructQ8_035 tok/sGood
Llama 3.3 70B InstructQ2_K–Not viable
LLaVA 1.6 13BQ4_K_M8 tok/sAcceptable
Mistral 7B Instruct v0.3Q4_K_M14 tok/sAcceptable
Mistral Small 24B InstructQ3_K_M–Not viable
Mixtral 8x7B InstructQ2_K–Not viable
nomic-embed-text v1.5Q8_0600 tok/sGood
NVIDIA Nemotron-3-super-120B-A12BQ2_K–Not viable
Phi-3 Medium 14B InstructQ3_K_M6 tok/sMarginal
Phi-3 Mini 3.8B InstructQ8_032 tok/sGood
Phi-4 14BQ3_K_M5 tok/sMarginal
Phi-4 MiniQ8_030 tok/sGood
Qwen 2.5 14B InstructQ3_K_M5 tok/sMarginal
Qwen 2.5 72B InstructQ2_K–Not viable
Qwen 2.5 7B InstructQ4_K_M16 tok/sAcceptable
Qwen 2.5 Coder 32B InstructQ3_K_M–Not viable
Qwen 2.5 Coder 7B InstructQ4_K_M15 tok/sAcceptable
Qwen3-14B InstructQ8_02 tok/sMarginal
Qwen3-30B-A3BQ4_K_M3 tok/sMarginal
Qwen3-32B InstructQ3_K_M2 tok/sMarginal
Qwen3-8B InstructQ8_08 tok/sMarginal
Qwen3.5-122B-A10BQ3_K_M–Not viable
Qwen3.5-27BQ4_K_M16 tok/sAcceptable
Qwen3.5-397B (MoE)Q2_K–Not viable
Qwen3.6-27BQ4_K_M16 tok/sAcceptable
Qwen3.6-35B-A3BQ3_K_M3 tok/sMarginal
QwQ 32B PreviewQ3_K_M–Not viable
Stable Diffusion 3 MediumFP16–Good
Stable Diffusion 3.5 LargeFP16–Marginal
Stable Diffusion XL 1.0FP16–Good
StarCoder 2 15BQ3_K_M4 tok/sMarginal
Whisper Large V3Q5_K_M–Good
Whisper Large V3 TurboFP16–Good
Yi 1.5 34B ChatQ3_K_M–Not viable

Showing 59 of 59 entries

Buy Used Mac

Prices and availability vary. Inspect hardware before purchasing. Some links may be affiliate links.

FAQ

Frequently Asked Questions

What AI models can Apple M3 Pro (18GB Unified) run?
The Apple M3 Pro (18GB Unified) can run 59 AI models. Top performers include all-MiniLM-L6-v2, nomic-embed-text v1.5, Gemma 4 26B-A4B. See the full compatibility table above for speeds and quality ratings.
Is Apple M3 Pro (18GB Unified) good for AI coding?
Yes. With 18 GB, the Apple M3 Pro (18GB Unified) handles single-model coding workflows well at the Capable tier.
How much VRAM does Apple M3 Pro (18GB Unified) have?
The Apple M3 Pro (18GB Unified) has 18 GB of unified memory with 150 GB/s bandwidth.
Can Apple M3 Pro (18GB Unified) run 70B models?
70B models can run on the Apple M3 Pro (18GB Unified) with CPU offloading, but performance will be reduced. Consider a GPU with 48GB+ VRAM for full-speed 70B inference.
Is Apple M3 Pro (18GB Unified) worth it for AI?
At $1,799, the Apple M3 Pro (18GB Unified) offers 18 GB VRAM and runs 59 AI models. It works for smaller models and experimentation.

Own this GPU?

See every AI model it supports, expected performance, and how to build around it.

Check my rig