OwnRig

DeepSeek Coder V2 Lite 16B on Apple M3 Pro (18GB Unified)

Apple M3 Pro (18GB Unified) handles DeepSeek Coder V2 Lite 16B well, at an estimated 20 tok/s with Q4_K_M quantization. It is a solid choice for this model.

Model Size

15.7B

Device VRAM

18 GB

Bandwidth

150 GB/s

Quantizations Tested

1

Performance by Quantization

Each row shows DeepSeek Coder V2 Lite 16B performance at one quantization level on Apple M3 Pro (18GB Unified).

Quantization  Speed     TTFT    Fits in VRAM  Rating  Confidence
Q4_K_M        20 tok/s  300 ms  ✓ Yes         Good    estimated

Notes

Q4_K_M

At Q4_K_M, the MoE model takes 9.1 GB and fits in the 14 GB of effective memory. Coding performance is good despite the relatively low memory bandwidth.
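The fit check in this note is simple arithmetic. A minimal sketch follows, using only the 9.1 GB and 14 GB figures quoted above. The function names are hypothetical. Treating the leftover as room for the KV cache and runtime buffers is an assumption, not something this page states.

```python
# Sketch of the memory-fit arithmetic, using the figures quoted in the note.
MODEL_SIZE_GB = 9.1         # model size at Q4_K_M (from the note)
EFFECTIVE_MEMORY_GB = 14.0  # effective memory out of 18 GB (from the note)


def fits_in_memory(model_gb: float, effective_gb: float) -> bool:
    """Return True when the model fits within the effective memory."""
    return model_gb <= effective_gb


def headroom_gb(model_gb: float, effective_gb: float) -> float:
    """Memory left over after loading the model, in GB."""
    return effective_gb - model_gb


fits = fits_in_memory(MODEL_SIZE_GB, EFFECTIVE_MEMORY_GB)
# Assumption: the remainder is what is left for KV cache, activations, and buffers.
headroom = round(headroom_gb(MODEL_SIZE_GB, EFFECTIVE_MEMORY_GB), 1)

print(f"fits: {fits}, headroom: {headroom} GB")
```

With these inputs the model fits with about 4.9 GB to spare. How much context that remainder supports depends on the runtime and is not estimated here.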

About DeepSeek Coder V2 Lite 16B

DeepSeek Coder V2 Lite 16B (15.7B) is a coding model with a mixture-of-experts (MoE) architecture: 15.7B total parameters, with ~2.4B active per token. It offers excellent code generation and completion, and inference is very fast for its total parameter count. It is one of the best coding models for its effective size.
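To illustrate why the active parameter count matters for speed, here is a rough bandwidth-bound sketch. It uses only figures from this page (150 GB/s, 9.1 GB at Q4_K_M, 15.7B total and ~2.4B active parameters). It assumes that decoding is limited purely by reading weights from memory and that the active weights take a proportional share of the 9.1 GB. Both are simplifications, and the function name is hypothetical. The results are theoretical ceilings, not measurements.

```python
# Rough bandwidth-bound ceiling for decode speed (a simplification, not a benchmark).
BANDWIDTH_GB_S = 150.0   # memory bandwidth listed on this page
MODEL_SIZE_GB = 9.1      # model size at Q4_K_M, from the notes
TOTAL_PARAMS_B = 15.7    # total parameters, in billions
ACTIVE_PARAMS_B = 2.4    # approximate active parameters per token, in billions


def decode_ceiling_tok_s(bandwidth_gb_s: float, gb_read_per_token: float) -> float:
    """Upper bound on tokens/s if each token only costs one pass over the weights read."""
    return bandwidth_gb_s / gb_read_per_token


# Assumption: active weights occupy a proportional share of the quantized model.
active_gb = MODEL_SIZE_GB * ACTIVE_PARAMS_B / TOTAL_PARAMS_B

# Ceiling if every weight were read per token (as in a dense model of the same size).
dense_ceiling = decode_ceiling_tok_s(BANDWIDTH_GB_S, MODEL_SIZE_GB)
# Ceiling if only the active weights are read per token (the MoE case).
moe_ceiling = decode_ceiling_tok_s(BANDWIDTH_GB_S, active_gb)

print(f"dense-style ceiling: {dense_ceiling:.1f} tok/s")
print(f"MoE ceiling:         {moe_ceiling:.1f} tok/s")
```

Under these assumptions the dense-style ceiling comes out below the page's estimated 20 tok/s, while the MoE ceiling is well above it. Real throughput falls short of the ceiling because of KV cache reads, compute, and runtime overhead, none of which this sketch models.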


About Apple M3 Pro (18GB Unified)

Apple M3 Pro (18GB Unified) has 18 GB of unified memory with 150 GB/s of memory bandwidth. It is available in the MacBook Pro 14" and MacBook Pro 16".


Source: MLX performance estimates (2026-03-15)

Data last updated: 2026-03-01