OwnRig

Qwen 2.5 14B Instruct on NVIDIA GeForce RTX 4060 Ti 16GB

NVIDIA GeForce RTX 4060 Ti 16GB runs Qwen 2.5 14B Instruct at an estimated 30 tok/s with Q4_K_M quantization. A solid choice for this model.

Model Size: 14.77B parameters

Device VRAM: 16 GB

Bandwidth: 288 GB/s

Quantizations Tested: 1

Performance by Quantization

Each row shows Qwen 2.5 14B Instruct performance at a different quantization level on NVIDIA GeForce RTX 4060 Ti 16GB.

Quantization | Speed    | TTFT   | Fits in VRAM | Rating | Confidence
Q4_K_M       | 30 tok/s | 240 ms | ✓ Yes        | Good   | estimated

Notes

Q4_K_M

The 14B model at Q4_K_M fits in 16 GB of VRAM. Good general-purpose performance.
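A rough way to sanity-check the fit is to estimate the size of the quantized weights from the parameter count. The sketch below assumes about 4.85 bits per weight for Q4_K_M, which is an approximation and not a figure from this page, and it ignores KV cache, activations, and runtime overhead, so the real footprint will be higher.

```python
# Back-of-the-envelope estimate of quantized weight size.
# PARAMS_B and VRAM_GB come from this page; BITS_PER_WEIGHT is an assumption.

def weight_footprint_gb(params_billion: float, bits_per_weight: float) -> float:
    """Approximate size of the quantized weights in GB (10^9 bytes)."""
    return params_billion * bits_per_weight / 8


PARAMS_B = 14.77        # model size listed on this page
VRAM_GB = 16.0          # device VRAM listed on this page
BITS_PER_WEIGHT = 4.85  # assumed average for Q4_K_M; varies by model

weights_gb = weight_footprint_gb(PARAMS_B, BITS_PER_WEIGHT)
headroom_gb = VRAM_GB - weights_gb  # left for KV cache and overhead

print(f"Estimated weights: {weights_gb:.2f} GB")
print(f"Estimated headroom: {headroom_gb:.2f} GB")
```

Under these assumptions the weights come to about 9 GB, leaving roughly 7 GB for context and overhead, which is consistent with the "Fits in VRAM" result.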

About Qwen 2.5 14B Instruct

Qwen 2.5 14B Instruct (14.77B parameters) is a multi-purpose model for chat, coding, and reasoning. It is a strong general-purpose model with an excellent balance of coding and reasoning ability.

About NVIDIA GeForce RTX 4060 Ti 16GB

NVIDIA GeForce RTX 4060 Ti 16GB has 16 GB of VRAM and 288 GB/s of memory bandwidth. Street price: $449.
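Memory bandwidth matters because single-stream token generation is usually limited by how fast the weights can be read from VRAM. As a rough sketch, dividing bandwidth by the weight size gives an upper bound on decode speed. The weight size here is an assumption (about 4.85 bits per weight for Q4_K_M, not a figure from this page), and the bound ignores KV cache reads and kernel overhead.

```python
# Rough bandwidth-bound ceiling on decode speed: each generated token
# requires reading (approximately) all weights once from VRAM.

def decode_ceiling_tok_s(bandwidth_gb_s: float, weights_gb: float) -> float:
    """Upper bound on tokens/s if weight reads were the only cost."""
    return bandwidth_gb_s / weights_gb


BANDWIDTH_GB_S = 288.0   # from this page
PARAMS_B = 14.77         # from this page
BITS_PER_WEIGHT = 4.85   # assumed average for Q4_K_M
ESTIMATED_TOK_S = 30.0   # estimated speed listed on this page

weights_gb = PARAMS_B * BITS_PER_WEIGHT / 8
ceiling = decode_ceiling_tok_s(BANDWIDTH_GB_S, weights_gb)
fraction = ESTIMATED_TOK_S / ceiling  # share of the ceiling the estimate implies

print(f"Bandwidth-bound ceiling: {ceiling:.1f} tok/s")
print(f"Listed estimate is {fraction:.0%} of that ceiling")
```

With these inputs the ceiling is a little over 32 tok/s, so the listed 30 tok/s estimate sits close to the theoretical limit; measured speeds may come in lower.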

Source: Community benchmarks and estimated performance (2026-03-01)

Data last updated: 2026-03-15