OwnRig

Mistral Small 24B Instruct on NVIDIA GeForce RTX 4070 Ti Super

NVIDIA GeForce RTX 4070 Ti Super can run Mistral Small 24B Instruct at 18 tok/s at Q3_K_M, though performance is acceptable. Consider a higher-end GPU for better results.

Model Size

24B

Device VRAM

16 GB

Bandwidth

672 GB/s

Quantizations Tested

1

Performance by Quantization

Each row shows Mistral Small 24B Instruct performance at a different quality level on NVIDIA GeForce RTX 4070 Ti Super.

QuantizationSpeedTTFTFits in VRAMRatingConfidence
Q3_K_M18 tok/s500ms✓ YesAcceptableestimated

Notes

Q3_K_M

Q3 required for 16GB. Usable with quality compromise.

About Mistral Small 24B Instruct

Mistral Small 24B Instruct (24B) is a chat, coding, reasoning model. Mistral's efficient 24B model with strong chat, coding, and reasoning.

View all Mistral Small 24B Instruct hardware options →

About NVIDIA GeForce RTX 4070 Ti Super

NVIDIA GeForce RTX 4070 Ti Super has 16 GB at 672 GB/s. Street price: $779.

See all models NVIDIA GeForce RTX 4070 Ti Super can run →

Source: Community benchmarks and estimated performance (2026-03-01)

Data last updated: 2026-03-15