Command R 35B on NVIDIA GeForce RTX 4070 Ti 12GB
NVIDIA GeForce RTX 4070 Ti 12GB cannot run Command R 35B at any quantization level. The 12 GB of VRAM is insufficient.
Model Size
35B
Device VRAM
12 GB
Bandwidth
504 GB/s
Quantizations Tested
1
Performance by Quantization
Each row shows Command R 35B performance at a different quality level on NVIDIA GeForce RTX 4070 Ti 12GB.
| Quantization | Speed | TTFT | Fits in VRAM | Rating | Confidence |
|---|---|---|---|---|---|
| Q2_K | — | — | ✗ Offload | Not Viable | estimated |
Notes
Q2_K
Requires 24GB+ for viable quality.
About Command R 35B
Command R 35B (35B) is a chat, reasoning, multi-purpose model. Cohere's RAG-optimized model with strong reasoning and long-context support.
View all Command R 35B hardware options →About NVIDIA GeForce RTX 4070 Ti 12GB
NVIDIA GeForce RTX 4070 Ti 12GB has 12 GB at 504 GB/s. Street price: $749.
See all models NVIDIA GeForce RTX 4070 Ti 12GB can run →Source: 34.8B model exceeds 12GB (2026-03-15)
Data last updated: 2026-03-15