Nomic · Apache 2.0
High-quality text embedding model for RAG pipelines. At 137M parameters it needs very little VRAM (about 410 MB at Q8_0). Competitive with OpenAI's ada-002 on MTEB benchmarks. Well suited to builders running local RAG with Cursor or similar tools, and small enough to run concurrently with a coding model without meaningful VRAM impact.
nomic-embed-text v1.5 (137M) requires 410 MB of VRAM at the recommended quality (Q8_0). On an NVIDIA GeForce RTX 4070 Ti 12GB, expect approximately 6500 tok/s at Q8_0. For the best experience, the Starter AI Desktop ($582) is recommended.
— OwnRig methodology, data updated 2026-03-01
| Quality | Quantization | VRAM | File Size |
|---|---|---|---|
| full | FP16 | 512 MB | 0.27 GB |
| recommended | Q8_0 | 410 MB | 0.14 GB |
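The file sizes in the table can be sanity-checked with a back-of-envelope estimate from the parameter count. The sketch below is not OwnRig's methodology. It assumes 16 bits per weight for FP16 and 8.5 bits per weight for Q8_0 (the GGUF block layout of 32 int8 weights plus one FP16 scale), and it ignores file metadata and any tensors kept at higher precision. The function name `estimate_file_gb` is illustrative only.

```python
# Back-of-envelope weight-file size from parameter count and bits per weight.
PARAMS = 137_000_000  # parameter count quoted for nomic-embed-text v1.5

# Assumption: FP16 uses 16 bits per weight. GGUF Q8_0 stores 32 int8 weights
# plus one FP16 scale per block, i.e. (32 * 8 + 16) / 32 = 8.5 bits per weight.
BITS_PER_WEIGHT = {"FP16": 16.0, "Q8_0": 8.5}


def estimate_file_gb(params: int, bits_per_weight: float) -> float:
    """Return the approximate weight size in decimal gigabytes."""
    return params * bits_per_weight / 8 / 1e9


for name, bits in BITS_PER_WEIGHT.items():
    print(f"{name}: ~{estimate_file_gb(PARAMS, bits):.3f} GB")
```

Under these assumptions the estimates land close to the 0.27 GB and 0.14 GB listed above. The VRAM column is larger than the file size because runtime buffers are loaded in addition to the weights.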
KV cache VRAM by context length at Q8_0. Longer contexts need more memory.
| Context | KV Cache | Total VRAM |
|---|---|---|
| 2K | 0 MB | 410 MB |
| 4K | 0 MB | 410 MB |
| 8K | 102 MB | 512 MB |
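The table's "Total VRAM" column is the Q8_0 weight footprint plus the KV cache. A minimal sketch of that arithmetic follows, together with a check of whether the embedder fits next to another model. It assumes "2K", "4K", and "8K" mean 2048, 4096, and 8192 tokens. The 9,000 MB coding model and the helper names are hypothetical, chosen only for illustration.

```python
# VRAM figures at Q8_0, copied from the context-length table (in MB).
WEIGHTS_MB = 410
# Assumption: 2K / 4K / 8K in the table mean 2048 / 4096 / 8192 tokens.
KV_CACHE_MB = {2048: 0, 4096: 0, 8192: 102}


def total_vram_mb(context_tokens: int) -> int:
    """Weights plus KV cache for a context length listed in the table."""
    return WEIGHTS_MB + KV_CACHE_MB[context_tokens]


def fits_alongside(context_tokens: int, gpu_vram_mb: int, other_model_mb: int) -> bool:
    """True if the embedder fits in the VRAM left over by another model."""
    return total_vram_mb(context_tokens) <= gpu_vram_mb - other_model_mb


# Hypothetical scenario: a 12 GB card already holding a 9,000 MB coding model.
print(total_vram_mb(8192))
print(fits_alongside(8192, 12 * 1024, 9000))
```

Even at the 8K context the embedder takes 512 MB, which is why it can usually share a GPU with a coding model.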
Performance data for nomic-embed-text v1.5 across different hardware.
| Device | Quantization | Speed | Rating | Fits in VRAM |
|---|---|---|---|---|
| NVIDIA GeForce RTX 3060 12GB | FP16 | — | Excellent | ✓ |
| NVIDIA GeForce RTX 4090 | FP16 | — | Excellent | ✓ |
| Apple M4 Max (64GB Unified) | FP16 | — | Excellent | ✓ |
| NVIDIA GeForce RTX 4070 Super | FP16 | — | Excellent | ✓ |
| Apple M4 Pro (48GB) | FP16 | — | Excellent | ✓ |
| NVIDIA GeForce RTX 4060 8GB | Q8_0 | 4200 tok/s | Excellent | ✓ |
| NVIDIA GeForce RTX 4070 Ti 12GB | Q8_0 | 6500 tok/s | Excellent | ✓ |
| NVIDIA GeForce RTX 3080 10GB | Q8_0 | 2500 tok/s | Excellent | ✓ |
| Apple M3 Pro (18GB Unified) | Q8_0 | 600 tok/s | Good | ✓ |
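The tok/s figures translate directly into indexing time for a RAG corpus. The sketch below divides a corpus size by the Q8_0 throughput values from the table. The 5-million-token corpus is a hypothetical example, and real runs will vary with batch size and chunk length, which this estimate ignores.

```python
# Q8_0 throughput from the benchmark table, in tokens per second.
TOKENS_PER_SECOND = {
    "RTX 4070 Ti 12GB": 6500,
    "RTX 4060 8GB": 4200,
    "RTX 3080 10GB": 2500,
    "M3 Pro 18GB": 600,
}


def indexing_seconds(corpus_tokens: int, tokens_per_second: float) -> float:
    """Time to embed a corpus once, assuming sustained throughput."""
    return corpus_tokens / tokens_per_second


CORPUS_TOKENS = 5_000_000  # hypothetical corpus size, for illustration only
for device, tps in TOKENS_PER_SECOND.items():
    minutes = indexing_seconds(CORPUS_TOKENS, tps) / 60
    print(f"{device}: ~{minutes:.0f} min")
```

On these numbers the spread between the fastest and slowest listed devices is roughly a factor of ten, which matters mostly for the initial index build rather than for per-query embedding.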
nomic-embed-text v1.5 is commonly used with Cursor, Continue, AnythingLLM, and Open WebUI.
Complete PC builds that can run nomic-embed-text v1.5.

NVIDIA GeForce RTX 4090 · 64GB DDR5-5600 (2x32GB)

NVIDIA GeForce RTX 3060 12GB · 32GB DDR4-3200 (2x16GB)

NVIDIA GeForce RTX 4060 Ti 16GB · 32GB DDR5-5200 (2x16GB)

NVIDIA GeForce RTX 4070 Super 12GB · 32GB DDR5-5600 (2x16GB)

2x NVIDIA GeForce RTX 3090 24GB (Used) + NVLink Bridge · 128GB DDR5-5600 (4x32GB)

Apple M4 Max 128GB (Mac Studio)

NVIDIA GeForce RTX 4060 Ti 16GB · 32GB DDR5-5600 (2x16GB)

NVIDIA GeForce RTX 3090 24GB (Used) · 64GB DDR5-5600 (2x32GB)

Data confidence: verified. Last updated: 2026-03-01.