FLUX.1 Dev on NVIDIA GeForce RTX 4090
The NVIDIA GeForce RTX 4090 runs FLUX.1 Dev at FP16 with a rating of Excellent, making this a strong pairing.
Model Size: 12B
Device VRAM: 24 GB
Bandwidth: 1008 GB/s
Quantizations Tested: 1
Performance by Quantization
Each row shows FLUX.1 Dev performance at one quantization level on the NVIDIA GeForce RTX 4090.
| Quantization | Speed | TTFT | Fits in VRAM | Rating | Confidence |
|---|---|---|---|---|---|
| FP16 | — | — | ✓ Yes | Excellent | estimated |
Notes
FP16
The full FP16 FLUX.1 Dev model, at 23.8 GB, fits in the RTX 4090's 24 GB of VRAM with minimal headroom. Expect roughly 10-15 seconds per image. FP16 gives the best quality for local image generation.
About FLUX.1 Dev
FLUX.1 Dev (12B) is an image generation model with state-of-the-art output. It offers significantly better prompt adherence and image quality than SDXL. It is a 12B-parameter transformer model rather than a diffusion UNet. FP16 requires 24 GB of VRAM, but Q4 quantization brings it within reach of 8 GB GPUs. It is released under a non-commercial license.
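The FP16 and Q4 figures above follow from a simple rule of thumb: weight memory is roughly the parameter count times the bytes per parameter. The sketch below is a back-of-the-envelope estimate, not a measurement. The function names and the bytes-per-parameter table are illustrative assumptions, and it counts weights only (no activations, text encoders, VAE, or quantization overhead).

```python
# Rough weight-memory estimate: parameters x bytes per parameter.
# Assumption: nominal storage cost per parameter; real quantized files
# carry some extra overhead (scales, metadata) that is ignored here.
BYTES_PER_PARAM = {"fp16": 2.0, "q8": 1.0, "q4": 0.5}


def weight_gb(params_billion: float, precision: str) -> float:
    """Approximate weight size in decimal gigabytes (1 GB = 1e9 bytes)."""
    total_bytes = params_billion * 1e9 * BYTES_PER_PARAM[precision]
    return total_bytes / 1e9


def fits(params_billion: float, precision: str, vram_gb: float) -> bool:
    """True if the weights alone fit in the given amount of VRAM."""
    return weight_gb(params_billion, precision) <= vram_gb


if __name__ == "__main__":
    # 12B parameters, as listed for FLUX.1 Dev on this page.
    for precision, vram in (("fp16", 24.0), ("q4", 8.0)):
        size = weight_gb(12, precision)
        print(f"{precision}: ~{size:.1f} GB of weights, "
              f"fits in {vram:.0f} GB: {fits(12, precision, vram)}")
```

With a nominal 12B parameters this gives about 24 GB at FP16 and about 6 GB at Q4. The FP16 estimate is close to the 23.8 GB quoted in the notes, and the small gap is consistent with "12B" being a rounded parameter count.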
About NVIDIA GeForce RTX 4090
The NVIDIA GeForce RTX 4090 has 24 GB of VRAM with 1008 GB/s of memory bandwidth. Street price: $1,799.
Source: FLUX performance reports (2026-01-15)
Data last updated: 2026-03-01
