Stable Diffusion XL 1.0 on NVIDIA GeForce RTX 4090
NVIDIA GeForce RTX 4090 runs Stable Diffusion XL 1.0 excellently at FP16. This is a strong pairing.
Model Size
6.6B
Device VRAM
24 GB
Bandwidth
1008 GB/s
Quantizations Tested
1
Performance by Quantization
Each row shows Stable Diffusion XL 1.0 performance at a different quality level on NVIDIA GeForce RTX 4090.
| Quantization | Speed | TTFT | Fits in VRAM | Rating | Confidence |
|---|---|---|---|---|---|
| FP16 | — | — | ✓ Yes | Excellent | benchmarked |
Notes
FP16
~3-5 seconds per 1024x1024 image at 30 steps. Massive headroom for LoRA stacking and higher resolutions.
About Stable Diffusion XL 1.0
Stable Diffusion XL 1.0 (6.6B) is a image gen model. The standard for local image generation. 1024x1024 base resolution. Massive LoRA/checkpoint ecosystem. Runs well on 8GB+ VRAM. VRAM usage varies by resolution and batch size — 6.5GB is for single 1024x1024 generation.
View all Stable Diffusion XL 1.0 hardware options →About NVIDIA GeForce RTX 4090
NVIDIA GeForce RTX 4090 has 24 GB at 1008 GB/s. Street price: $1,799.
See all models NVIDIA GeForce RTX 4090 can run →Builds with NVIDIA GeForce RTX 4090
Source: SD WebUI benchmarks (2026-01-15)
Data last updated: 2026-03-01
