NVIDIA
Laptop GPU
Laptop GPU

NVIDIA RTX 4090 Laptop (150-175W)

16 GB GDDR6 Β· 512 GB/s

Pricing

Included in laptop

Not sold as a standalone component

VRAM

16 GB

Bandwidth

512 GB/s

TDP

150W

Models

34

Tier

Capable

The NVIDIA RTX 4090 Laptop (150-175W) with 16 GB GDDR6 VRAM can handle 34 AI models across coding, ai_coding, ai_building. Best performance: Gemma 4 26B-A4B at 175 tok/s (excellent). For AI coding workflows, it supports the Capable AI Coding tier, handling single model workflows well. Current price has not been announced.

Source: OwnRig methodology

VRAM

16 GB

Bandwidth

512 GB/s

Memory Type

GDDR6

TDP

150W

Form Factor

Laptop (soldered)

Laptop Performance Note

Laptop GPU performance varies by manufacturer, cooling design, and power limits. The tok/s numbers below reflect sustained performance after thermal throttling, not peak. Actual results on your specific laptop may differ by 10-20%.

Builder Capability: Capable AI Coding

Runs 16-22B coding models comfortably, or 32B at reduced quality. Handles single model workflows well.

Software

Inference Backends

The software stacks that matter most for real-world inference on this device.

CUDA

production

Primary high-performance backend for NVIDIA inference workloads.

Vulkan

stable

Fallback backend for llama.cpp and related local runtimes.

What it can run

34 models
Arcee Trinity Mini 26BQ3_K_M49 tok/sExcellent
Arcee Trinity Nano 6BQ8_090 tok/sExcellent
Codestral 22BQ3_K_M15 tok/sAcceptable
DeepSeek Coder V2 Lite 16BQ5_K_M43 tok/sGood
FLUX.1 DevQ4_K_M–Acceptable
Gemma 2 27B InstructQ4_K_M10 tok/sAcceptable
Gemma 3 12BQ5_K_M36 tok/sGood
Gemma 3 27BQ3_K_M5 tok/sMarginal
Gemma 4 26B-A4BQ3_K_M175 tok/sExcellent
Gemma 4 31BQ3_K_M11 tok/sAcceptable
Gemma 4 E2BQ8_077 tok/sExcellent
Gemma 4 E4BQ8_047 tok/sGood
GigaChat Lightning 10BQ8_066 tok/sAcceptable
Llama 3.1 8B InstructQ8_047 tok/sGood
Llama 3.2 11B VisionQ6_K32 tok/sGood
Llama 3.2 1B InstructQ8_0102 tok/sExcellent
Llama 3.2 3B InstructQ8_064 tok/sExcellent
LLaVA 1.6 13BQ4_K_M19 tok/sAcceptable
NVIDIA Nemotron-3-super-120B-A12BQ2_K–Not viable
Phi-3 Medium 14B InstructQ5_K_M24 tok/sAcceptable
Phi-4 14BQ4_K_M24 tok/sAcceptable
Phi-4 MiniQ8_058 tok/sExcellent
Qwen 2.5 14B InstructQ4_K_M26 tok/sGood
Qwen 2.5 Coder 32B InstructQ3_K_M9 tok/sMarginal
Qwen 2.5 Coder 7B InstructQ5_K_M44 tok/sGood
Qwen3-14B InstructQ8_013 tok/sAcceptable
Qwen3.5-122B-A10BQ3_K_M–Not viable
Qwen3.5-27BQ3_K_M30 tok/sAcceptable
Qwen3.5-397B (MoE)Q2_K–Not viable
Qwen3.6-27BQ3_K_M30 tok/sAcceptable
Stable Diffusion 3 MediumFP16–Acceptable
Stable Diffusion 3.5 LargeFP16–Good
StarCoder 2 15BQ5_K_M21 tok/sAcceptable
Whisper Large V3 TurboFP16–Excellent

Showing 34 of 34 entries

Looking for a desktop build?

Desktop GPUs offer higher sustained performance with no thermal throttling. Check our curated desktop builds for dedicated AI workstations.

FAQ

Frequently Asked Questions

What AI models can NVIDIA RTX 4090 Laptop (150-175W) run?
The NVIDIA RTX 4090 Laptop (150-175W) can run 34 AI models. Top performers include Gemma 4 26B-A4B, Llama 3.2 1B Instruct, Arcee Trinity Nano 6B. See the full compatibility table above for speeds and quality ratings.
Is NVIDIA RTX 4090 Laptop (150-175W) good for AI coding?
Yes. With 16 GB, the NVIDIA RTX 4090 Laptop (150-175W) handles single-model coding workflows well at the Capable tier.
How much VRAM does NVIDIA RTX 4090 Laptop (150-175W) have?
The NVIDIA RTX 4090 Laptop (150-175W) has 16 GB of GDDR6 VRAM with 512 GB/s bandwidth.
Can NVIDIA RTX 4090 Laptop (150-175W) run 70B models?
70B models can run on the NVIDIA RTX 4090 Laptop (150-175W) with CPU offloading, but performance will be reduced. Consider a device with 48GB+ inference memory for full-speed 70B inference.
Is NVIDIA RTX 4090 Laptop (150-175W) worth it for AI?
Pricing for NVIDIA RTX 4090 Laptop (150-175W) has not been announced. It offers 16 GB GDDR6 VRAM, but OwnRig should treat recommendations as provisional until pricing and benchmarks are available.

Own this GPU?

See every AI model it supports, expected performance, and how to build around it.

Check my rig