HP
Desktop · 4× GPU
Professional Workstation

HP Z8 Fury G6i (4× RTX PRO 6000 Max-Q, 384 GB)

Windows · Linux

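HP Z8 Fury G6i with four NVIDIA RTX PRO 6000 Blackwell Max-Q GPUs (4× 96 GB, 384 GB GDDR7 total). Intel Xeon 698X 86-core, up to 2 TB DDR5-6400. DeepSeek R1 and V3 (671B) need about 1.3 TB for weights alone at FP16, which exceeds the 384 GB of GPU memory and would have to spill into system RAM; the compatibility list below rates both as not viable at Q2_K. Dual 1,700 W PSUs. Enterprise-grade local AI inference server.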

From $90,000 (estimated)

Enterprise pricing varies by configuration and region. Confirm quote and availability with HP.


Memory: 384 GB
GPUs: 4
RAM: 2048 GB
Models: 63
Type: Desktop

Inference Memory: 384 GB
Accelerator: 4× 96 GB GDDR7
System RAM: 2048 GB
CPU: Intel Xeon 698X (86-core Granite Rapids, 350 W)
OS: Windows, Linux

Multi-GPU System

This system has 4 GPUs with 96 GB each (384 GB total). Models that fit on a single GPU run at full speed. Larger models must be split across GPUs, and their actual throughput depends on the inference engine and interconnect bandwidth.
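As a rough illustration of the single-GPU versus cross-GPU distinction, the sketch below estimates a model's weight footprint and reports where it would land on this machine. The bits-per-weight values are approximate, the 1.2× overhead factor for KV cache and runtime buffers is a hypothetical allowance, and the function names are illustrative only; real memory use depends on the inference engine, context length, and batch size.

```python
import math

GPU_COUNT = 4          # from the spec list
GPU_MEMORY_GB = 96.0   # per-GPU memory from the spec list

# Approximate bits stored per weight (assumption; FP16 is exact,
# Q8_0 includes a small per-block scale overhead).
BITS_PER_WEIGHT = {"FP16": 16.0, "Q8_0": 8.5}


def weight_footprint_gb(params_billion: float, quant: str) -> float:
    """Weights-only size in GB (1 GB = 10^9 bytes)."""
    return params_billion * BITS_PER_WEIGHT[quant] / 8.0


def placement(params_billion: float, quant: str, overhead: float = 1.2) -> str:
    """Classify where a model would fit; `overhead` is a hypothetical margin."""
    needed = weight_footprint_gb(params_billion, quant) * overhead
    if needed <= GPU_MEMORY_GB:
        return "single GPU"
    if needed <= GPU_COUNT * GPU_MEMORY_GB:
        gpus = math.ceil(needed / GPU_MEMORY_GB)
        return f"split across {gpus} GPUs"
    return "exceeds total GPU memory"


if __name__ == "__main__":
    for params, quant in [(8, "Q8_0"), (122, "Q8_0"), (671, "FP16")]:
        size = weight_footprint_gb(params, quant)
        print(f"{params}B {quant}: ~{size:.0f} GB weights, {placement(params, quant)}")
```

Under these assumptions an 8B model at Q8_0 stays on one GPU, a 122B model at Q8_0 has to be split, and a 671B model at FP16 does not fit in GPU memory at all.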

What it can run

63 models

| Model | Quantization | Speed | Rating |
|---|---|---|---|
| all-MiniLM-L6-v2 | FP16 | 2760 tok/s | Excellent |
| Arcee Trinity Mini 26B | Q8_0 | 75 tok/s | Excellent |
| Arcee Trinity Nano 6B | Q8_0 | 318 tok/s | Excellent |
| Code Llama 34B Instruct | Q5_K_M | 44 tok/s | Good |
| Codestral 22B | Q5_K_M | 67 tok/s | Good |
| Command R 35B | Q8_0 | 26 tok/s | Acceptable |
| DeepSeek Coder V2 Lite 16B | Q8_0 | 58 tok/s | Good |
| DeepSeek R1 | Q2_K | - | Not viable |
| DeepSeek R1 Distill Qwen 32B | Q8_0 | 28 tok/s | Acceptable |
| DeepSeek R1 Distill Qwen 7B | Q8_0 | 119 tok/s | Excellent |
| DeepSeek V3 | Q2_K | - | Not viable |
| FLUX.1 Dev | FP16 | - | Excellent |
| Gemma 2 27B Instruct | Q5_K_M | 54 tok/s | Good |
| Gemma 2 9B Instruct | Q8_0 | 98 tok/s | Excellent |
| Gemma 3 12B | Q8_0 | 74 tok/s | Good |
| Gemma 3 27B | Q8_0 | 33 tok/s | Good |
| Gemma 3 4B | Q8_0 | 210 tok/s | Excellent |
| Gemma 4 26B-A4B | Q8_0 | 279 tok/s | Excellent |
| Gemma 4 31B | Q8_0 | 41 tok/s | Good |
| Gemma 4 E2B | Q8_0 | 271 tok/s | Excellent |
| Gemma 4 E4B | Q8_0 | 168 tok/s | Excellent |
| GigaChat Lightning 10B | Q8_0 | 299 tok/s | Excellent |
| InternLM 2.5 7B Chat | Q8_0 | 117 tok/s | Excellent |
| Llama 3.1 70B Instruct | Q5_K_M | 21 tok/s | Acceptable |
| Llama 3.1 8B Instruct | Q8_0 | 112 tok/s | Excellent |
| Llama 3.2 11B Vision | Q8_0 | 82 tok/s | Excellent |
| Llama 3.2 1B Instruct | Q8_0 | 433 tok/s | Excellent |
| Llama 3.2 3B Instruct | Q8_0 | 239 tok/s | Excellent |
| Llama 3.3 70B Instruct | Q8_0 | 13 tok/s | Acceptable |
| Llama 4 Scout | Q5_K_M | 87 tok/s | Excellent |
| LLaVA 1.6 13B | Q5_K_M | 114 tok/s | Excellent |
| Mistral 7B Instruct v0.3 | Q8_0 | 125 tok/s | Excellent |
| Mistral Large 2 123B | Q5_K_M | 12 tok/s | Acceptable |
| Mistral Small 24B Instruct | Q8_0 | 38 tok/s | Good |
| Mixtral 8x7B Instruct | Q5_K_M | 115 tok/s | Excellent |
| nomic-embed-text v1.5 | FP16 | 1840 tok/s | Excellent |
| NVIDIA Nemotron-3-super-120B-A12B | Q4_K_M | 145 tok/s | Excellent |
| Phi-3 Medium 14B Instruct | Q8_0 | 64 tok/s | Good |
| Phi-3 Mini 3.8B Instruct | Q8_0 | 236 tok/s | Excellent |
| Phi-4 14B | Q8_0 | 62 tok/s | Good |
| Phi-4 Mini | Q8_0 | 236 tok/s | Excellent |
| Qwen 2.5 14B Instruct | Q8_0 | 61 tok/s | Good |
| Qwen 2.5 72B Instruct | Q4_K_M | 24 tok/s | Acceptable |
| Qwen 2.5 7B Instruct | Q8_0 | 119 tok/s | Excellent |
| Qwen 2.5 Coder 32B Instruct | Q5_K_M | 46 tok/s | Good |
| Qwen 2.5 Coder 7B Instruct | Q8_0 | 119 tok/s | Excellent |
| Qwen3-14B Instruct | Q8_0 | 64 tok/s | Good |
| Qwen3-30B-A3B | Q8_0 | 256 tok/s | Excellent |
| Qwen3-32B Instruct | Q8_0 | 29 tok/s | Acceptable |
| Qwen3-8B Instruct | Q8_0 | 110 tok/s | Excellent |
| Qwen3.5-122B-A10B | Q8_0 | 90 tok/s | Excellent |
| Qwen3.5-27B | Q8_0 | 33 tok/s | Good |
| Qwen3.5-397B (MoE) | Q2_K | - | Not viable |
| Qwen3.6-27B | Q8_0 | 33 tok/s | Good |
| Qwen3.6-35B-A3B | Q5_K_M | 256 tok/s | Excellent |
| QwQ 32B Preview | Q5_K_M | 46 tok/s | Good |
| Stable Diffusion 3 Medium | FP16 | - | Excellent |
| Stable Diffusion 3.5 Large | FP16 | - | Excellent |
| Stable Diffusion XL 1.0 | FP16 | - | Excellent |
| StarCoder 2 15B | Q8_0 | 58 tok/s | Good |
| Whisper Large V3 | FP16 | - | Excellent |
| Whisper Large V3 Turbo | FP16 | - | Excellent |
| Yi 1.5 34B Chat | Q8_0 | 27 tok/s | Acceptable |
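To put the listed speeds in practical terms, the sketch below converts tokens per second into the wall-clock time needed to generate a response of a given length. The 500-token response length and the function name are illustrative assumptions, and the estimate covers generation only, not prompt processing.

```python
def seconds_to_generate(tokens: int, tokens_per_second: float) -> float:
    """Time to generate `tokens` at a steady decode rate (prompt processing excluded)."""
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    return tokens / tokens_per_second


# Speeds copied from the compatibility list above.
listed_speeds = {
    "Llama 3.1 8B Instruct (Q8_0)": 112,
    "Llama 3.3 70B Instruct (Q8_0)": 13,
    "Mistral Large 2 123B (Q5_K_M)": 12,
}

if __name__ == "__main__":
    response_tokens = 500  # hypothetical response length
    for name, tps in listed_speeds.items():
        seconds = seconds_to_generate(response_tokens, tps)
        print(f"{name}: {seconds:.1f} s for {response_tokens} tokens")
```

At the listed rates, a model rated Acceptable takes roughly 40 seconds for a 500-token answer, while one rated Excellent at over 100 tok/s finishes in under five.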


Best Fit

Who this machine makes sense for

This machine is aimed at team, lab, or enterprise buyers who want a supported system instead of assembling a tower. With 384 GB of GPU memory, it runs all but three of the 63 listed models without a DIY build process; only DeepSeek R1, DeepSeek V3, and Qwen3.5-397B are rated not viable.

Before You Buy

What to verify first

The main question is not whether the machine works, but whether the price premium is justified by warranty, support, and deployment simplicity versus an equivalent custom build.