Supermicro SuperServer (H200) — 1024GB / 8TB / 8×GPU
A ai server built on the NVIDIA H200 141GB with about 1128GB of memory usable for AI — here’s what it runs and how it fits a private AI Business OS.
Configuration
- AI memory
- ~1128GB
- Usable for models
- ~992.6GB
- System RAM
- 1024GB
- Storage
- 8TB
- GPUs
- 8×
- Chip
- NVIDIA H200 141GB
Compatible models
Open models graded for this exact configuration.
- DeepSeek-R1 671B (MoE)DeepSeek · ~671B · 128K ctx · MIT
Fits at Q8_0 (~700GB) with ~292.6GB headroom — about 1 concurrent instance.
Q8_0 · ~700GBRuns well - Llama 3.1 405BLlama · ~405B · 128K ctx · Llama Community License
Fits at FP16 (~810GB) with ~182.6GB headroom — about 1 concurrent instance.
FP16 · ~810GBRuns well - Qwen3 235B-A22B (MoE)Qwen · ~235B · 128K ctx · Apache-2.0
Fits at FP16 (~470GB) with ~522.6GB headroom — about 2 concurrent instances.
FP16 · ~470GBRuns well - Qwen2.5 72BQwen · ~72B · 128K ctx · Qwen License
Fits at FP16 (~145GB) with ~847.6GB headroom — about 6 concurrent instances.
FP16 · ~145GBRuns well - Llama 3.1 70BLlama · ~70B · 128K ctx · Llama Community License
Fits at FP16 (~140GB) with ~852.6GB headroom — about 7 concurrent instances.
FP16 · ~140GBRuns well - Llama 3.3 70BLlama · ~70B · 128K ctx · Llama Community License
Fits at FP16 (~140GB) with ~852.6GB headroom — about 7 concurrent instances.
FP16 · ~140GBRuns well - DeepSeek-R1 Distill Llama 70BDeepSeek · ~70B · 128K ctx · MIT
Fits at FP16 (~140GB) with ~852.6GB headroom — about 7 concurrent instances.
FP16 · ~140GBRuns well - Mixtral 8x7B (MoE)Mistral · ~47B · 32K ctx · Apache-2.0
Fits at FP16 (~90GB) with ~902.6GB headroom — about 11 concurrent instances.
FP16 · ~90GBRuns well
Offers
- Demo StoreDemoIn stock · checked 2026-06-23$320,000Demo only
- Demo MarketplaceDemoIn stock · checked 2026-06-23$332,800Demo only
Pricing below is illustrative demo data — there are no live or affiliate links yet. Offers are kept separate from our editorial AI scores, which they never influence.
Turn the Supermicro SuperServer (H200) into a private AI Business OS
Run your own AI agents on hardware you control — private by design, no per-seat data leaving your premises. BrainOutput helps you pick the right machine and turn it into a working AI Business OS.