vLLMvLLM/Recipes
BrowseDocsGitHub
Providers
Arcee AI
Ernie (Baidu)
Seed (ByteDance)
DeepSeek
Google
inclusionAI
InternLM
Jina AI
Meituan LongCat
Meta
Microsoft
MiniMax
Mistral AI
Moonshot AI
NVIDIA
Nemotron-3-Nano-Omni-30B-A3B-Reasoning-BF16NVIDIA-Nemotron-3-Super-120B-A12B-BF16NVIDIA-Nemotron-3-Nano-4B-BF16NVIDIA-Nemotron-3-Nano-30B-A3B-BF16NVIDIA-Nemotron-Nano-12B-v2-VL-BF16NVIDIA-Nemotron-Nano-9B-v2
OpenAI
InternVL (OpenGVLab)
PaddlePaddle
Poolside
Qwen
Stability AI
StepFun
Tencent Hunyuan
Wan AI
Xiaomi MiMo
GLM (Z-AI)

NVIDIA

nvidia·6 recipesHuggingFace

Multimodal

2
Nemotron-3-Nano-Omni-30B-A3B-Reasoning-BF16
31B / 3B
moe
BF1675GFP838GNVFP428G
v0.20.0+→
NVIDIA-Nemotron-Nano-12B-v2-VL-BF16
12B
dense
BF1629GFP814GNVFP48G
v0.11.1+→

Text

4
NVIDIA-Nemotron-3-Super-120B-A12B-BF16
120B / 12B
moe
BF16298GFP8149GNVFP475GBF16298G
v0.17.1+→
NVIDIA-Nemotron-3-Nano-4B-BF16
4B
dense
BF1610GFP85G
v0.11.2+→
NVIDIA-Nemotron-3-Nano-30B-A3B-BF16
30B / 3B
moe
BF1672GFP835G
v0.11.2+→
NVIDIA-Nemotron-Nano-9B-v2
9B
dense
BF1622GFP811GNVFP46GBF1622GBF1622G
v0.10.1+→
GitHubRequest a recipeDocumentationSupported Models & HardwareInstall vLLMJSON API