vLLM/Recipes

Meta

meta-llama (HuggingFace) · 3 recipes

Text (3)
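Llama-4-Scout-17B-16E-Instruct
Parameters: 109B / 17B
Architecture: MoE
Footprint: BF16 262G, FP8 131G, NVFP4 65G
vLLM version: v0.12.0+

Llama-3.3-70B-Instruct
Parameters: 70B
Architecture: dense
Footprint: BF16 170G, FP8 84G, NVFP4 42G
vLLM version: v0.12.0+

Llama-3.1-8B-Instruct
Parameters: 8B
Architecture: dense
Footprint: BF16 20G, FP8 10G, NVFP4 5G
vLLM version: v0.6.0+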
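
The per-precision figures listed above can be turned into a rough GPU count. The following is a minimal sketch, not part of the recipes themselves. It assumes the "G" values are gigabytes of accelerator memory needed to hold the model, and the helper name `min_gpus` and the 80 GB per-GPU capacity are hypothetical choices for illustration. It ignores KV cache and activation memory.

```python
import math

# Footprints copied from the listing above.
# Assumption: "G" means gigabytes of accelerator memory.
FOOTPRINT_GB = {
    "Llama-4-Scout-17B-16E-Instruct": {"BF16": 262, "FP8": 131, "NVFP4": 65},
    "Llama-3.3-70B-Instruct": {"BF16": 170, "FP8": 84, "NVFP4": 42},
    "Llama-3.1-8B-Instruct": {"BF16": 20, "FP8": 10, "NVFP4": 5},
}


def min_gpus(model: str, precision: str, gpu_mem_gb: float) -> int:
    """Lower bound on GPUs needed to fit the listed footprint.

    Hypothetical helper: divides the listed footprint by per-GPU memory
    and rounds up. KV cache and runtime overhead are not accounted for.
    """
    footprint = FOOTPRINT_GB[model][precision]
    return math.ceil(footprint / gpu_mem_gb)


if __name__ == "__main__":
    # Example with an assumed 80 GB accelerator.
    for name, precisions in FOOTPRINT_GB.items():
        for precision in precisions:
            print(name, precision, min_gpus(name, precision, 80))
```

This gives only a lower bound. In practice the tensor-parallel size typically has to divide the model's attention head count, and headroom is needed for the KV cache, so the usable GPU count may be higher than the value returned here.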