vllm-project/vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Funciona com~Claude Code~Codex CLI~Cursor
npx skills add vllm-project/vllm

Ask in your favorite AI

Open a new chat with this agent skill pre-loaded.

Documentação

vllm-project/vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Habilidades Relacionadas