vukhanhtruong/claude-eval-plugin

A collection of Claude Code skills for evaluating LLM applications end-to-end: prompts, RAG pipelines, tool calls, and agents. Built for prompt engineers, AI builders, and teams who want a repeatable loop of "edit → eval → see scores → improve" instead of vibes-based iteration.

Compatible with: Claude Code, Codex CLI, Cursor
npx add-skill vukhanhtruong/claude-eval-plugin
