CommunityArte y diseñogithub.com

vukhanhtruong/claude-eval-skill

A collection of Claude Code skills for evaluating LLM applications end-to-end: prompts, RAG pipelines, tool calls, and agents. Built for prompt engineers, AI builders, and teams who want a repeatable loop of "edit → eval → see scores → improve" instead of vibes-based iteration.

Compatible conClaude Code~Codex CLI~Cursor
npx add-skill vukhanhtruong/claude-eval-skill

vukhanhtruong/claude-eval-skill

A collection of Claude Code skills for evaluating LLM applications end-to-end: prompts, RAG pipelines, tool calls, and agents. Built for prompt engineers, AI builders, and teams who want a repeatable loop of "edit → eval → see scores → improve" instead of vibes-based iteration.

Source: https://github.com/vukhanhtruong/claude-eval-skill

Pushed: 2026-05-12T10:07:30Z Stars: 0

Skills relacionados