vukhanhtruong/claude-eval-skill
A collection of Claude Code skills for evaluating LLM applications end-to-end: prompts, RAG pipelines, tool calls, and agents. Built for prompt engineers, AI builders, and teams who want a repeatable loop of "edit → eval → see scores → improve" instead of vibes-based iteration.
Source: https://github.com/vukhanhtruong/claude-eval-skill
Pushed: 2026-05-12T10:07:30Z Stars: 0