iamalimaybe/llm-evaluation-registry

LLM Evaluation Registry is a backend-led quality layer for AI workflows. It tracks prompts, models, reusable test cases, evaluation runs, validation results, regressions, and human review notes so AI behavior can be measured instead of guessed.

Supports: Claude Code, Codex CLI, Cursor
npx skills add iamalimaybe/llm-evaluation-registry
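
The description above lists what the registry tracks but not how those records relate. The following is a minimal sketch, not the project's actual schema: the names `EvalCase`, `EvaluationRun`, and `find_regressions`, and the substring-based validation rule, are all assumptions made for illustration. It shows one plausible way to record per-case validation results for two runs and to flag a regression as a case that passed in a baseline run but fails in a later one.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class EvalCase:
    """A reusable test case (hypothetical shape, not the project's schema)."""
    case_id: str
    input_text: str
    must_contain: str  # simplest possible validation rule: a required substring


@dataclass
class EvaluationRun:
    """One run of a prompt version against a model over a set of cases."""
    prompt_version: str
    model: str
    results: dict[str, bool] = field(default_factory=dict)      # case_id -> passed
    review_notes: dict[str, str] = field(default_factory=dict)  # case_id -> human note

    def record(self, case: EvalCase, output: str) -> bool:
        """Validate a model output against the case and store the result."""
        passed = case.must_contain.lower() in output.lower()
        self.results[case.case_id] = passed
        return passed


def find_regressions(baseline: EvaluationRun, candidate: EvaluationRun) -> list[str]:
    """Return ids of cases that passed in the baseline but fail in the candidate."""
    return sorted(
        case_id
        for case_id, passed in baseline.results.items()
        if passed and candidate.results.get(case_id) is False
    )


if __name__ == "__main__":
    refund = EvalCase("refund", "How do I get a refund?", "refund policy")
    greeting = EvalCase("greeting", "Hi", "hello")

    baseline = EvaluationRun(prompt_version="v1", model="model-a")
    baseline.record(refund, "Please see our refund policy.")
    baseline.record(greeting, "Hello! How can I help?")

    candidate = EvaluationRun(prompt_version="v2", model="model-a")
    candidate.record(refund, "Please contact support.")
    candidate.record(greeting, "Hello there.")
    candidate.review_notes["refund"] = "v2 dropped the policy link"

    print(find_regressions(baseline, candidate))  # ['refund']
```

Comparing runs by case id keeps the check independent of how outputs are validated, so the substring rule here could be swapped for a schema check or a human verdict without changing `find_regressions`.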
