CommunityProductivity & Collaborationgithub.com

iamalimaybe/llm-evaluation-registry

LLM Evaluation Registry is a backend-led quality layer for AI workflows. It tracks prompts, models, reusable test cases, evaluation runs, validation results, regressions, and human review notes so AI behavior can be measured instead of guessed.

Works with~Claude Code~Codex CLI~Cursor
npx add-skill iamalimaybe/llm-evaluation-registry

iamalimaybe/llm-evaluation-registry

LLM Evaluation Registry is a backend-led quality layer for AI workflows. It tracks prompts, models, reusable test cases, evaluation runs, validation results, regressions, and human review notes so AI behavior can be measured instead of guessed.

Related Skills