iamalimaybe/llm-evaluation-registry

LLM Evaluation Registry is a backend-led quality layer for AI workflows. It tracks prompts, models, reusable test cases, evaluation runs, validation results, regressions, and human review notes so AI behavior can be measured instead of guessed.

Compatible platforms: Claude Code, Codex CLI, Cursor
npx add-skill iamalimaybe/llm-evaluation-registry
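
To make the idea of tracking regressions across evaluation runs concrete, here is a minimal sketch in Python. It is not taken from the registry's code: the `EvalResult` shape, the `find_regressions` helper, and the `tc-00x` test case IDs are all hypothetical, and the skill's real schema may look quite different. The sketch assumes a regression means a test case that passed in a baseline run and fails in a later run.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class EvalResult:
    """One validation result for a test case in an evaluation run (hypothetical shape)."""
    test_case_id: str
    passed: bool


def find_regressions(baseline: list[EvalResult], candidate: list[EvalResult]) -> list[str]:
    """Return IDs of test cases that passed in the baseline run but fail in the candidate run."""
    baseline_passed = {r.test_case_id for r in baseline if r.passed}
    return sorted(
        r.test_case_id
        for r in candidate
        if not r.passed and r.test_case_id in baseline_passed
    )


# Illustrative data only: two runs over the same three reusable test cases.
baseline_run = [
    EvalResult("tc-001", True),
    EvalResult("tc-002", True),
    EvalResult("tc-003", False),
]
candidate_run = [
    EvalResult("tc-001", True),
    EvalResult("tc-002", False),
    EvalResult("tc-003", False),
]

regressions = find_regressions(baseline_run, candidate_run)
print(regressions)  # ['tc-002']
```

Only `tc-002` is reported: `tc-003` failed in both runs, so under this assumed definition it is a standing failure rather than a regression.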
