
helloJamest/SkillBench

SkillBench is an open-source framework for evaluating and evolving Codex skills with eval-case generation, judge-only/full-agent scoring, traceable dashboards, CI gates, and GEPA-style feedback loops for improving skill docs.

What is SkillBench?

SkillBench is a Codex agent skill and open-source framework for evaluating and evolving Codex skills. It covers eval-case generation, judge-only and full-agent scoring, traceable dashboards, CI gates, and GEPA-style feedback loops for improving skill docs.

Compatible with: ~Claude Code, Codex CLI, ~Cursor
npx skills add helloJamest/SkillBench



Documentation

What does SkillBench do?

SkillBench is an open-source framework for evaluating and evolving Codex skills with eval-case generation, judge-only/full-agent scoring, traceable dashboards, CI gates, and GEPA-style feedback loops for improving skill docs.
