helloJamest/SkillBench

SkillBench is an open-source framework for evaluating and evolving Codex skills with eval-case generation, judge-only/full-agent scoring, traceable dashboards, CI gates, and GEPA-style feedback loops for improving skill docs.

What is SkillBench?

SkillBench is a Codex agent skill and open-source framework for evaluating and evolving Codex skills. It combines eval-case generation, judge-only or full-agent scoring, traceable dashboards, CI gates, and GEPA-style feedback loops for improving skill docs.

Works with: ~Claude Code · Codex CLI · ~Cursor
npx skills add helloJamest/SkillBench

Documentation

What does SkillBench do?

SkillBench generates eval cases for a Codex skill, scores them in judge-only or full-agent mode, and reports the results in traceable dashboards. The scores can back CI gates, and GEPA-style feedback loops use them to improve the skill docs.
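To make the idea of a CI gate over eval-case scores concrete, here is a minimal sketch in Python. It is illustrative only and does not reflect SkillBench's actual API or result schema: `CaseResult`, `gate`, `min_pass_rate`, and `pass_score` are all hypothetical names and thresholds chosen for this example.

```python
from dataclasses import dataclass


@dataclass
class CaseResult:
    """One scored eval case (hypothetical shape, not SkillBench's schema)."""
    case_id: str
    score: float  # assumed judge score in [0.0, 1.0]


def gate(results: list[CaseResult],
         min_pass_rate: float = 0.8,
         pass_score: float = 0.5) -> tuple[bool, list[str]]:
    """Return (gate_ok, failing_case_ids) for a batch of scored cases."""
    if not results:
        # An empty run should not silently pass a CI gate.
        return False, []
    # A case fails when its score is below the per-case threshold.
    failing = [r.case_id for r in results if r.score < pass_score]
    passed = len(results) - len(failing)
    # The gate passes when enough cases clear the threshold.
    return passed / len(results) >= min_pass_rate, failing
```

In a CI job, a wrapper script could call `gate` on the scored cases and exit non-zero when the first element is `False`, which fails the pipeline and lists the case IDs that need attention.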
