CommunityArte e Designgithub.com

EurecaMoment/BenchClaw

BenchClaw is a Codex/OpenCode skill workflow for benchmark construction, evaluation, and maintenance. It standardizes the full pipeline—from idea drafting and data generation to evaluation, reporting, failure diagnosis, and skill refinement—so agents can build reproducible, auditable benchmarks with clear quality gates, lineage, and rollback.

Funciona com~Claude CodeCodex CLI~CursorOpenCode
npx skills add EurecaMoment/BenchClaw

Ask in your favorite AI

Open a new chat with this agent skill pre-loaded.

Documentação

EurecaMoment/BenchClaw

BenchClaw is a Codex/OpenCode skill workflow for benchmark construction, evaluation, and maintenance. It standardizes the full pipeline—from idea drafting and data generation to evaluation, reporting, failure diagnosis, and skill refinement—so agents can build reproducible, auditable benchmarks with clear quality gates, lineage, and rollback.

Habilidades Relacionadas