CommunityProgramación y desarrollogithub.com

sx4im/skillcheck

🧪 A/B test your agent skills — find out if your SKILL.md actually helps the model or is just a placebo. Blind-graded trials, bootstrap confidence intervals, one verdict: HELPS / PLACEBO / HARMS. For Claude Code, Codex, Gemini CLI & Cursor skills.

Compatible conClaude CodeCodex CLICursorGemini CLI
npx add-skill sx4im/skillcheck

sx4im/skillcheck

🧪 A/B test your agent skills — find out if your SKILL.md actually helps the model or is just a placebo. Blind-graded trials, bootstrap confidence intervals, one verdict: HELPS / PLACEBO / HARMS. For Claude Code, Codex, Gemini CLI & Cursor skills.

Skills relacionados