sx4im/skillcheck

๐Ÿงช A/B test your agent skills โ€” find out if your SKILL.md actually helps the model or is just a placebo. Blind-graded trials, bootstrap confidence intervals, one verdict: HELPS / PLACEBO / HARMS. For Claude Code, Codex, Gemini CLI & Cursor skills.

์ง€์› ๋Œ€์ƒโœ“Claude Codeโœ“Codex CLIโœ“Cursorโœ“Gemini CLI
npx skills add sx4im/skillcheck

Ask in your favorite AI

Open a new chat with this agent skill pre-loaded.

๋ฌธ์„œ

sx4im/skillcheck

๐Ÿงช A/B test your agent skills โ€” find out if your SKILL.md actually helps the model or is just a placebo. Blind-graded trials, bootstrap confidence intervals, one verdict: HELPS / PLACEBO / HARMS. For Claude Code, Codex, Gemini CLI & Cursor skills.

๊ด€๋ จ ์Šคํ‚ฌ