bettyguo/agent_eval
An open-source benchmark for Claude Code skill bundles (.claude/skills/) and CLAUDE.md configs. Pass@k + cost + reliability, content-addressed leaderboard, runs on Anthropic / OpenAI / Google.
An open-source benchmark for Claude Code skill bundles (.claude/skills/) and CLAUDE.md configs. Pass@k + cost + reliability, content-addressed leaderboard, runs on Anthropic / OpenAI / Google.
npx skills add bettyguo/agent_evalAn open-source benchmark for Claude Code skill bundles (.claude/skills/) and CLAUDE.md configs. Pass@k + cost + reliability, content-addressed leaderboard, runs on Anthropic / OpenAI / Google.
Skill audit tracker, changelogs, and reference docs for the Claude AI skill ecosystem — Wigglers Room project
Terminal-native AI content creation workflow tool.
Agent skills for Obsidian. Teach your agent to use Markdown, Bases, JSON Canvas, and use the CLI.
Turn an idea or article into original editorial illustrations starring an avatar you own — a free agent skill (Claude Code, Codex, Cursor, Gemini) + a desktop app.
Searchable football data provider documentation for AI coding agents. Like Context7 for football data.
Your markdown vault, compiled into a 6-persona MCP team for Claude Code, Codex, OpenCode, and Gemini CLI. Headless-first. Cites, doesn't guess.