bettyguo/agent_eval
An open-source benchmark for Claude Code skill bundles (.claude/skills/) and CLAUDE.md configs. Pass@k + cost + reliability, content-addressed leaderboard, runs on Anthropic / OpenAI / Google.
An open-source benchmark for Claude Code skill bundles (.claude/skills/) and CLAUDE.md configs. Pass@k + cost + reliability, content-addressed leaderboard, runs on Anthropic / OpenAI / Google.
npx add-skill bettyguo/agent_evalAn open-source benchmark for Claude Code skill bundles (.claude/skills/) and CLAUDE.md configs. Pass@k + cost + reliability, content-addressed leaderboard, runs on Anthropic / OpenAI / Google.
LLM-ready web documentation resolver: Python cascade skill + web + Rust CLI (wdr) with semantic cache, multi-provider routing, and quality synthesis
A self-evolving coding agent written in Go. Reads its own source, decides what to improve, writes code, runs tests, and commits — autonomously.
🧠 Create and execute code swiftly with Vibecode Editor, an AI-powered web IDE that enhances development with real-time feedback and a user-friendly interface.
🎙️ Stand-up comedy skill for AI agents — Chinese talkshow script writing with SOP, structure techniques, and paralanguage encoding
Agent skill for querying, creating, editing, exporting and deleting flomo memos
Markdown-first web retrieval skill for AI agents (Cloudflare + Jina + Firecrawl)