bettyguo/agent_eval
An open-source benchmark for Claude Code skill bundles (.claude/skills/) and CLAUDE.md configs. Pass@k + cost + reliability, content-addressed leaderboard, runs on Anthropic / OpenAI / Google.
npx add-skill bettyguo/agent_eval
AI Agent Journey Blog - SEO-optimized content on AI agent workflows and their impact on modern life
✨ The agentic HTML editor — your local AI agent writes the HTML, you ship it. 🚀 75 Skills × 9 Surfaces (magazine · deck · poster · XHS / tweet · prototype · data report · Hyperframes) 🛡️ Sandboxed preview · 📤 1-click to WeChat / X / Zhihu / HTML / PNG 🔑 Zero API key — Claude Code / Cursor / Codex / Gemini / Copilot / OpenCode / Qwen / Aider.
DNA-First full-stack development methodology for AI editors. Give your AI a playbook — it generates Spring Boot + Vue3 + UniApp production-grade code following structured workflows.
Agent skills for EthereumHistory.com — document, research, and crack Frontier-era Ethereum contracts
MCP server giving Claude live access to SAP systems via ADT — read, edit, search, test, and diff ABAP across landscapes without installing anything on the SAP stack.
X (Twitter) Content Creator Agent Team — Claude Code skill. 4-agent pipeline: Trend Researcher → Creator Manager → Critical Agent (max 5 rounds) → Layout Agent