bettyguo/agent_eval

An open-source benchmark for Claude Code skill bundles (.claude/skills/) and CLAUDE.md configs. Pass@k + cost + reliability, content-addressed leaderboard, runs on Anthropic / OpenAI / Google.

Funciona com✓Claude Code✓Codex CLI~Cursor

npx skills add bettyguo/agent_eval

Ver original→Navegar por todas as habilidades

Ask in your favorite AI

Open a new chat with this agent skill pre-loaded.

ChatGPT Claude Gemini Grok Perplexity DeepSeek

Documentação

bettyguo/agent_eval

An open-source benchmark for Claude Code skill bundles (.claude/skills/) and CLAUDE.md configs. Pass@k + cost + reliability, content-addressed leaderboard, runs on Anthropic / OpenAI / Google.

Habilidades Relacionadas

mberneti/clab

Open source Claude skill for automated GitLab MR reviews. Configurable lint rules, semantic analysis, and inline comment posting for self-hosted GitLab instances.

community

clarity-digital-development/tworkflow

A practical, no-hype workflow for AI coding agents: context, plan, implement, review, QA, ship, retro. Templates, two Claude Code skills, and a 40% context rule - every claim traced to official docs.

community

Noninflammatory-tunny848/advertising-skills

Build AI agent skills for paid media, direct-response copy, funnel design, and ad testing that improve conversion and scale campaigns

community

bahayonghang/academic-writing-skills

AI-powered post-writing toolkit for academic papers — format validation, grammar/style polishing, de-AI editing, reference checking, and reviewer-style paper audits. 5 skills for LaTeX, Typst & PDF. Focused on enhancing existing text quality, not generating from scratch.

community

BiswaViraj/agent-skills

Portable agent skills (SKILL.md) for Claude Code, Copilot CLI, Codex & Cursor — reviewloop (clear every PR reviewer), ciloop (fix red CI), standup (daily standup from your PRs). Install per-skill or bundled.

community

brushtyler/living-docs

Gemini CLI skills

community

← More Escrita e Edição skills