derberg/eval-bench
Benchmark Claude Code plugins/skills/agents/MCPs by A/B comparing versions with LLM-judged evaluation prompts
Benchmark Claude Code plugins/skills/agents/MCPs by A/B comparing versions with LLM-judged evaluation prompts
npx skills add derberg/eval-benchBenchmark Claude Code plugins/skills/agents/MCPs by A/B comparing versions with LLM-judged evaluation prompts
Agent skill repository.
Recently updated agent-skill-related GitHub repository: Abyss-PlayerEG/api-tree.
Automate Venly tasks via Rube MCP (Composio). Always search tools first for current schemas.
Claude skills + agents vault (extracted from Novra-Core)
实用技巧合集
A portable reference package for AI coding agents — pipeline (sequential stages) + rails (always-on guardrails). Drop-in AGENTS.md gives any agent immediate awareness of available skills, plugins, and MCP servers.