derberg/eval-bench
Benchmark Claude Code plugins/skills/agents/MCPs by A/B comparing versions with LLM-judged evaluation prompts
Benchmark Claude Code plugins/skills/agents/MCPs by A/B comparing versions with LLM-judged evaluation prompts
npx skills add derberg/eval-benchBenchmark Claude Code plugins/skills/agents/MCPs by A/B comparing versions with LLM-judged evaluation prompts
Automate Minerstat tasks via Rube MCP (Composio). Always search tools first for current schemas.
Automate Firmao tasks via Rube MCP (Composio). Always search tools first for current schemas.
Agent skill repository: scarr-exe/fhevm-agent-skill
An experimental ReAct agent implemented in Go.
Open-source toolkit for the QVeris capability routing network: CLI, MCP server, Python SDK, skills, and REST API docs for agents to discover, inspect, call, and audit real-world tools.
Agent Skill 设计案例库 — 从已发布 skill 反推产品决策与工程范式,沉淀 AI-native API 设计法则。首个案例:微信读书 Skill 拆解。