Community研究&データ分析github.com

linny006/agent-eval-harness

Live, open-source benchmark for comparing AI coding agents on real GitHub issues

対応~Claude Code~Codex CLI~Cursor
npx skills add linny006/agent-eval-harness

Ask in your favorite AI

Open a new chat with this agent skill pre-loaded.

ドキュメント

linny006/agent-eval-harness

Live, open-source benchmark for comparing AI coding agents on real GitHub issues

関連スキル