Community研究与数据分析github.com

linny006/agent-eval-harness

Live, open-source benchmark for comparing AI coding agents on real GitHub issues

兼容平台~Claude Code~Codex CLI~Cursor
npx add-skill linny006/agent-eval-harness

linny006/agent-eval-harness

Live, open-source benchmark for comparing AI coding agents on real GitHub issues

相关技能