Community研究與資料分析github.com

linny006/agent-eval-harness

Live, open-source benchmark for comparing AI coding agents on real GitHub issues

相容平台~Claude Code~Codex CLI~Cursor
npx add-skill linny006/agent-eval-harness

linny006/agent-eval-harness

Live, open-source benchmark for comparing AI coding agents on real GitHub issues

相關技能