
linny006/agent-eval-harness

Live, open-source benchmark for comparing AI coding agents on real GitHub issues

Works with: Claude Code, Codex CLI, Cursor
npx add-skill linny006/agent-eval-harness
