axxafo/awesome-agent-benchmarks

๐Ÿง  Discover and evaluate advanced benchmark datasets for Large Language Model agents to enhance performance assessment in real-world tasks.

์ง€์› ๋Œ€์ƒ~Claude Code~Codex CLI~Cursor
npx skills add axxafo/awesome-agent-benchmarks

๋ฌธ์„œ

axxafo/awesome-agent-benchmarks

๐Ÿง  Discover and evaluate advanced benchmark datasets for Large Language Model agents to enhance performance assessment in real-world tasks.

Source: https://github.com/axxafo/awesome-agent-benchmarks

Pushed: 2026-05-12T06:29:00Z Stars: 3

๊ด€๋ จ ์Šคํ‚ฌ