Community研究与数据分析github.com

axxafo/awesome-agent-benchmarks

🧠 Discover and evaluate advanced benchmark datasets for Large Language Model agents to enhance performance assessment in real-world tasks.

兼容平台~Claude Code~Codex CLI~Cursor
npx add-skill axxafo/awesome-agent-benchmarks

axxafo/awesome-agent-benchmarks

🧠 Discover and evaluate advanced benchmark datasets for Large Language Model agents to enhance performance assessment in real-world tasks.

Source: https://github.com/axxafo/awesome-agent-benchmarks

Pushed: 2026-05-12T06:29:00Z Stars: 3

相关技能