Community研究&データ分析github.com

axxafo/awesome-agent-benchmarks

🧠 Discover and evaluate advanced benchmark datasets for Large Language Model agents to enhance performance assessment in real-world tasks.

対応~Claude Code~Codex CLI~Cursor
npx skills add axxafo/awesome-agent-benchmarks

Ask in your favorite AI

Open a new chat with this agent skill pre-loaded.

ドキュメント

axxafo/awesome-agent-benchmarks

🧠 Discover and evaluate advanced benchmark datasets for Large Language Model agents to enhance performance assessment in real-world tasks.

Source: https://github.com/axxafo/awesome-agent-benchmarks

Pushed: 2026-05-12T06:29:00Z Stars: 3

関連スキル