trajectoryRL/trajrl-bench

TrajRL-Bench: AI agent skills benchmark. SSH sandbox with mock services, LLM judge scoring, split-half delta evaluation. Leaderboard at trajrl.com/bench

Compatible with: Claude Code, Codex CLI, Cursor
npx skills add trajectoryRL/trajrl-bench
