trajectoryRL/trajrl-bench

TrajRL-Bench: AI agent skills benchmark. SSH sandbox with mock services, LLM judge scoring, split-half delta evaluation. Leaderboard at trajrl.com/bench

지원 대상~Claude Code~Codex CLI~Cursor

npx skills add trajectoryRL/trajrl-bench

원본 보기→모든 스킬 둘러보기

Ask in your favorite AI

Open a new chat with this agent skill pre-loaded.

ChatGPT Claude Gemini Grok Perplexity DeepSeek

문서

trajectoryRL/trajrl-bench

TrajRL-Bench: AI agent skills benchmark. SSH sandbox with mock services, LLM judge scoring, split-half delta evaluation. Leaderboard at trajrl.com/bench

관련 스킬

LeventLei/my-claude-skills

Personal Claude Code skills collection

community

PatrickSUDO/fadacai-portfolio

Claude Code 投資研究與組合管理框架：skills + MCP + 第一性原理紀律 + thesis ledger

community

Synthadoc: An open-source LLM knowledge compilation engine that turns raw documents into structured, local-first wikis. A transparent, human-readable alternative to traditional RAG, which can be self-managed and self-improved without the use of any tools.

community

← More 코딩 & 개발 skills

trajectoryRL/trajrl-bench

Ask in your favorite AI

문서

trajectoryRL/trajrl-bench

관련 스킬

LeventLei/my-claude-skills

PatrickSUDO/fadacai-portfolio

axoviq-ai/synthadoc

Ficere/pipeline-assessment

wandanan/dazi-todo-releases

Yeachan-Heo/oh-my-claudecode