trajectoryRL/trajrl-bench

TrajRL-Bench: AI agent skills benchmark. SSH sandbox with mock services, LLM judge scoring, split-half delta evaluation. Leaderboard at trajrl.com/bench

Works with: Claude Code, Codex CLI, Cursor

npx skills add trajectoryRL/trajrl-bench

Related Skills

wakatime/claude-code-wakatime

Track how much time you spend using Claude Code to AI-code

community

tonmoy007/forge-plugins

12-stage gated SDLC orchestrator for Claude Code — deterministic gates, REQ-ID traceability, cost-capped background agents, brownfield adoption

community

jecanore/claude-loadout

The exact Claude Code skill stack I ship with as a solo founder dev — 11 original skills + curated upstream manifest

community

24braids/terrashark

Fix Terraform hallucinations in LLMs by enforcing best practices, modular code, and security for Terraform and OpenTofu configurations.

community

realraelrr/knot-agent

Local-first Codex agent workspace scaffold with skills, runtime checks, IM routing, and filesystem boundaries.

community

ComposioHQ/aeroleads-automation

Automate Aeroleads tasks via Rube MCP (Composio). Always search tools first for current schemas.

community
