
prototypebench/prototypebench

Open benchmark for AI coding agents on full-stack feature shipping (React+Vite+Tailwind/FastAPI+SQLModel). 71 PR-mined tasks · 32k tests · execution-based scoring (pytest+Playwright) · no LLM-as-judge.

Supported agents: Claude Code, Codex CLI, Cursor
npx skills add prototypebench/prototypebench


