CommunityArte e Designgithub.com

prototypebench/prototypebench

Open benchmark for AI coding agents on full-stack feature shipping (React+Vite+Tailwind/FastAPI+SQLModel). 71 PR-mined tasks · 32k tests · execution-based scoring (pytest+Playwright) · no LLM-as-judge.

Funciona com~Claude Code~Codex CLI~Cursor
npx skills add prototypebench/prototypebench

Ask in your favorite AI

Open a new chat with this agent skill pre-loaded.

Documentação

prototypebench/prototypebench

Open benchmark for AI coding agents on full-stack feature shipping (React+Vite+Tailwind/FastAPI+SQLModel). 71 PR-mined tasks · 32k tests · execution-based scoring (pytest+Playwright) · no LLM-as-judge.

Habilidades Relacionadas