Community艺术与设计github.com

prototypebench/prototypebench

Open benchmark for AI coding agents on full-stack feature shipping (React+Vite+Tailwind/FastAPI+SQLModel). 71 PR-mined tasks · 32k tests · execution-based scoring (pytest+Playwright) · no LLM-as-judge.

兼容平台~Claude Code~Codex CLI~Cursor
npx add-skill prototypebench/prototypebench

prototypebench/prototypebench

Open benchmark for AI coding agents on full-stack feature shipping (React+Vite+Tailwind/FastAPI+SQLModel). 71 PR-mined tasks · 32k tests · execution-based scoring (pytest+Playwright) · no LLM-as-judge.

相关技能