
prototypebench/prototypebench

Name: prototypebench/prototypebench
Author: Community

Open benchmark for AI coding agents on full-stack feature shipping (React+Vite+Tailwind/FastAPI+SQLModel). 71 PR-mined tasks · 32k tests · execution-based scoring (pytest+Playwright) · no LLM-as-judge.

What is prototypebench?

prototypebench is a Claude Code agent skill that provides an open benchmark for AI coding agents on full-stack feature shipping (React+Vite+Tailwind/FastAPI+SQLModel). It comprises 71 PR-mined tasks and 32k tests, and uses execution-based scoring (pytest+Playwright) with no LLM-as-judge.
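As a rough illustration of what execution-based scoring means in general, the sketch below treats a task as passed only when every check command exits with status 0, with no model grading the output. This is not prototypebench's actual harness, which this page does not describe; the function names `run_check` and `score_task` and the shape of the `checks` mapping are hypothetical.

```python
import subprocess
import sys


def run_check(cmd):
    """Run one check command; it passes only if the exit status is 0."""
    completed = subprocess.run(cmd, capture_output=True)
    return completed.returncode == 0


def score_task(checks):
    """Run every named check and report per-check results plus an overall verdict.

    `checks` maps a hypothetical check name to an argv list. In a real setup
    these would be test-runner invocations (for example pytest or Playwright).
    """
    results = {name: run_check(cmd) for name, cmd in checks.items()}
    return results, all(results.values())


if __name__ == "__main__":
    # Stand-in commands so the sketch runs without any test suite installed.
    demo_checks = {
        "backend": [sys.executable, "-c", "pass"],
        "frontend": [sys.executable, "-c", "raise SystemExit(1)"],
    }
    per_check, passed = score_task(demo_checks)
    print(per_check, passed)
```

The verdict depends only on process exit codes, which is what makes this kind of scoring reproducible: rerunning the same commands against the same code yields the same result.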

Works with: Claude Code, Codex CLI, Cursor

Part of: Design & UI

npx skills add prototypebench/prototypebench


Documentation

What does prototypebench do?

Related Skills

steipete/sag

ElevenLabs text-to-speech with mac-style say UX.

community

steipete/oracle

Oracle CLI second-model review/debug/refactor/design with selected files, dry-run token checks, API or browser engine.

community

steipete/peekaboo

Capture and automate macOS UI with the Peekaboo CLI.

community

obra/brainstorming

You MUST use this before any creative work - creating features, building components, adding functionality, or modifying behavior. Explores user intent, requirements and design before implementation.

community

affaan-m/prisma-patterns

Prisma ORM patterns for TypeScript backends — schema design, query optimization, transactions, pagination, and critical traps like updateMany returning count not records, $transaction timeouts, migrate dev resetting the DB, @updatedAt skipped on bulk writes, and serverless connection exhaustion.

community

affaan-m/django-celery

Django + Celery async task patterns — configuration, task design, beat scheduling, retries, canvas workflows, monitoring, and testing. Use when adding background jobs, scheduled tasks, or async processing to a Django app.

community