timBrockman/prompt-skill-eval
Modular, testable prompt and agent skill library with DeepEval metrics, multi-model CI, and Vercel npx skills compatibility. Prototype reusable skills (SKILL.md + Python registry) for production-grade LLM development.
Modular, testable prompt and agent skill library with DeepEval metrics, multi-model CI, and Vercel npx skills compatibility. Prototype reusable skills (SKILL.md + Python registry) for production-grade LLM development.
npx add-skill timBrockman/prompt-skill-evalModular, testable prompt and agent skill library with DeepEval metrics, multi-model CI, and Vercel npx skills compatibility. Prototype reusable skills (SKILL.md + Python registry) for production-grade LLM development.
Visual-first UI/UX collaboration for AI agents — browser automation, screenshot comparison, wireframes, design tokens, and the 8-step design loop.
A draft-critique toolkit built from Glenn Kramon's Winning Writing at Stanford GSB and Rachel Konrad's cold-outreach guest lectures. 31 Claude skills, a browser Coach with span-level inline critic and refinement chat, plus a Chrome MV3 extension that runs the same critic in the Gmail side panel.
VULK MCP Server — Build, deploy, and export full-stack applications from any AI assistant.
LLM agent skills for software architecture analysis and review. This repository equips AI agents with reusable capabilities to evaluate system design, review architecture decisions, identify risks, analyze trade-offs, and recommend improvements for scalable, reliable, and maintainable systems.
TypeScript AI agent framework — 40+ LLM providers, 100+ tools, multi-agent orchestration, RAG, guardrails, circuit breakers, HITL, budget enforcement. Open-source CrewAI / LangChain alternative. — For Building Production-Grade Multi Agents in Minutes at Scale
Codex skill for API-based image generation through OpenAI-compatible providers