Communitygithub.com

vukhanhtruong/claude-eval-skill

Name: vukhanhtruong/claude-eval-skill
Author: Community

A collection of Claude Code skills for evaluating LLM applications end-to-end: prompts, RAG pipelines, tool calls, and agents. Built for prompt engineers, AI builders, and teams who want a repeatable loop of "edit → eval → see scores → improve" instead of vibes-based iteration.

¿Qué es claude-eval-skill?

claude-eval-skill is a Claude Code agent skill that a collection of Claude Code skills for evaluating LLM applications end-to-end: prompts, RAG pipelines, tool calls, and agents. Built for prompt engineers, AI builders, and teams who want a repeatable loop of "edit → eval → see scores → improve" instead of vibes-based iteration.

Compatible con✓Claude Code~Codex CLI~Cursor

Part ofDesign & UI

npx skills add vukhanhtruong/claude-eval-skill

Ver original→Ver todas las habilidades

Preguntar en tu IA favorita

Abre un nuevo chat con esta habilidad de agente ya precargada.

ChatGPT Claude Gemini Grok Perplexity DeepSeek

Documentación

¿Qué hace claude-eval-skill?

Source: https://github.com/vukhanhtruong/claude-eval-skill

Pushed: 2026-05-12T10:07:30Z Stars: 0

Skills relacionados

steipete/sag

ElevenLabs text-to-speech with mac-style say UX.

community

steipete/oracle

Oracle CLI second-model review/debug/refactor/design with selected files, dry-run token checks, API or browser engine.

community

steipete/peekaboo

Capture and automate macOS UI with the Peekaboo CLI.

community

obra/brainstorming

You MUST use this before any creative work - creating features, building components, adding functionality, or modifying behavior. Explores user intent, requirements and design before implementation.

community

affaan-m/prisma-patterns

Prisma ORM patterns for TypeScript backends — schema design, query optimization, transactions, pagination, and critical traps like updateMany returning count not records, $transaction timeouts, migrate dev resetting the DB, @updatedAt skipped on bulk writes, and serverless connection exhaustion.

community

affaan-m/django-celery

Django + Celery async task patterns — configuration, task design, beat scheduling, retries, canvas workflows, monitoring, and testing. Use when adding background jobs, scheduled tasks, or async processing to a Django app.

community