CommunityDéveloppement et programmationgithub.com

adewale/skill-eval-harness

Agent Skill evaluation harness for paired variants, trace artifacts, and runner adapters

Qu'est-ce que skill-eval-harness ?

skill-eval-harness is a Claude Code agent skill that agent Skill evaluation harness for paired variants, trace artifacts, and runner adapters.

Compatible avec~Claude Code~Codex CLI~Cursor

npx skills add adewale/skill-eval-harness

Installed? Explore more Développement et programmation skills: steipete/bluebubbles, steipete/eightctl, steipete/blucli · View all 6 →

Voir l'original→Voir toutes les compétences

Demander à votre IA préférée

Ouvre une nouvelle conversation avec cette compétence d'agent déjà préchargée.

ChatGPT Claude Gemini Grok Perplexity DeepSeek

Documentation

Demo Reviewer

A deliberately tiny skill that exists only to demonstrate the harness end to end (materialized ablations included). It has exactly two load-bearing pieces — a severity rule and a checklist reference — each targeted by one ablation below.

Severity rules

Label every finding with one of: Blocking, Minor, or Clean. State the label explicitly so the reader can triage at a glance. A change that ships without a test is at least Blocking.

Evidence

Back each finding with concrete evidence and follow the shared review checklist: see the review checklist.

← More Développement et programmation skills

adewale/skill-eval-harness

Qu'est-ce que skill-eval-harness ?

Demander à votre IA préférée

Documentation

Demo Reviewer

Severity rules

Evidence

Skills associés

steipete/bluebubbles

steipete/eightctl

steipete/blucli

steipete/bear-notes

steipete/camsnap

steipete/gifgrep