truera/trulens
Evaluation tools measuring hallucination rates, toxic deviations, and context relevance throughout agent loops.
npx add-skill truera/trulens
End-to-end agentic research workbench. Turns a topic into a venue-ready manuscript via cross-host orchestration (Claude Code, Codex CLI, Gemini CLI, Antigravity). 17 specialized subagents, Source-Grounded Claim Architecture (SGCA), 23 publisher templates, 7 first-class languages.
Claude Code skill: LLM Wiki knowledge base (Karpathy pattern)
Multi-platform topic intelligence API for AI agents. Install a Skill guide, auto-collect data from Xiaohongshu, GitHub, Twitter and more, then generate structured intel reports locally.
Quorum: orchestrate a swarm of AI experts on any question. Specialists debate, research, and validate — then a polymath supervisor delivers the verdict. One command, multiple minds, stress-tested answers.
Agent skill repository discovered by 10x-chat research.
A collection of skills for Claude / Claude Code