truera/trulens
Evaluation tools measuring hallucination rates, toxic deviations, and context relevance throughout agent loops.
Evaluation tools measuring hallucination rates, toxic deviations, and context relevance throughout agent loops.
npx skills add truera/trulensEvaluation tools measuring hallucination rates, toxic deviations, and context relevance throughout agent loops.
Build a RAG pipeline using natural language. Give your agent the skill to query complex vector stores through simple chat.
Claude Code skill that derives infrastructure architectures from workload analysis using first-principles math (Little's Law, Erlang-C, USL, availability calculus). 7-dimension constraint framework with swappable cloud/regulatory catalog.
workflow primitives for async AI preparation, observation, analysis, and publishing
Enterprise-grade (40m+ LOC) codebase intelligence, zero-setup, local & private Plugin/Skill/Extension or MCP: hybrid semantic search, polyglot dependency graphs, symbol-level impact analysis & call-flow, interactive HTML viewer, cross-project & branch-aware search, DB/API/infra knowledge. 61% less tokens, 84% fewer calls, 37x faster. Cloud in beta.
Thin Claude Code skill for commercial/market research: triages topics across 12 data domains, detects & guides installing the right MCP source, delegates heavy retrieval to deep-research. Curated source matrix + quality guardrails.
Multi-agent financial trading framework leveraging LLMs to simulate real-world trading firm dynamics