truera/trulens
Evaluation tools measuring hallucination rates, toxic deviations, and context relevance throughout agent loops.
Evaluation tools measuring hallucination rates, toxic deviations, and context relevance throughout agent loops.
npx skills add truera/trulensEvaluation tools measuring hallucination rates, toxic deviations, and context relevance throughout agent loops.
An auto-fixing linter, activation tester, and CI gatekeeper for Claude Code & Codex skills.
Business operating system for Claude Code — 25 skills, 13 agents, smart daemon. Unified inbox (WhatsApp/Email/Slack/Telegram), autonomous PR merge, full-AWS monitoring, revenue (Stripe+RevenueCat), e-commerce (Shopify), marketing (Klaviyo/Meta/GA4), voice (Bland/ElevenLabs), APM (Datadog/NewRelic/OTEL), YOLO mode.
Analyze market data and execute perpetual futures trades on Hyperliquid using an AI agent powered by Claude with built-in risk management.
The bedrock layer for AI coding agents. One governance.md. Any project. Never stale. Universal skills + cross-agent compilation (Claude, Cursor, Codex, Gemini, Aider).
API-first semantic engine and query planner for AI agents that compiles declarative YAML models into optimized, dialect-specific SQL across BigQuery, PostgreSQL, Snowflake, ClickHouse, Dremio, Databricks, DuckDB, and MySQL.
Local skill retrieval library for coding agents: lexical search, Chroma vector search, warm daemon mode, and learning-loop logs.