linny006/agent-eval-harness
Live, open-source benchmark for comparing AI coding agents on real GitHub issues
Live, open-source benchmark for comparing AI coding agents on real GitHub issues
npx add-skill linny006/agent-eval-harnessLive, open-source benchmark for comparing AI coding agents on real GitHub issues
🧠 Enhance AI conversations with Cognio, a persistent memory server that retains context and enables meaningful semantic search across sessions.
Audit AI coding agents and project configurations for security flaws using static analysis, custom rules, and LLM-assisted verification.
A strict AI context skill (skills.sh compatible) defining architectural rules, UI limits, and error prevention guidelines for generating WhatsApp Flows and Data Exchange Endpoints in Botmaker.
Agent skill repository discovered via 10x-chat provider research.
Multi-skill Claude Code plugin for Karpathy-style interlinked markdown knowledge bases. Hermes Agent llm-wiki wire-compatible.
A memory-first AI agent that remembers why decisions were made — not just the last message. Runs local (Ollama), cloud (Claude · OpenAI · Gemini), or decentralized TEE. Graph memory, self-learning skills, multi-model routing, sandboxed tools. MCP · ACP · A2A. One Rust binary.