Complete directory of AI agents, frameworks, platforms, and tools — March 2026 Edition. Covers 100+ tools across coding agents, multi-agent patterns, MCP/A2A protocols, memory, observability, and security.
57% of enterprises have AI agents in production. 73% of Fortune 500 are deploying multi-agent workflows. This list cuts through the noise.
Agent Frameworks (Production-Ready)
Agent Platforms & Products
Coding Agents
Protocols & Standards
Multi-Agent Patterns
Agent Memory
Agent Observability
Agent Security & Guardrails
Voice Agents
Agent Deployment
RAG & Knowledge
Key Stats (March 2026)
Related Awesome Lists
1. Agent Frameworks (Production-Ready)
| Framework | Stars | Key Feature | Best For |
|---|---|---|---|
| LangGraph 1.0.8 | 19K+ | Durable execution, graph-based state machines (cycles supported) | Complex stateful workflows, production pipelines |
| CrewAI 2.x | 38K+ | Role-based multi-agent crews, task delegation | Team simulations, parallel agent collaboration |
| OpenAI Agents SDK | 12K+ | Routines + handoffs (replaces Swarm) | OpenAI-native agent workflows, voice integration |
| Claude Agent SDK | n/a | MCP-native, hooks, memory, skills, worktrees | Anthropic-native, Claude Code extensions |
| Google ADK | 8K+ | A2A protocol native, Vertex AI integration | Google Cloud agents, A2A interoperability |
| Microsoft AutoGen 0.4+ | 50K+ | Multi-agent conversation, actor model | Research, enterprise multi-agent dialogue |
| Pydantic AI | 11K+ | Type-safe agents, validated structured outputs | Production APIs, strict output contracts |
| SmolAgents | 18K+ | Minimalist CodeAgent, ~1K lines core | Learning, lightweight deployments, HuggingFace models |
| AG2 | 7K+ | Community AutoGen fork, faster releases | AutoGen users wanting community-driven roadmap |
| LangChain | 116K+ | Massive ecosystem, 1000+ integrations | Prototyping, integration-heavy workflows |
| Agno | 35K+ | Multimodal agents, runtime + control plane | Multimodal tasks, agent observability |
| Composio | 27K+ | 100+ tool integrations, MCP support | Tool-heavy agents, SaaS automation |
Local / Open-Source Runtimes
| Framework | Stars | Key Feature | Best For |
|---|---|---|---|
| Open Interpreter | 56K+ | Natural language → code execution | Local automation, full OS control |
| Dify | 60K+ | Visual workflow builder, self-hosted | Teams wanting GUI agent builder |
| Flowise | 32K+ | Drag-and-drop node editor | Low-code agent building |
| Goose | 10K+ | Block's extensible agent, MCP-native | Developer automation on local machines |
2. Agent Platforms & Products
Autonomous Agents (Cloud)
| Platform | Backing | Key Feature | Notable |
|---|---|---|---|
| Devin 2.0 | Cognition | Autonomous SWE, parallel instances, 30-day project memory | 23.7% on SWE-bench Pro, per the leaderboard in this list (the original Devin reported 13.86% on SWE-bench) |
| Manus AI | Meta ($2B acq.) | Iterate-loop multi-agent, 100+ tool integrations | Most viral agent demo of early 2026 |
| OpenAI Deep Research | OpenAI | 5–30 min autonomous research sessions, citation-backed | GPT-4o + o3 backbone, best research agent |
| Google Deep Research | Google | Gemini 2.0 backbone, Workspace integration | Best for Google Workspace users |
| Replit Agent | Replit | Full-stack app builder, deploys instantly | Fastest path from idea to live app |
| Bolt.new | StackBlitz | Instant web app generation, browser-native | Frontend prototypes in seconds |
| Lovable | Lovable | Product-focused app builder, Supabase integration | Non-technical founders |
| v0 | Vercel | UI component generation, React/Tailwind | Frontend engineers, design-to-code |
Enterprise Agent Platforms
3. Coding Agents
| Tool | Stars | Architecture | Strength |
|---|---|---|---|
| Claude Code | n/a | 1M context, MCP, hooks, skills, memory, worktrees | Deep codebase understanding, agentic loops |
| Cursor | n/a | IDE-native, agent mode, Composer | Best IDE integration, fastest iteration |
| Windsurf | n/a | Cascade flow, multi-file awareness | Flow-based editing, Codeium integration |
| Cline | 28K+ | VS Code extension, any model, MCP | Open source, model-agnostic, community |
| Aider | 25K+ | Terminal, git-aware, pair programming | CLI users, git-native, repo-wide edits |
| Continue.dev | 20K+ | Open source, any model, VS Code + JetBrains | Privacy-first, self-hosted model support |
| OpenHands | 65K+ | Docker-isolated, web UI, MIT license | Sandboxed execution, evaluation benchmarks |
| SWE-agent | 15K+ | Princeton, ACI interface, benchmark-driven | Research, SWE-bench evaluation |
| GitHub Copilot Agent | n/a | Tight GitHub integration, code review, PR summaries | GitHub-native workflows |
| Codex CLI | 18K+ | Sandboxed execution, terminal, multimodal | OpenAI-native, safe local execution |
| Amp | n/a | Sourcegraph-backed, repo-wide context | Large monorepo navigation |
| Kiro | n/a | Spec-driven development, AWS integration | AWS-native teams |
SWE-bench Leaderboard (March 2026)
| Agent | SWE-bench Verified | SWE-bench Pro |
|---|---|---|
| OpenHands + Claude Sonnet | 80.0% | n/a |
| SWE-agent + GPT-4o | 55.0% | n/a |
| Devin 2.0 | n/a | 23.7% |
| Amazon Q Developer | 50.0% | n/a |
4. Protocols & Standards
MCP — Model Context Protocol
github.com/modelcontextprotocol | Anthropic | 10,000+ servers | 97M SDK downloads
Universal open standard for connecting AI agents to tools, data, and APIs
Transports: stdio (local) and Streamable HTTP (remote, 2025 spec), which supersedes the older HTTP+SSE transport
Server primitives: Tools, Resources, Prompts (Sampling is a client capability that servers can request)
Major adopters: VS Code, Claude, Cursor, Windsurf, Zed, JetBrains, Cline
MCP Server Hub — discover and share servers
Client → MCP Server → Tool/DB/API
(JSON-RPC 2.0)
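The client-to-server hop above carries JSON-RPC 2.0 messages. As a rough sketch of what a client might put on the wire, the snippet below builds `tools/list` and `tools/call` requests and frames them one per line, as the stdio transport expects. The tool name `search_docs` and its arguments are hypothetical placeholders, and the helper names are illustrative, not part of any SDK.

```python
import json
from itertools import count

_ids = count(1)  # JSON-RPC request ids; each request gets the next integer


def make_request(method, params=None):
    """Build a JSON-RPC 2.0 request as a plain dict."""
    msg = {"jsonrpc": "2.0", "id": next(_ids), "method": method}
    if params is not None:
        msg["params"] = params
    return msg


def encode_stdio(msg):
    """Serialize one message as a single newline-terminated line."""
    return json.dumps(msg, separators=(",", ":")) + "\n"


# Ask the server which tools it exposes, then call one of them.
list_tools = make_request("tools/list")
call_tool = make_request(
    "tools/call",
    {
        "name": "search_docs",  # hypothetical tool name
        "arguments": {"query": "durable execution"},  # hypothetical arguments
    },
)

if __name__ == "__main__":
    print(encode_stdio(list_tools), end="")
    print(encode_stdio(call_tool), end="")
```

An official MCP SDK would normally handle this framing, plus the initialization handshake that precedes any tool call; the sketch only shows the message shape.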
A2A — Agent-to-Agent Protocol
github.com/google/A2A | Google | 100+ partners | Linux Foundation project
Open protocol for agent-to-agent communication, independent of internal architecture
Agent Cards: JSON discovery files (/.well-known/agent.json)
Complements MCP: A2A for agent↔agent, MCP for agent↔tool
Partners: SAP, Salesforce, ServiceNow, MongoDB, Atlassian, Box
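To make Agent Card discovery concrete, here is a minimal sketch that derives the well-known URL from an agent's base URL and reads skill identifiers from a card. The sample card is invented for illustration, and its field names (`name`, `url`, `skills`, `id`) are assumptions about the card layout; check them against the A2A specification before relying on them.

```python
import json
from urllib.parse import urljoin

WELL_KNOWN_PATH = "/.well-known/agent.json"  # discovery path named above


def agent_card_url(base_url):
    """Return the Agent Card URL for the host serving base_url."""
    return urljoin(base_url, WELL_KNOWN_PATH)


def skill_ids(card):
    """List the skill ids a card advertises (empty if none are listed)."""
    return [skill["id"] for skill in card.get("skills", [])]


# Hypothetical card; a real client would fetch it over HTTPS instead.
SAMPLE_CARD = json.loads("""
{
  "name": "Example Summarizer",
  "url": "https://agents.example.com/a2a",
  "skills": [{"id": "summarize", "name": "Summarize documents"}]
}
""")

if __name__ == "__main__":
    print(agent_card_url("https://agents.example.com/some/path"))
    print(skill_ids(SAMPLE_CARD))
```

Because the path is absolute, `urljoin` discards any path on the base URL, so the card is always looked up at the root of the agent's host.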
5. Multi-Agent Patterns
| Pattern | Token Cost | Latency | Reliability | Best For |
|---|---|---|---|---|
| Orchestrator-Worker | Medium | Medium | High (90.2% improvement) | Production default, parallelizable tasks |
| Pipeline | Low | Low | Very High | Sequential, deterministic workflows |
| Debate / Critique | High | High | Very High | High-stakes decisions, accuracy-critical |
| Swarm / Handoffs | Low–Med | Low | Medium | Customer service, routing, triage |
| Mixture of Agents | High | Medium | Highest | Consensus, adversarial robustness |
| Hierarchical | Medium | Medium | High | Complex nested tasks, management layers |
| Reflection | Medium | Medium | High | Code review, self-improvement loops |
Orchestrator-Worker: One planning agent decomposes the task, and multiple specialist agents execute the subtasks in parallel. Reported to deliver a 90.2% task-completion improvement over a single-agent baseline (see Key Stats). The production default for 2026.
Debate / Critique: Two agents propose and critique each other's solutions. Suited to reasoning-heavy tasks (math, logic, strategy), with 30–50% higher accuracy at 2–3x the token cost.
Swarm / Handoffs: Agents hand off context to specialists based on conversation state. OpenAI's Swarm evolved into the Agents SDK handoff primitive.
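The Orchestrator-Worker pattern can be sketched in a few lines. In the example below, `plan` and `worker` are hypothetical stand-ins for LLM calls (a real planner would ask a model to decompose the task), and a thread pool plays the role of the parallel specialists.

```python
from concurrent.futures import ThreadPoolExecutor


def plan(task):
    """Stand-in planner: split a task description into subtasks on ';'."""
    return [part.strip() for part in task.split(";") if part.strip()]


def worker(subtask):
    """Stand-in specialist: a real worker would call a model or a tool."""
    return f"done: {subtask}"


def orchestrate(task, max_workers=4):
    """Decompose the task, run subtasks in parallel, and merge the results."""
    subtasks = plan(task)
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # pool.map yields results in the same order as the input subtasks.
        results = list(pool.map(worker, subtasks))
    return dict(zip(subtasks, results))


if __name__ == "__main__":
    print(orchestrate("fetch issue; write patch; run tests"))
```

Threads fit here because agent workers are typically I/O-bound (waiting on model or tool responses); subtasks that depend on each other would need a Pipeline or Hierarchical arrangement instead.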
6. Agent Memory
| Tool | Stars | Architecture | Best For |
|---|---|---|---|
| Mem0 | 28K+ | Managed, vector + graph, cross-session | Production apps needing managed memory |
| Letta / MemGPT | 14K+ | Stateful agents, editable memory blocks, self-editing | Long-running agents, persistent personas |
| Zep | 7K+ | Temporal knowledge graph, entity tracking | CRM-like memory, relationship tracking |
| Cognee | 5K+ | Knowledge graph + reasoning, GraphRAG | Complex relational knowledge, research |
Working memory → Context window (in-prompt)
Episodic memory → Session history (databases)
Semantic memory → Vector embeddings (knowledge)
Procedural memory → System prompts, skills, rules
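One way to picture the four memory types is as fields on a single container. The sketch below is illustrative only: the class and method names are made up, and semantic memory is approximated with a keyword lookup where a real system would use vector embeddings.

```python
from collections import deque
from dataclasses import dataclass, field


@dataclass
class AgentMemory:
    # Procedural: standing instructions (system prompt, skills, rules).
    procedural: str
    # Working: the last few messages that fit in the context window.
    working: deque = field(default_factory=lambda: deque(maxlen=4))
    # Episodic: full session history (a database in production).
    episodic: list = field(default_factory=list)
    # Semantic: long-term facts (keyword -> fact here, embeddings in practice).
    semantic: dict = field(default_factory=dict)

    def observe(self, message):
        """Record a message in both the short window and the full history."""
        self.working.append(message)
        self.episodic.append(message)

    def build_prompt(self, query):
        """Assemble rules, matching facts, recent messages, and the query."""
        facts = [fact for key, fact in self.semantic.items() if key in query.lower()]
        return "\n".join([self.procedural, *facts, *self.working, query])


if __name__ == "__main__":
    memory = AgentMemory(procedural="Be concise.")
    memory.semantic["mcp"] = "MCP uses JSON-RPC 2.0."
    for i in range(6):
        memory.observe(f"message {i}")
    print(memory.build_prompt("What is MCP?"))
```

The bounded deque makes the trade-off visible: older messages fall out of working memory but stay retrievable from episodic history.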
7. Agent Observability
| Tool | Stars | Key Feature | Best For |
|---|---|---|---|
| Langfuse | 10K+ | Open source, prompt versioning, self-hostable | Privacy-first, LangChain/CrewAI teams |
| LangSmith | n/a | Zero-overhead tracing, LangChain ecosystem | LangChain/LangGraph production |
| AgentOps | 3K+ | Session replay, 400+ framework integrations | Framework-agnostic, debugging agents |
| Braintrust | n/a | 80x faster evals, dataset management | Evaluation-driven development |
| Helicone | 4K+ | Gateway-based, no SDK changes needed | Drop-in observability, any provider |
| Arize Phoenix | 6K+ | Open source, LLM evals, embeddings | ML teams, explainability |
Traces: full agent reasoning chain, tool calls, sub-agent spawns
Spans: latency per step, token cost breakdown
Evals: task success rate, hallucination rate, tool accuracy
Replays: reproduce exact session state for debugging
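The relationship between traces and spans can be shown with a small recorder. This is a sketch using only the standard library; the `Trace` class and its field names are hypothetical and do not mirror the API of any tool listed above.

```python
import time
from contextlib import contextmanager


class Trace:
    """Collects one span record per agent step."""

    def __init__(self):
        self.spans = []

    @contextmanager
    def span(self, name):
        """Time a step; the caller fills in token usage on the yielded record."""
        record = {"name": name, "tokens": 0, "latency_s": None}
        start = time.perf_counter()
        try:
            yield record
        finally:
            # Record latency even if the step raised an exception.
            record["latency_s"] = time.perf_counter() - start
            self.spans.append(record)

    def total_tokens(self):
        """Sum token usage across all recorded spans."""
        return sum(span["tokens"] for span in self.spans)


if __name__ == "__main__":
    trace = Trace()
    with trace.span("plan") as s:
        s["tokens"] = 120  # would come from the model response in practice
    with trace.span("tool:search") as s:
        s["tokens"] = 30
    for span in trace.spans:
        print(span["name"], span["tokens"], f"{span['latency_s']:.6f}s")
    print("total tokens:", trace.total_tokens())
```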
8. Agent Security & Guardrails
| Tool | Stars | Approach | Best For |
|---|---|---|---|
| Guardrails AI | 5K+ | Validators on input/output, retry logic | Structured output validation |
| NeMo Guardrails | 4K+ | Conversation rails, topic control | NVIDIA stack, conversation safety |
| LLM Guard | 3K+ | Prompt sanitization, PII detection | Enterprise compliance |
| Rebuff | 2K+ | Prompt injection detection, heuristic + ML | Injection-aware deployments |
Key Threat Vectors (2026)
| Threat | Description | Mitigation |
|---|---|---|
| Prompt Injection | Malicious content in tool outputs hijacks agent | Input sanitization, sandboxed execution |
| Tool Poisoning | Compromised MCP servers return malicious instructions | Server allowlists, output validation |
| Data Exfiltration | Agent leaks sensitive context to external tools | Output filtering, egress controls |
| Agent Impersonation | Rogue agent spoofs trusted agent identity | A2A auth, signed Agent Cards |
| Runaway Loops | Agent gets stuck in infinite tool-call loops | Max steps, circuit breakers, timeouts |
Sandbox tool execution (Docker, E2B, Daytona)
Validate all tool inputs and outputs
Implement max-steps and cost limits
Use least-privilege API scopes
Log all tool calls for audit trails
Human-in-the-loop for irreversible actions
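The max-steps and cost-limit items in the checklist above can be enforced with a small wrapper around the agent loop. This is a sketch under assumptions: `step_fn` is a hypothetical callable that runs one agent step and returns `(done, cost)`, and the limits shown are arbitrary defaults.

```python
class BudgetExceeded(RuntimeError):
    """Raised when the agent hits its step or cost limit."""


def run_agent(step_fn, max_steps=10, max_cost=1.0):
    """Run step_fn until it reports completion or a limit is reached.

    Returns (steps_taken, total_cost) on success.
    """
    total_cost = 0.0
    for step in range(1, max_steps + 1):
        done, cost = step_fn(step)
        total_cost += cost
        if total_cost > max_cost:
            raise BudgetExceeded(f"cost {total_cost:.2f} exceeded limit {max_cost:.2f}")
        if done:
            return step, total_cost
    raise BudgetExceeded(f"no result after {max_steps} steps")


if __name__ == "__main__":
    # An agent that finishes on its third step, spending 0.1 per step.
    print(run_agent(lambda step: (step == 3, 0.1)))
    # An agent that never finishes is stopped by the step limit.
    try:
        run_agent(lambda step: (False, 0.0), max_steps=5)
    except BudgetExceeded as exc:
        print("stopped:", exc)
```

Raising an exception instead of returning a partial result keeps a runaway loop from being mistaken for a completed task.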
9. Voice Agents
| Platform | Type | Key Feature | Best For |
|---|---|---|---|
| Retell AI | Managed | 500ms latency, interruption handling, CRM integrations | Sales, support call centers |
| Vapi | Managed | 600ms latency, 100+ providers, phone numbers | Developers building voice products |
| Bland AI | Managed | Enterprise-grade, call routing, post-call analysis | High-volume outbound calling |
| LiveKit Agents | Open Source | Real-time audio/video, STT+LLM+TTS pipeline | Self-hosted, custom voice agents |
| OpenAI Realtime API | API | Native voice, vision, interruption detection | GPT-4o voice integration |
| ElevenLabs Conversational | Managed | Ultra-realistic voices, 32 languages | High-fidelity voice quality |
10. Agent Deployment
| Platform | Key Feature | Best For |
|---|---|---|
| Modal | GPU serverless, sub-second cold starts, cron | Python-native, batch agent jobs |
| Replicate | Model hosting, prediction API, fine-tuning | ML model deployment, any framework |
| Together AI | Fast inference, 100+ models, fine-tuning | Open model inference at scale |
| Fireworks AI | Fastest open model inference, FireFunction | Low-latency tool-calling agents |
| E2B | Sandboxed code execution, 150ms boot | Safe code execution inside agents |
| Daytona | Elastic AI code infra, secure workspaces | Coding agent sandboxes |
| Platform | Key Feature | Best For |
|---|---|---|
| BentoML | Model serving, batching, async | ML engineers, custom inference |
| Ollama | 100+ models, simple CLI, GPU support | Local development, privacy-first |
| LocalAI | OpenAI-compatible API, any model | Drop-in local OpenAI replacement |
| vLLM | High-throughput serving, PagedAttention | Production local inference at scale |
11. RAG & Knowledge
| Tool | Stars | Key Feature | Best For |
|---|---|---|---|
| LlamaIndex | 37K+ | Advanced RAG, structured queries, agents | Complex document understanding |
| Haystack | 18K+ | Production NLP pipelines, modular | Enterprise search, QA systems |
| DSPy | 21K+ | Programmatic LM optimization, signatures | Prompt optimization, research |
| Database | Stars | Key Feature | Best For |
|---|---|---|---|
| Qdrant | 22K+ | Rust, payload filtering, sparse+dense | Production, fast filtered search |
| Chroma | 17K+ | Embedded, developer-friendly | Prototyping, local development |
| Weaviate | 12K+ | GraphQL, multi-tenancy, hybrid search | Enterprise, multi-tenant SaaS |
| Milvus | 32K+ | Cloud-native, billion-scale | High-scale production |
| pgvector | 14K+ | PostgreSQL extension | PostgreSQL shops, no new infra |
| Pinecone | n/a | Managed, serverless, real-time upserts | Fully managed, no ops burden |
12. Key Stats (March 2026)
| Metric | Value | Source |
|---|---|---|
| Enterprises with agents in production | 57% | McKinsey State of AI 2026 |
| Fortune 500 deploying multi-agent workflows | 73% | Gartner Q1 2026 |
| SWE-bench Verified SOTA | 80% (OpenHands + Claude) | SWE-bench.org |
| SWE-bench Pro SOTA | 23.7% (Devin 2.0) | Cognition |
| MCP servers live | 10,000+ | Anthropic |
| MCP SDK downloads | 97M | Anthropic |
| A2A protocol partners | 100+ | Google |
| Projected agent market by 2028 | $450B | Grand View Research |
| Voice agent market CAGR | 34% | MarketsandMarkets |
| Average agent task completion improvement (multi-agent vs single) | 90.2% | Stanford HAI 2026 |
Model Benchmarks (March 2026)
| Model | MMLU | HumanEval | MATH | Context |
|---|---|---|---|---|
| Claude Sonnet 4 | 90.2% | 92.4% | 86.1% | 1M |
| GPT-4o (March 2026) | 89.7% | 90.1% | 83.4% | 128K |
| Gemini 2.0 Ultra | 91.0% | 88.3% | 88.7% | 1M |
| DeepSeek R2 | 88.9% | 91.2% | 90.1% | 128K |
| Llama 4 Maverick | 85.3% | 84.1% | 79.8% | 1M |
13. Related Awesome Lists
Community
PRs welcome. Please follow these guidelines:
Add tools with GitHub stars, key feature, and best-for use case
Keep descriptions to one line — link to official docs for details
Group by the most relevant section
Update star counts when significantly outdated (>25% change)
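For the star-count guideline, a quick check like the one below can tell contributors whether an entry is stale. It is a sketch that assumes the 25% change is measured relative to the count currently listed; the function name is illustrative.

```python
def is_outdated(listed, current, threshold=0.25):
    """Return True if the star count moved by more than `threshold`
    relative to the listed value."""
    if listed <= 0:
        raise ValueError("listed star count must be positive")
    return abs(current - listed) / listed > threshold


if __name__ == "__main__":
    print(is_outdated(19_000, 24_000))  # change of about 26%
    print(is_outdated(19_000, 21_000))  # change of about 11%
```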
See CONTRIBUTING.md for full guidelines.
MIT License — see LICENSE for details.
Last updated: March 2026 | Report an issue | Request addition