Skip to content

Supersynergy/awesome-ai-agents-2025

Repository files navigation

Awesome AI Agents 2026

Awesome License: MIT PRs Welcome Updated

Complete directory of AI agents, frameworks, platforms, and tools — March 2026 Edition. Covers 100+ tools across coding agents, multi-agent patterns, MCP/A2A protocols, memory, observability, and security.

57% of enterprises have AI agents in production. 73% of Fortune 500 are deploying multi-agent workflows. This list cuts through the noise.


Table of Contents

  1. Agent Frameworks (Production-Ready)
  2. Agent Platforms & Products
  3. Coding Agents
  4. Protocols & Standards
  5. Multi-Agent Patterns
  6. Agent Memory
  7. Agent Observability
  8. Agent Security & Guardrails
  9. Voice Agents
  10. Agent Deployment
  11. RAG & Knowledge
  12. Key Stats (March 2026)
  13. Related Awesome Lists

1. Agent Frameworks (Production-Ready)

Framework Stars Key Feature Best For
LangGraph 1.0.8 19K+ Durable execution, DAG-based state machines Complex stateful workflows, production pipelines
CrewAI 2.x 38K+ Role-based multi-agent crews, task delegation Team simulations, parallel agent collaboration
OpenAI Agents SDK 12K+ Routines + handoffs (replaces Swarm) OpenAI-native agent workflows, voice integration
Claude Agent SDK MCP-native, hooks, memory, skills, worktrees Anthropic-native, Claude Code extensions
Google ADK 8K+ A2A protocol native, Vertex AI integration Google Cloud agents, A2A interoperability
Microsoft AutoGen 0.4+ 50K+ Multi-agent conversation, actor model Research, enterprise multi-agent dialogue
Pydantic AI 11K+ Type-safe agents, validated structured outputs Production APIs, strict output contracts
SmolAgents 18K+ Minimalist CodeAgent, ~1K lines core Learning, lightweight deployments, HuggingFace models
AG2 7K+ Community AutoGen fork, faster releases AutoGen users wanting community-driven roadmap
LangChain 116K+ Massive ecosystem, 1000+ integrations Prototyping, integration-heavy workflows
Agno 35K+ Multimodal agents, runtime + control plane Multimodal tasks, agent observability
Composio 27K+ 100+ tool integrations, MCP support Tool-heavy agents, SaaS automation

Local / Open-Source Runtimes

Framework Stars Key Feature Best For
Open Interpreter 56K+ Natural language → code execution Local automation, full OS control
Dify 60K+ Visual workflow builder, self-hosted Teams wanting GUI agent builder
Flowise 32K+ Drag-and-drop node editor Low-code agent building
Goose 10K+ Block's extensible agent, MCP-native Developer automation on local machines

2. Agent Platforms & Products

Autonomous Agents (Cloud)

Platform Backing Key Feature Notable
Devin 2.0 Cognition Autonomous SWE, parallel instances, 30-day project memory Solves 13.86% SWE-bench (Pro) autonomously
Manus AI Meta ($2B acq.) Iterate-loop multi-agent, 100+ tool integrations Most viral agent demo of early 2026
OpenAI Deep Research OpenAI 5–30 min autonomous research sessions, citation-backed GPT-4o + o3 backbone, best research agent
Google Deep Research Google Gemini 2.0 backbone, integrated Workspace Best for Google Workspace users
Replit Agent Replit Full-stack app builder, deploys instantly Fastest path from idea to live app
Bolt.new StackBlitz Instant web app generation, browser-native Frontend prototypes in seconds
Lovable Lovable Product-focused app builder, Supabase integration Non-technical founders
v0 Vercel UI component generation, React/Tailwind Frontend engineers, design-to-code

Enterprise Agent Platforms

Platform Key Feature Best For
Microsoft Copilot Studio Teams + M365 integration, AutoGen backend Enterprise Microsoft shops
Salesforce Agentforce CRM-native, Einstein AI Sales and service automation
ServiceNow AI Agents ITSM-native, workflow automation IT operations
Workday AI Agents HR + Finance workflows Enterprise HR/Finance teams

3. Coding Agents

Tool Stars Architecture Strength
Claude Code 1M context, MCP, hooks, skills, memory, worktrees Deep codebase understanding, agentic loops
Cursor IDE-native, agent mode, Composer Best IDE integration, fastest iteration
Windsurf Cascade flow, multi-file awareness Flow-based editing, Codeium integration
Cline 28K+ VS Code extension, any model, MCP Open source, model-agnostic, community
Aider 25K+ Terminal, git-aware, pair programming CLI users, git-native, repo-wide edits
Continue.dev 20K+ Open source, any model, VS Code + JetBrains Privacy-first, self-hosted model support
OpenHands 65K+ Docker-isolated, web UI, MIT license Sandboxed execution, evaluation benchmarks
SWE-agent 15K+ Princeton, ACI interface, benchmark-driven Research, SWE-bench evaluation
GitHub Copilot Agent Tight GitHub integration, code review, PR summaries GitHub-native workflows
Codex CLI 18K+ Sandboxed execution, terminal, multimodal OpenAI-native, safe local execution
Amp Sourcegraph-backed, repo-wide context Large monorepo navigation
Kiro Spec-driven development, AWS integration AWS-native teams

SWE-bench Leaderboard (March 2026)

Agent SWE-bench Verified SWE-bench Pro
OpenHands + Claude Sonnet 80.0%
SWE-agent + GPT-4o 55.0%
Devin 2.0 23.7%
Amazon Q Developer 50.0%

4. Protocols & Standards

MCP — Model Context Protocol

github.com/modelcontextprotocol | Anthropic | 10,000+ servers | 97M SDK downloads

  • Universal open standard for connecting AI agents to tools, data, and APIs
  • Transport: stdio (local), HTTP+SSE (remote), Streamable HTTP (2025 spec)
  • Server types: Tools, Resources, Prompts, Sampling
  • Major adopters: VS Code, Claude, Cursor, Windsurf, Zed, JetBrains, Cline
  • MCP Server Hub — discover and share servers
Client → MCP Server → Tool/DB/API
         (JSON-RPC 2.0)

A2A — Agent-to-Agent Protocol

github.com/google/A2A | Google | 100+ partners | Linux Foundation project

  • Open protocol for agent-to-agent communication, independent of internal architecture
  • Agent Cards: JSON discovery files (/.well-known/agent.json)
  • Complements MCP: A2A for agent↔agent, MCP for agent↔tool
  • Partners: SAP, Salesforce, ServiceNow, MongoDB, Atlassian, Box

OpenAI Standards

Standard Purpose
Function Calling Tool use, structured outputs
Realtime API Voice + vision agents
Assistants API Thread-persistent agents

5. Multi-Agent Patterns

Pattern Token Cost Latency Reliability Best For
Orchestrator-Worker Medium Medium High (90.2% improvement) Production default, parallelizable tasks
Pipeline Low Low Very High Sequential, deterministic workflows
Debate / Critique High High Very High High-stakes decisions, accuracy-critical
Swarm / Handoffs Low–Med Low Medium Customer service, routing, triage
Mixture of Agents High Medium Highest Consensus, adversarial robustness
Hierarchical Medium Medium High Complex nested tasks, management layers
Reflection Medium Medium High Code review, self-improvement loops

Pattern Details

Orchestrator-Worker — One planning agent decomposes tasks, multiple specialist agents execute in parallel. Delivers 90.2% task completion improvement over single-agent baseline. Production default for 2026.

Debate / Critique — Two agents propose + critique solutions. Best accuracy for reasoning-heavy tasks (math, logic, strategy). 30–50% higher accuracy, 2–3x token cost.

Swarm / Handoffs — Agents hand off context to specialists based on conversation state. OpenAI's Swarm evolved into the Agents SDK handoff primitive.


6. Agent Memory

Tool Stars Architecture Best For
Mem0 28K+ Managed, vector + graph, cross-session Production apps needing managed memory
Letta / MemGPT 14K+ Stateful agents, editable memory blocks, self-editing Long-running agents, persistent personas
Zep 7K+ Temporal knowledge graph, entity tracking CRM-like memory, relationship tracking
Cognee 5K+ Knowledge graph + reasoning, GraphRAG Complex relational knowledge, research

Memory Taxonomy

Working memory    → Context window (in-prompt)
Episodic memory   → Session history (databases)
Semantic memory   → Vector embeddings (knowledge)
Procedural memory → System prompts, skills, rules

7. Agent Observability

Tool Stars Key Feature Best For
Langfuse 10K+ Open source, prompt versioning, self-hostable Privacy-first, LangChain/CrewAI teams
LangSmith Zero-overhead tracing, LangChain ecosystem LangChain/LangGraph production
AgentOps 3K+ Session replay, 400+ framework integrations Framework-agnostic, debugging agents
Braintrust 80x faster evals, dataset management Evaluation-driven development
Helicone 4K+ Gateway-based, no SDK changes needed Drop-in observability, any provider
Arize Phoenix 6K+ Open source, LLM evals, embeddings ML teams, explainability

What to Observe

  • Traces: full agent reasoning chain, tool calls, sub-agent spawns
  • Spans: latency per step, token cost breakdown
  • Evals: task success rate, hallucination rate, tool accuracy
  • Replays: reproduce exact session state for debugging

8. Agent Security & Guardrails

Guardrail Frameworks

Tool Stars Approach Best For
Guardrails AI 5K+ Validators on input/output, retry logic Structured output validation
NeMo Guardrails 4K+ Conversation rails, topic control NVIDIA stack, conversation safety
LLM Guard 3K+ Prompt sanitization, PII detection Enterprise compliance
Rebuff 2K+ Prompt injection detection, heuristic + ML Injection-aware deployments

Key Threat Vectors (2026)

Threat Description Mitigation
Prompt Injection Malicious content in tool outputs hijacks agent Input sanitization, sandboxed execution
Tool Poisoning Compromised MCP servers return malicious instructions Server allowlists, output validation
Data Exfiltration Agent leaks sensitive context to external tools Output filtering, egress controls
Agent Impersonation Rogue agent spoofs trusted agent identity A2A auth, signed Agent Cards
Runaway Loops Agent gets stuck in infinite tool-call loops Max steps, circuit breakers, timeouts

Secure Agent Checklist

  • Sandbox tool execution (Docker, E2B, Daytona)
  • Validate all tool inputs and outputs
  • Implement max-steps and cost limits
  • Use least-privilege API scopes
  • Log all tool calls for audit trails
  • Human-in-the-loop for irreversible actions

9. Voice Agents

Platform Type Key Feature Best For
Retell AI Managed 500ms latency, interruption handling, CRM integrations Sales, support call centers
Vapi Managed 600ms latency, 100+ providers, phone numbers Developers building voice products
Bland AI Managed Enterprise-grade, call routing, post-call analysis High-volume outbound calling
LiveKit Agents Open Source Real-time audio/video, STT+LLM+TTS pipeline Self-hosted, custom voice agents
OpenAI Realtime API API Native voice, vision, interruption detection GPT-4o voice integration
ElevenLabs Conversational Managed Ultra-realistic voices, 32 languages High-fidelity voice quality

10. Agent Deployment

Serverless / Cloud

Platform Key Feature Best For
Modal GPU serverless, sub-second cold starts, cron Python-native, batch agent jobs
Replicate Model hosting, prediction API, fine-tuning ML model deployment, any framework
Together AI Fast inference, 100+ models, fine-tuning Open model inference at scale
Fireworks AI Fastest open model inference, FireFunction Low-latency tool-calling agents
E2B Sandboxed code execution, 150ms boot Safe code execution inside agents
Daytona Elastic AI code infra, secure workspaces Coding agent sandboxes

Self-Hosted

Platform Key Feature Best For
BentoML Model serving, batching, async ML engineers, custom inference
Ollama 100+ models, simple CLI, GPU support Local development, privacy-first
LocalAI OpenAI-compatible API, any model Drop-in local OpenAI replacement
vLLM High-throughput serving, PagedAttention Production local inference at scale

11. RAG & Knowledge

Orchestration Frameworks

Tool Stars Key Feature Best For
LlamaIndex 37K+ Advanced RAG, structured queries, agents Complex document understanding
Haystack 18K+ Production NLP pipelines, modular Enterprise search, QA systems
DSPy 21K+ Programmatic LM optimization, signatures Prompt optimization, research

Vector Databases

Database Stars Key Feature Best For
Qdrant 22K+ Rust, payload filtering, sparse+dense Production, fast filtered search
Chroma 17K+ Embedded, developer-friendly Prototyping, local development
Weaviate 12K+ GraphQL, multi-tenancy, hybrid search Enterprise, multi-tenant SaaS
Milvus 32K+ Cloud-native, billion-scale High-scale production
pgvector 14K+ PostgreSQL extension PostgreSQL shops, no new infra
Pinecone Managed, serverless, real-time upserts Fully managed, no ops burden

12. Key Stats (March 2026)

Metric Value Source
Enterprises with agents in production 57% McKinsey State of AI 2026
Fortune 500 deploying multi-agent workflows 73% Gartner Q1 2026
SWE-bench Verified SOTA 80% (OpenHands + Claude) SWE-bench.org
SWE-bench Pro SOTA 23.7% (Devin 2.0) Cognition
MCP servers live 10,000+ Anthropic
MCP SDK downloads 97M Anthropic
A2A protocol partners 100+ Google
Projected agent market by 2028 $450B Grand View Research
Voice agent market CAGR 34% MarketsandMarkets
Average agent task completion improvement (multi-agent vs single) 90.2% Stanford HAI 2026

Model Benchmarks (March 2026)

Model MMLU HumanEval MATH Context
Claude Sonnet 4 90.2% 92.4% 86.1% 1M
GPT-4o (March 2026) 89.7% 90.1% 83.4% 128K
Gemini 2.0 Ultra 91.0% 88.3% 88.7% 1M
DeepSeek R2 88.9% 91.2% 90.1% 128K
Llama 4 Maverick 85.3% 84.1% 79.8% 1M

13. Related Awesome Lists

List Stars Focus
awesome-ai-agents 24K+ Broad AI agents ecosystem
awesome-mcp-servers 18K+ MCP server directory
500-AI-Agents-Projects 16K+ Real-world agent projects
awesome-llm-agents 8K+ LLM-powered agents research
awesome-langchain 7K+ LangChain ecosystem
awesome-local-llm 5K+ Local model running
awesome-openai-agents 12K+ OpenAI Agents SDK examples
awesome-agentic-coding 3K+ Coding agent tools

Community

Resource Members / Activity
r/LocalLLaMA 700K+ members
r/AIAgents 150K+ members
LangChain Discord 80K+ members
Hugging Face Discord 100K+ members
AI Engineer World's Fair Annual conference

Contributing

PRs welcome. Please follow these guidelines:

  • Add tools with GitHub stars, key feature, and best-for use case
  • Keep descriptions to one line — link to official docs for details
  • Group by the most relevant section
  • Update star counts when significantly outdated (>25% change)

See CONTRIBUTING.md for full guidelines.


License

MIT License — see LICENSE for details.


Last updated: March 2026 | Report an issue | Request addition

About

Complete directory of AI agents, frameworks, platforms, and tools — March 2026 Edition. Covers 100+ tools across coding agents, multi-agent patterns, MCP/A2A protocols, memory, observability, security.

Resources

License

Contributing

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors