The 2025 AI Agent Stack: A Field Guide for Builders
August 17, 2025
ChatGPT helped generate this list of AI Agent technologies. There are variety of tools for the same job currently. It will be interesting to see which ones stand the test of time, especially since there are already 'layers' of technology emergin for building AI Agents.
Agent Frameworks & SDKs (code-first building blocks)
- OpenAI Agents SDK (Python & TypeScript) — Minimal, model-native primitives for single or multi-agent flows, tools, state, and patterns. Ships with examples and MCP support. ([OpenAI GitHub][1], [GitHub][2], [OpenAI][3])
- LangGraph (by LangChain) — Graph/state-machine orchestration for long-running, stateful agents; great for controllable loops, recovery, and human-in-the-loop. LangGraph Studio adds a visual IDE. ([LangChain AI][4], [LangChain Docs][5])
- LlamaIndex Workflows & Agents — Event-driven “Workflows” and multi-agent patterns that span RAG, extraction, and tools. ([LlamaIndex][6], [LlamaIndex][7])
- AG2 (formerly AutoGen) — Multi-agent conversations and coordination (open governance). FastAgency helps productionize AG2 projects. ([GitHub][8], [Microsoft GitHub][9])
- CrewAI — “Crews” of specialist agents with roles, tools, and handoffs; solid docs and real-world patterns. ([CrewAI][10])
- smolagents (Hugging Face) — Lightweight, code-first agents that “think in code”; minimal abstractions, easy to read and hack. ([Hugging Face][11], [GitHub][12])
- Semantic Kernel (Microsoft) — Model-agnostic SDK with an Agent Framework and patterns across .NET, Python, and Java. ([Microsoft Learn][13])
- Haystack Agents (deepset) — Tool-using agents integrated into Haystack’s RAG/search pipelines with provider-agnostic chat models. ([Haystack Documentation][14])
- PydanticAI — Type-safe agents with Pythonic ergonomics; first-class multi-agent handoff patterns. ([Pydantic AI][15])
- Vercel AI SDK (Agents) — TS-first SDK for building agents and UIs in Next.js/Node, with guides for Gemini/OpenAI and deployment on Vercel. ([Vercel][16], [Google AI for Developers][17])
- Agno (previously Phidata) — Full-stack framework for multi-agent systems with memory/knowledge/tools; nice built-in UI. ([GitHub][18], [docs.phidata.com][19])
- Langroid — Research-friendly multi-agent Python framework (CMU/UW lineage) with strong RAG patterns. ([GitHub][20], [langroid.github.io][21])
Cloud Agent Services (managed orchestration)
- Google Agent Development Kit (ADK) — Modular, model-agnostic kit for building/deploying agents; integrates with Vertex AI Agent Builder & Agent Space. ([Google GitHub][22], [Google Cloud][23])
- Vertex AI Agent Builder (Google Cloud) — Suite for building, evaluating, and deploying multi-agent experiences (Agent Garden, templates, codelabs). ([Google Cloud][24], [Google Codelabs][25])
- Agents for Amazon Bedrock — Configure autonomous agents with knowledge bases, API action groups, and multi-agent collaboration. ([AWS Documentation][26], [Amazon Web Services, Inc.][27])
- Azure AI Agent Service (Azure AI Foundry) — Managed service to host/orchestrate agents with governance, tracing, and REST SDKs. ([Microsoft Learn][28])
- Cohere Tool-Use & Agents — Chat endpoint with function calling, multi-step planning, strict tool schemas, and enterprise deploys. ([Cohere Documentation][29])
Protocols & Connectors (standardizing tool access)
- Model Context Protocol (MCP) — Open protocol (Anthropic) that standardizes connecting LLMs to tools/data (think “USB-C for AI”). ([Anthropic][30], [Model Context Protocol][31])
- MCP Toolbox for Databases (Google) — Open-source MCP server exposing safe, typed DB tools (Cloud SQL, Spanner, BigQuery, Postgres, etc.). Great with ADK. ([GitHub][32], [Google Cloud][33], [Google APIs][34])
Execution & Browser Control (give agents hands)
- E2B — Secure cloud sandboxes so agents can run code (JS/Python SDKs), ideal for “code-interpreter” and tool-heavy tasks. ([E2B][35])
- browser-use — Playwright-powered browser agent & optional MCP server for real web automation. ([GitHub][36])
- Composio — Unified connectors/skills layer (250+ apps) with new Python/TS SDKs; plugs into popular agent frameworks. ([docs.composio.dev][37], [Composio][38])
- Zapier Natural Language Actions (AI Actions) — Natural-language API to 6000+ SaaS apps—handy for agents that must “do things” in your SaaS stack. ([Zapier Docs][39])
Realtime & Voice Agents (instant, multimodal UX)
- OpenAI Realtime API — Low-latency, speech-to-speech, multimodal interactions over WebRTC or WebSockets. ([OpenAI Platform][40])
- LiveKit Agents — Open-source framework + cloud for voice/video agents; mix-and-match STT/LLM/TTS providers; Python/Node SDKs. ([LiveKit Docs][41], [GitHub][42])
- Vapi — Developer platform for phone-call voice agents with pragmatic docs and provider integrations. ([Vapi][43])
- Retell AI — Production phone agents with prompt flows, custom voices, and analytics. ([Retell AI][44])
Guardrails & Structured Outputs (control and correctness)
- NVIDIA NeMo Guardrails — Programmable “rails” via Colang to steer topics/flows and intercept risky content. ([NVIDIA Docs][45])
- Guardrails AI — Input/Output “validators” hub and server to detect/mitigate risk categories in your agent pipelines. ([guardrails][46])
- OpenAI Structured Outputs — Enforce JSON Schema-validated responses for reliable tool calls and multi-step workflows. ([OpenAI Platform][47])
- Outlines (structured generation) — Constrain token generation to JSON/regex/grammar for hard guarantees; works with multiple backends. ([Dottxt AI][48], [LangChain][49])
Observability, Evals & Tracing (ship with confidence)
- LangSmith — Unified tracing/evals/monitoring for agent apps (works with or without LangChain). ([LangSmith][50], [LangChain][51])
- Langfuse — Open-source LLM engineering platform (traces, evals, prompts, metrics); self-host or cloud. ([Langfuse][52])
- Arize Phoenix — Open-source observability and evaluation for LLM/agent systems; strong experiment workflows. ([Arize AI][53])
- AgentOps — Agent-focused observability (session replays, metrics) with quick integrations for frameworks and even ADK. ([AgentOps][54], [GitHub][55], [Google GitHub][56])
Specialist Stacks (purpose-built agents)
- OpenHands (developer agents) — Open platform for software-engineering agents that read/write code, run commands, browse, and work against benchmarks. ([GitHub][57], [All Hands Docs][58])
How to choose (fast heuristics)
- You want tight control (“leashed” step in a larger workflow): Start with OpenAI Agents SDK, PydanticAI, or LlamaIndex Workflows. Add Structured Outputs and NeMo Guardrails for predictability. ([OpenAI GitHub][1], [Pydantic AI][15], [LlamaIndex][6], [OpenAI Platform][47], [NVIDIA Docs][45])
- You need robust state machines & supervision: LangGraph (optionally with LangGraph Studio) excels at explicit state, retries, and human checkpoints. ([LangChain AI][4], [LangChain Docs][5])
- Enterprise managed service: Pick your cloud: Vertex AI Agent Builder (+ ADK), Azure AI Agent Service, or Agents for Bedrock depending on stack/regulatory needs. ([Google Cloud][24], [Google GitHub][22], [Microsoft Learn][28], [AWS Documentation][26])
- Your agent must actually do stuff in SaaS/tools: Add Composio or Zapier NLA as a universal skill layer. ([docs.composio.dev][37], [Zapier Docs][39])
- It must browse or run code safely: Combine browser-use (web actions) with E2B (secure sandboxes). ([GitHub][36], [E2B][35])
- You need voice now: LiveKit Agents or Vapi for end-to-end realtime voice; OpenAI Realtime for the model side. ([LiveKit Docs][41], [Vapi][43], [OpenAI Platform][40])
- Production discipline: Instrument early with LangSmith/Langfuse/Phoenix and consider AgentOps for agent-specific replays. ([LangSmith][50], [Langfuse][52], [Arize AI][53], [AgentOps][54])
References
[1]: "Agents - OpenAI Agents SDK"[2]: "openai/openai-agents-python: A lightweight, powerful ..."
[3]: "New tools for building agents"
[4]: "LangGraph - GitHub Pages"
[5]: "Overview - Docs by LangChain"
[6]: "Workflows"
[7]: "Introducing Agentic Document Workflows"
[8]: "ag2ai"
[9]: "AutoGen 0.2 - Microsoft Open Source"
[10]: "Introduction - CrewAI"
[11]: "smolagents"
[12]: "smolagents: a barebones library for agents that think in code."
[13]: "Semantic Kernel Agent Framework"
[14]: "Agents - Haystack Documentation"
[15]: "Agents"
[16]: "AI SDK"
[17]: "Market Research Agent with Gemini and the AI SDK by Vercel"
[18]: "agno-agi/agno: Open-source framework for building multi ..."
[19]: "Phidata - Agno"
[20]: "langroid/langroid: Harness LLMs with Multi-Agent ..."
[21]: "langroid - GitHub Pages"
[22]: "Agent Development Kit - Google"
[23]: "Develop an Agent Development Kit agent | Generative AI ..."
[24]: "Vertex AI Agent Builder overview"
[25]: "Building AI Agents with Vertex AI Agent Builder"
[26]: "Automate tasks in your application using AI agents"
[27]: "Amazon Bedrock Agents"
[28]: "Azure AI Foundry documentation"
[29]: "Basic usage of tool use (function calling)"
[30]: "Model Context Protocol (MCP)"
[31]: "Model Context Protocol: Introduction"
[32]: "MCP Toolbox for Databases is an open source ..."
[33]: "New MCP integrations to Google Cloud Databases"
[34]: "Introduction | MCP Toolbox for Databases"
[35]: "E2B - Code Interpreting for AI apps"
[36]: "browser-use/browser-use: 🌐 Make websites accessible for ..."
[37]: "Composio Docs: Welcome to Composio"
[38]: "Our new SDK is in beta"
[39]: "AI Actions"
[40]: "Realtime API Beta"
[41]: "LiveKit Agents"
[42]: "livekit/agents: A powerful framework for building realtime ..."
[43]: "Vapi: Introduction"
[44]: "Retell AI: Introduction"
[45]: "NVIDIA NeMo Guardrails"
[46]: "Introduction | Your Enterprise AI needs Guardrails"
[47]: "Structured model outputs - OpenAI API"
[48]: "Outlines"
[49]: "Outlines"
[50]: "Get started with LangSmith | 🦜️🛠️ LangSmith"
[51]: "LangSmith"
[52]: "Langfuse Documentation"
[53]: "Arize Phoenix"
[54]: "AgentOps AI agents"
[55]: "AgentOps-AI/agentops: Python SDK for AI agent monitoring ..."
[56]: "AgentOps - Agent Development Kit - Google"
[57]: "OpenHands: Code Less, Make More"
[58]: "Introduction - All Hands Docs"