AGENTS & ORCHESTRATION
AI Agents Explained
What they actually are, beyond the hype
In 30 Seconds
“AI agent” has become one of the most used and least understood terms in technology. People nod along in meetings, unsure what it actually means. Vendors use it to describe everything from simple chatbots to autonomous systems.
The core idea is simple: an AI system that can take actions, not just generate text. But the details matter enormously. Different communities use the same words to mean different things, and the architecture choices have real consequences.
This guide cuts through the confusion. Clear definitions. Key distinctions. Practical implications. No assumed knowledge.
What Is an AI Agent?
An AI system that can take actions, not just generate text.
A regular LLM (like ChatGPT or Claude in a browser) generates text responses. You ask a question, it answers. That's valuable, but it's fundamentally passive.
An AI agent does more. It can:
Use Tools
Search the web, execute code, query databases, call APIs, interact with software. The agent doesn't just suggest actions — it performs them.
Plan and Reason
Break down complex tasks into steps. Decide what to do next based on results. Handle multi-step work without human guidance at every stage.
Maintain State
Remember what's happened so far. Track progress toward goals. Build on previous steps rather than starting fresh each time.
Work Toward Goals
Not just responding to prompts, but pursuing objectives. “Book me a flight” triggers a sequence of searches, comparisons, and booking actions.
The key insight: Most “AI agents” in production today are actually agentic systems — LLMs with tool access and orchestration logic, operating with human oversight. Fully autonomous agents are still rare.
Agent vs Agentic: The Critical Distinction
This distinction has become crucial in enterprise adoption
AI Agent
- •Single-task or specific function
- •Rule-following with limited adaptation
- •Executes assigned steps
- •Example: Note summariser, booking assistant
Agentic AI
- •Multi-step, complex goals
- •Reasoning, planning, self-adjustment
- •Orchestrates workflows, coordinates agents
- •Example: Research system, code assistant
All agentic systems qualify as agents, but not vice versa. The term “agentic” emphasises agency — the capacity to plan, adapt, and coordinate.
Workflows vs Agents: Who Controls the Flow?
Anthropic makes a useful architectural distinction that clarifies most confusion:
Workflow
LLMs and tools orchestrated through predefined code paths. The sequence is determined by the developer. Predictable and consistent.
Example: Generate text → validate → translate → publish. Each step is fixed.
Agent
LLMs dynamically direct their own processes and tool usage. The model decides what to do next. Flexible but less predictable.
Example: “Research this topic” — the agent decides what to search, when to stop, what to synthesise.
The practical choice: Most organisations in 2025-26 are building workflows (predefined orchestration) rather than fully autonomous agents. This gives AI capability with human oversight and predictable behaviour.
The Skills Confusion
The word “skills” means completely different things in different contexts. This causes significant confusion when people use the same term for opposite architectures.
Anthropic Agent Skills (Official)
Filesystem-based resources that provide domain expertise. SKILL.md files with instructions, scripts, and reference materials. Claude auto-discovers and loads them when relevant.
Key characteristic: Deterministic structure, triggered loading. Published as an open standard in December 2025.
Vercel “Skills” (On-Demand Retrieval)
A retrieval system where the AI decides which documentation to fetch based on the task. Similar to RAG — the model guesses what knowledge might help and retrieves it.
Performance finding: 53% accuracy — no better than having no documentation. The retrieval decision introduced a failure point.
General Usage
Loosely refers to any capability: “This agent has coding skills,” “We need research skills.” Marketing language for features.
Key characteristic: Non-technical, describes outcomes not architecture.
The takeaway: When someone says “skills,” ask what they mean. Anthropic skills and Vercel skills are almost opposite architectures.
The Hierarchy: Agents, Knowledge, and Tools
How these concepts relate to each other
AGENTS
Systems that act — combine knowledge + tools + reasoning
KNOWLEDGE
Guides decisions
Skills
Procedural “how to”
Digital Experts
Expert judgment
TOOLS
Execute actions — search, code execution, API calls
Agents
Make decisions about what to do and when. Maintain state. Work toward goals.
Skills
Procedural knowledge that guides how to approach tasks. “When requesting supplier emissions data, use this template.”
Digital Experts
Captured expert judgment — what a specific expert would think. “What would a biodiversity specialist flag in this land use proposal?”
Tools
Deterministic capabilities. Same input, same output. Bash, file operations, web search, MCP servers.
Key distinction: Skills tell you how to do things. Digital Experts tell you what expert Y would recommend. They're complementary.
What Is MCP?
Model Context Protocol — the “USB-C for AI”
MCP is an open standard that enables AI agents to connect to external tools and data sources. Think of it as a universal plug: any MCP-compatible tool works with any MCP-compatible agent.
MCP Host
The AI application that uses external tools. Claude Code, VS Code extensions, custom applications.
MCP Client
Bridges communication between Host and Server. Handles the back-and-forth interactions.
MCP Server
The external service being accessed. Database, API, file system, web scraper.
Why MCP Matters
Before MCP, connecting an AI to external tools meant custom integration for each combination. Now, tool builders create one MCP server, and it works with any MCP host. Universal interoperability regardless of the underlying LLM.
Enterprise Reality Check
The hype says agents are transforming everything. The reality is more nuanced.
What's True
- • Agents can handle complex, multi-step tasks
- • They dramatically extend AI capability beyond chat
- • Early adopters are seeing real productivity gains
- • The technology has matured significantly in 2025
What's Overstated
- • Only 6% of enterprises trust agents fully
- • 40% of agent projects may be cancelled (Gartner)
- • Governance and security challenges are real
- • Most production systems need human oversight
The honest position: AI agents are genuinely powerful and getting better rapidly. But most organisations should start with agentic workflows(predefined orchestration with human oversight) before attempting fully autonomous agents.
Quick Reference: The Terminology Map
Same words, different meanings across communities
| Term | Engineering | Business/Product |
|---|---|---|
| Agent | LLM + tools + orchestration | AI feature that does things |
| Agentic | Agent capabilities + oversight | Marketing term |
| Skills | Filesystem expertise (Anthropic) or retrieval (Vercel) | Features, capabilities |
| Digital Experts | Captured expert methodology, activated as context | “AI that thinks like [expert]” |
| Tools | Function calling, MCP servers | Integrations |
| Memory | Context persistence | “The AI remembers me” |
| RAG | Retrieval-augmented generation | “AI searches documents” |
| Workflow | Predefined orchestration paths | Business process |
| Agent Teams | Multiple agent instances coordinating via shared task list | “AI working in parallel” |
When in doubt, ask for clarification. The same word often means different things to different people in the same meeting.
Questions to Ask (Yourself or Vendors)
About Architecture
- • Is this a workflow or a fully autonomous agent?
- • How does the system decide what context to use?
- • What happens when it misses relevant information?
- • How is state maintained across sessions?
About Governance
- • What's the human oversight model?
- • What guardrails are in place?
- • How do you measure success?
- • What can go wrong, and how is it handled?
Clarity Over Hype
AI agents represent a genuine capability shift — but understanding what they actually are (and aren't) matters more than adopting the latest terminology.
We help organisations cut through the confusion, understand the architecture choices, and build AI capability that actually works.
Where To Go Next
The Skills Paradigm
Deep dive into skills-based architecture: packaged expertise that compounds. How skills extend agents — and now agent teams.
Context Engineering
The discipline that powers agents. What AI knows, when it knows it, and how that knowledge is structured.
Passive Context Architecture
Why always-present context beats on-demand retrieval. The research behind 100% vs 53% accuracy.
Context Drift
The persistent challenge in AI memory. Why context degrades and what actually works for multi-session continuity.
Disclaimer: This content is for general educational and informational purposes only. The AI landscape evolves rapidly. Definitions and best practices referenced here reflect publicly available sources as of February 2026. For specific guidance, consult qualified professionals.