CONTEXT ENGINEERING

Passive Context Architecture

Why always-present beats on-demand

In 30 Seconds

There are two ways to give AI the information it needs: load it upfront (passive context) or fetch it when needed (on-demand retrieval). Most teams assume retrieval is smarter. The research says otherwise.

Vercel's Next.js team ran rigorous evaluations comparing these approaches. The result: passive context achieved 100% accuracy where on-demand retrieval achieved 53%.

The insight: When information is always present, there's no decision point that can fail. No retrieval logic to get wrong. No ordering issues. Just consistent availability.

The Research

Vercel AGENTS.md Evaluation (January 2026)

Vercel's Next.js team tested how AI coding agents perform with different context configurations. They compared baseline performance against various approaches for providing project-specific information.

| Accuracy | Approach | Configuration |
| 53% | No documentation | Baseline |
| 53% | On-demand retrieval | Skills system |
| 79% | Retrieval + instructions | Enhanced skills |
| 100% | Passive context | AGENTS.md file |

The striking finding: On-demand retrieval performed no better than having no documentation at all. The retrieval system existed, but it didn't help. Only when information was passively present did performance improve.

Why Passive Context Wins

Three fundamental advantages over on-demand retrieval

1. No Decision Point

On-demand retrieval requires a decision: “What information do I need for this query?” That decision can be wrong. The model might not realise it needs certain context. It might retrieve the wrong documents. It might retrieve the right documents in the wrong order.

With passive context, there's no decision to get wrong. The information is already there. Every time.

2. Consistent Availability

Retrieval systems are probabilistic. They might return relevant documents 80% of the time, or 60%, or 40%. The quality varies by query, by phrasing, by the state of the vector database.

Passive context is deterministic. The same information is present on every turn. No variance. No “sometimes it works” frustration.

3. No Ordering Issues

With retrieval, critical information might arrive too late in the reasoning process. The model starts generating before realising it needs more context. By the time retrieval happens, the response is already partially committed.

Passive context is present from the first token. The model reasons with full information from the start.

The pattern: Retrieval adds complexity and variance. Passive context adds reliability and consistency. For core information, reliability wins.

The Tradeoff: Why Not Load Everything?

If passive context is better, why not just load all available information? Because context windows have effective limits that are smaller than their technical limits.

The Memento Limit

Research suggests effective reasoning capacity is around 100K tokens, even when context windows are technically larger. Beyond this, performance degrades.

A 200K context window doesn't give you 200K of useful reasoning space. It gives you 100K of effective space with increasing noise.

Lost in the Middle

Models pay more attention to the beginning and end of context. Information in the middle gets weighted less, even when it's critical.

More context can mean important information gets buried where the model is less likely to use it effectively.

The goal isn't maximum context. It's the right context. Passive for what matters most. Retrieval for everything else.

Tiered Context Architecture

The pattern that balances passive reliability with retrieval flexibility

| Tier | Type | Token Budget | What Goes Here |
| Tier 0 | Passive | ~300 tokens | Compressed state: current status, key metrics, active items |
| Tier 1 | Passive | ~1,000 tokens | Active context: navigation, recent decisions, current focus |
| Tier 2 | On-demand | Variable | Domain knowledge: loaded when topic requires it |
| Tier 3 | Retrieval | As needed | Archive: historical, rarely accessed |

Passive Foundation

Tier 0 and Tier 1 are always loaded. This is your passive context. Keep it lean (~1,300 tokens total) but ensure it contains everything AI needs to orient itself and navigate effectively.

Retrieval for Depth

Tier 2 and Tier 3 use selective retrieval. Navigation paths in Tier 1 point to relevant Tier 2 content. This gives you depth without bloat.

Implementation Patterns

Practical approaches for passive context systems

The Project File Pattern

A single file (CLAUDE.md, AGENTS.md, or similar) at project root containing everything AI needs to work effectively in that context.

Typical contents:

  • Project description and purpose
  • Key decisions and constraints
  • Build/test commands
  • Code conventions
  • Current focus areas

The MEMORY + CONTEXT Pattern

Two complementary files: MEMORY.md for compressed state (~300 tokens), CONTEXT.md for active context and navigation (~1,000 tokens).

The split:

  • MEMORY: “Where are we?” (status, metrics, active items)
  • CONTEXT: “How do I work here?” (navigation, decisions, focus)
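As an illustration, a compressed MEMORY.md within the ~300-token budget might look like this (the project details are invented):

```markdown
# MEMORY

Status: v2 migration in progress (step 3 of 5)
Key metrics: p95 latency 180ms · error rate 0.2%
Active items:
- Finish auth middleware refactor
- Unblock staging deploy (waiting on DNS)
Last session: fixed flaky checkout test; next, wire up retries
```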

The Navigation Hub Pattern

Passive context includes a navigation table: “When you need X, read Y.” This creates predictable paths from topics to relevant files.

Example:

| Topic | Read |
| pricing | docs/pricing-rules.md |
| deployment | docs/deploy-guide.md |
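A minimal sketch of how a harness could turn such a table into a topic-to-file lookup. The parser assumes the two-column pipe format shown above:

```python
def parse_navigation(table: str) -> dict[str, str]:
    """Parse a two-column '| Topic | Read |' table into a lookup dict."""
    routes = {}
    for line in table.strip().splitlines():
        cells = [c.strip() for c in line.strip().strip("|").split("|")]
        # Skip the header row; keep data rows with exactly two columns.
        if len(cells) == 2 and cells[0].lower() != "topic":
            routes[cells[0]] = cells[1]
    return routes


nav = parse_navigation("""
| Topic | Read |
| pricing | docs/pricing-rules.md |
| deployment | docs/deploy-guide.md |
""")
# nav maps "pricing" to "docs/pricing-rules.md"
```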

The Token Budget Pattern

Explicit limits on each passive context file. When a file exceeds its budget, compress it. Move detail to Tier 2 and keep pointers in Tier 0-1.

Enforcement:

  • Tier 0: Max 300 tokens (hard limit)
  • Tier 1: Max 1,000 tokens (soft limit)
  • Review weekly, compress as needed
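Budget checks can be automated with a rough heuristic. The sketch below uses ~4 characters per token, a common approximation rather than an exact tokenizer, and the file names are assumptions:

```python
from pathlib import Path

# Approximate token budgets per passive file (Tier 0 and Tier 1).
BUDGETS = {"MEMORY.md": 300, "CONTEXT.md": 1000}


def estimate_tokens(text: str) -> int:
    """Rough estimate: ~4 characters per token (not model-exact)."""
    return len(text) // 4


def check_budgets(root: str = ".") -> list[str]:
    """Return a warning for each passive file that exceeds its budget."""
    warnings = []
    for name, budget in BUDGETS.items():
        path = Path(root) / name
        if not path.exists():
            continue
        tokens = estimate_tokens(path.read_text())
        if tokens > budget:
            warnings.append(f"{name}: ~{tokens} tokens (budget {budget})")
    return warnings
```

Run as part of the weekly review: an empty list means all passive files are within budget; anything else is a prompt to compress and move detail to Tier 2.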

When to Use What

Use Passive Context For

  • Identity and behaviour rules (always needed)
  • Current project state (changes, but always relevant)
  • Navigation pointers (how to find deeper content)
  • Recent decisions (context that's frequently referenced)
  • Session handoff state (what to pick up from last time)

Use Retrieval For

  • Large knowledge bases (too big for passive loading)
  • Historical archives (rarely needed)
  • Domain-specific content (only relevant for certain queries)
  • Reference documentation (detailed specs, APIs)
  • Content that varies by user/session

The combination is powerful: Passive foundation + selective retrieval. Reliability where it matters most. Flexibility where you need depth.

Common Mistakes

Mistake 1: No Passive Context at All

Relying entirely on retrieval. Every query starts with a search. Result: inconsistent baseline, variance in quality, 53% performance.

Fix: Establish a passive foundation, even if it's just 500 tokens.

Mistake 2: Too Much Passive Context

Loading everything passively to avoid retrieval complexity. Result: bloated context, lost-in-the-middle problems, degraded reasoning.

Fix: Enforce token budgets. Compress aggressively. Move detail to Tier 2.

Mistake 3: Stale Passive Context

Setting up passive context once and never updating it. Result: AI references outdated information, makes contradictory decisions.

Fix: Weekly review cadence. Update Tier 0 after every significant change.

Mistake 4: No Navigation to Tier 2

Passive context that doesn't tell AI where to find deeper information. Result: AI either hallucinates or asks repeatedly for guidance.

Fix: Include navigation paths. “When you need X, read Y.”

Getting Started

1. Create a Tier 0 file

Start with ~300 tokens of compressed state. Current status, key metrics, active items. Name it MEMORY.md or include it at the top of your main context file.

2. Add navigation to Tier 1

Create a CONTEXT.md with ~1,000 tokens. Include a navigation table: “When topic X comes up, read file Y.” This creates predictable paths to deeper content.

3. Configure automatic loading

Ensure your AI tool loads Tier 0 and Tier 1 at session start. For Claude Code, this means CLAUDE.md. For other tools, AGENTS.md or equivalent.

4. Establish maintenance rhythm

Weekly: review passive context for staleness. After significant changes: update Tier 0. Monthly: audit token budgets and compress as needed.

Build Your Passive Foundation

The research is clear: passive context dramatically outperforms on-demand retrieval for core information. The implementation isn't complex – it's a matter of designing the right tiered architecture and maintaining it.

We help organisations design and implement passive context systems that achieve consistent AI performance without retrieval complexity.

Disclaimer: This content is for general educational and informational purposes only. Research findings cited reflect publicly available sources as of January 2026. For specific guidance on context architecture implementation, please consult appropriately qualified professionals.