AI Codex
Retrieval & KnowledgeRole-Specific

Do you actually need RAG? The decision most operators get wrong

Most teams jump to RAG because it sounds like the right answer. Half of them didn't need it. Here's how to know which situation you're in — before you build anything.

5 min read·RAG

The most common mistake in AI implementation isn't building the wrong thing. It's building something you didn't need at all.

RAG — connecting your documents and data to Claude — is genuinely powerful. It's also genuinely complex to do well. Before you build a RAG pipeline, it's worth being honest about whether you actually need one.

The question to start with

Can Claude answer your users' questions accurately without any additional information?

If your use case is general — writing, editing, summarising, brainstorming, answering questions about publicly known topics — Claude's training data is probably sufficient. RAG adds cost and complexity without meaningfully better answers.

If your use case requires specific knowledge Claude doesn't have — your product documentation, your internal processes, your customer data, events after Claude's training cutoff — then you genuinely need a way to get that information in front of Claude. RAG is the standard answer.

The tell: if you find yourself writing long paragraphs in your system prompt trying to describe your company's situation, you're fighting a symptom of needing RAG. The cure isn't a longer system prompt.

Three situations where RAG is the right call

Your knowledge base changes frequently. Price lists, product updates, policy changes, team information. RAG retrieves this fresh every time. A system prompt can only be updated when you push a new version.

Your information is too large to paste in. Anthropic's entire documentation. Your complete CRM history. Five years of support tickets. These can't live in a context window. RAG finds the relevant slice and passes only that.

You need answers tied to specific sources. Customer-facing products especially — where "according to our returns policy..." is a different kind of answer than "generally speaking, returns policies tend to..." RAG gives Claude something concrete to cite.

Three situations where RAG is overkill

You have a small, stable knowledge base. If your company's relevant information fits in 2,000 words and changes rarely, put it in your system prompt. A well-structured system prompt is faster to build, easier to maintain, and performs just as well.

Your questions are mostly general. A writing assistant, a brainstorming tool, an interview coach — these don't need your proprietary data. They need Claude to be good at Claude's job. Don't add complexity to a problem that doesn't exist.

You're in week one. RAG requires building retrieval infrastructure, chunking your documents thoughtfully, running evaluations. If you're still figuring out whether AI will work for your use case at all, build the simplest possible thing first. You can add RAG when you've validated the basic interaction.

The honest middle path

Start with a context-stuffed prompt. Take your knowledge base, paste the most relevant parts into Claude's context window, and see how well it performs. Claude's 200K token window is large enough to hold most small company knowledge bases entirely.

If that works well and your knowledge base is stable, you're done. Ship it.

If you find yourself hitting limits — too much content, things going stale, needing citations — that's the signal to build proper RAG. Now you're building it because you know you need it, not because it sounded like the right architecture.

Before you build: the five-question checklist

  1. Does Claude need information it wasn't trained on to answer your users' questions?
  2. Is that information more than roughly 50 pages of text?
  3. Does it change frequently enough that a system prompt would go stale?
  4. Do your users need answers tied to specific sources, not general knowledge?
  5. Have you validated the basic interaction works before adding retrieval complexity?

If you answered yes to most of these, build RAG. If you answered no to most, start simpler. The best RAG system is often the one you didn't build yet.


Further reading