When Claude starts doing the work: what AI agents look like in practice
An agent isn't just a chatbot that can click buttons. It's a fundamentally different relationship between a human and an AI. Here's what that looks like when it's working.
Most interactions with Claude are transactional: you ask, Claude answers. An AI agent is something different — Claude working through a multi-step task on its own, making decisions along the way, and producing an outcome rather than just an answer.
Here's what that actually looks like when it works, and what to watch for.
A concrete example: a sales research agent
The task: before every sales call, produce a briefing on the prospect — their company, recent news, likely pain points, relevant competitors they might already use.
Old process: sales rep spends 30-45 minutes on LinkedIn, Crunchbase, the company's website, and Google News before each call.
Agent process: give Claude the prospect's company name and domain. Claude searches the web, reads recent press releases and news, checks for job postings (a good signal for growth areas and pain points), and looks at their product pages and pricing, then synthesises everything into a structured briefing. The whole thing runs while the rep is on their previous call.
The rep still reads the briefing and adds judgment. But the 30-minute research task is now a 2-minute review task.
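The workflow above can be sketched as a short pipeline. This is a minimal illustration, not a real implementation: `search_web` and the company name are hypothetical placeholders for whatever search tool and model call an actual agent would use.

```python
# Minimal sketch of the briefing pipeline. Every function here is a
# hypothetical stand-in; a real agent would call a search tool and a
# model for synthesis.

def search_web(query: str) -> list[str]:
    # Placeholder: a real agent would call a web-search tool here.
    return [f"result for: {query}"]

def build_briefing(company: str, domain: str) -> dict:
    # Gather the same signals the article lists: news, job postings,
    # and product/pricing pages.
    sources = {
        "news": search_web(f"{company} recent news"),
        "jobs": search_web(f"{company} job postings"),  # growth / pain-point signal
        "product": search_web(f"site:{domain} pricing"),
    }
    # A real agent would synthesise these into prose with a model call;
    # here we just assemble the structure the rep would review.
    return {
        "company": company,
        "sections": sources,
        "status": "ready for human review",
    }

briefing = build_briefing("Acme Corp", "acme.example")
print(briefing["status"])
```

The point of the structure is the last field: the pipeline ends with an output for a person, not an action.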
What makes this different from a regular prompt
The agent doesn't just answer one question — it completes a multi-step workflow. It decides what to search, reads the results, decides what's relevant, decides what to search next based on what it found, and synthesises the whole thing. At each step it's making judgment calls without human input.
That autonomy is what makes agents powerful. It's also what makes them require more care to set up.
The failure mode to watch for
Agents fail in specific ways that regular prompts don't. The most common: a step in the middle goes wrong (Claude retrieves the wrong information, misinterprets a result, or mishandles an edge case) and every subsequent step builds on the error. The final output looks plausible but is wrong in ways that are hard to catch.
This is why good agent implementations include:
- Checkpoints where a human reviews intermediate outputs
- Clear scope limits (what Claude can and can't do without approval)
- Logging of every step so you can diagnose failures
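The three safeguards above can be combined in one small harness. This is a sketch under assumptions, not a prescribed design: the action names, the allow-list, and the `checkpoint` function are all invented for illustration.

```python
import logging

logging.basicConfig(level=logging.INFO)
log = logging.getLogger("agent")

# Scope limit: an explicit allow-list of actions the agent may take
# without approval. Anything else is refused. (Names are hypothetical.)
ALLOWED_ACTIONS = {"search", "read", "summarise"}

def run_step(action: str, payload: str) -> str:
    if action not in ALLOWED_ACTIONS:
        raise PermissionError(f"'{action}' requires human approval")
    # Step-level logging: every step leaves a record for diagnosing
    # where a run went wrong.
    log.info("step=%s payload=%s", action, payload)
    return f"output of {action}"

def checkpoint(output: str, approved_by_human: bool) -> str:
    # The checkpoint: the agent produces output for review and never
    # acts on it until a person signs off.
    if not approved_by_human:
        raise RuntimeError("awaiting human review")
    return output

draft = run_step("summarise", "prospect briefing")
final = checkpoint(draft, approved_by_human=True)
```

An out-of-scope action like `run_step("send_email", ...)` fails loudly instead of silently proceeding, which is the behaviour the next paragraph argues for.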
Don't build an agent that takes irreversible actions without human approval. Start with agents that produce outputs for human review. Automate the approval step only after you've established that the outputs are reliably good.
Where to start
The best early agent use cases are research-heavy, produce a document or summary (not an action), and have a clear definition of what "done" looks like. Sales briefings, competitive research, summarising large volumes of information, monitoring for specific events.
The cases to approach carefully: anything that takes actions in external systems, sends communications on your behalf, or modifies data.
Further reading
- Agent SDK docs — Anthropic Docs