Agent Surface & Observability
AI-representation observability — watching how assistants describe you
The blind spot
You almost certainly measure how Google sees you — rankings, impressions, clicks. You almost certainly don't measure how ChatGPT, Claude, Gemini, or Perplexity describe you. Yet for a growing share of your prospects, the assistant's summary is the first impression. If it's wrong — outdated pricing, a discontinued product, a competitor named as the market leader — you're losing deals you never see.
This is the observability half of the Agent Surface & Observability pillar, criterion AS3. It closes the loop on share-of-model: citeability is how you earn a place in the answer; observability is how you check what that answer actually says.
What to measure
Five things, in rough priority order:
- Accuracy — is what the assistant says about you factually correct?
- Citation — does it name and link you, or describe you anonymously?
- Share-of-model — across your key questions, how often do you appear versus competitors?
- Hallucinations — does it invent products, claims, or policies you don't have?
- AI-referral traffic — how many visits now arrive from assistant links, and to which pages?
How to run a prompt-test panel
You don't need a platform to start — you need a process:
- Write the 20 questions a real prospect would ask an assistant in your category, from "best tool for X" to "does your-brand support Y?"
- Run them across the major assistants, the same way each time.
- Log the verbatim responses: what's said, who's cited, what's wrong.
- Repeat on a cadence — monthly is a sensible start — so you see movement, not just a snapshot.
That logged, dated record is your baseline and your evidence. It's also exactly the kind of repeatable check the Phase 3 automated audit is built to run for you.
Closing the loop: fix it at the source
Observability only pays off if it drives action. When you find a misrepresentation, the fix is almost never to argue with the model — it's to correct the substrate the model learned from:
- Wrong facts? Publish the correct, dated, canonical statement prominently and in structured data.
- Anonymous mentions? Strengthen your entity presence and authorship signals so you're attributable.
- Stale pricing or products? Update the source pages and their
dateModified; engines weight freshness.
Fix the source, re-test next cycle, confirm the answer moved. That feedback loop — measure, correct, re-measure — is the whole discipline.
What to do this week
- Write your 20 prospect questions and run them across the major assistants today.
- Log every response verbatim, flagging inaccuracies and missing citations.
- Pick the single worst misrepresentation and fix it at the source this week.
- Diarise a re-test for next month — observability is a loop, not a one-off.
Get the Radar
Monthly updates on AI web standards, so you don't have to track them yourself.
No spam. Unsubscribe anytime. We handle your email per our Privacy Policy.