Analytics

How to measure AI visibility

A simple monthly check — the same questions, the same tools, a spreadsheet — that tells you whether AI engines know your organization, describe it accurately, and send people your way. No paid software required.

By Matt Updated July 9, 2026

Quick answer

What is AI visibility, and how do you measure it?

AI visibility is how often — and how accurately — AI tools like ChatGPT, Gemini, and Google AI Overviews mention or cite your organization when people ask relevant questions. You measure it by asking a fixed set of 10–15 real questions in each tool every month and recording whether you were mentioned, whether the information was accurate, and whether your website was cited.

Because AI answers vary from run to run, no single check is definitive. The value is in the trend: the same questions, asked the same way, month after month.

Why measuring this matters now

More of your audience's first impressions are being formed inside AI answers instead of on your website. A 2025 Pew Research Center study that tracked the real browsing behavior of 900 U.S. adults found that when a Google search produced an AI summary, people clicked a traditional result only 8% of the time — roughly half the rate of searches without one. When someone asks an AI tool "which food banks near me accept weekend volunteers?" or "is this association's certification respected?", the answer they read may be the only impression they ever get.

Your traditional analytics can't see any of this. Google Analytics shows visits to your site; Search Console shows clicks from Google — but neither tells you what ChatGPT said about your programs yesterday, or whether it named a peer organization instead of you. That's the gap this measurement fills. (One encouraging detail from the same Pew study for public-sector readers: government websites were three times more common as sources in AI summaries than in regular search results — engines actively favor authoritative institutional sources, which is exactly what most organizations in this guide's audience are.)

What a monthly check gives you

Early warning. You find out that an engine is describing a discontinued program, naming the wrong director, or recommending a peer instead of you — before your audience quietly acts on it.
A priority list. The specific questions where you're absent tell your small team exactly which pages to improve next, instead of guessing.
Proof for leadership. A trendline of mentions and citations is the evidence that makes the case for this work — far more persuasive than "AI is important now." (More on presenting it in reporting results to leadership.)

The four numbers worth tracking

Each answers a different question. Together they're your visibility picture.

Mention rate: Of your test questions, how many produced an answer that named your organization at all? This is the baseline — proof the engines know you exist. (More in understanding AI mentions.)
Share of voice: When engines answer questions in your space, how often do they name you versus your closest peers? If a question about services in your county gets answered with three organizations and you're never one of them, that's the finding. (Marketers call this "share of model" or "answer share of voice.")
Citation rate: Being named is good; being linked is better. How often does the answer cite your website as a source? Citations are what send high-intent visitors your way. (How to track them in depth: AI citation tracking.)
Accuracy: Is what the engines say about you actually true and current? Right programs, right locations, right leadership? High visibility with wrong information is worse than low visibility — this is the number that most often demands immediate action.

How to measure it, step by step

The whole routine takes one to two hours a month once it's set up.

1

Write your question list — once

Ten to fifteen real questions, phrased the way people actually ask. Cover three kinds: topic questions your organization should be an answer to ("where can families get food assistance in [county]?"), service questions about what you offer ("does the library have free meeting rooms?"), and branded questions about you by name ("is [organization] a reputable charity?"). Write them down and don't reword them later — consistency is what makes month-to-month comparison meaningful.
2

Pick your engines

Four or five covers the landscape: ChatGPT, Google (watching for AI Overviews), Gemini, and Perplexity — add Claude if you have the time. Free tiers are fine for all of them. Use a fresh or logged-out session where possible so your own history doesn't color the answers.
3

Run the questions and record what you find

Each month, ask every question in every engine and record four things per answer in a spreadsheet: mentioned (yes/no), accurate (yes/partly/no), website cited (yes/no), and any peer organizations named instead. Note the date. Our AI visibility scorecard gives you a ready-made structure for exactly this. Larger organizations tracking hundreds of queries eventually graduate to paid monitoring platforms — but start manual; you'll understand your results far better for having read the actual answers.
4

Turn the tallies into your four numbers

Simple division: mention rate is answers-naming-you over total answers. Share of voice is your mentions over all organization mentions in your topic questions. Citation rate is answers-linking-your-site over total. Accuracy is the share of your mentions that were fully correct. No formulas fancier than a percentage.
5

Watch the trend, not the snapshot

AI answers vary — ask the same question twice and you can get two different responses, and engines change behavior whenever their models update. So a single month means little; three months of the same questions means a lot. Chart the four numbers over time, and judge your content work by whether the lines move.

What measurement reveals in practice

Two representative scenarios showing why the numbers matter.

The association that wasn't in the answer

What the check found

A state professional association runs its monthly questions and finds that "best certification for [profession] in [state]" consistently surfaces a national competitor — sourced from two third-party career sites where the association has no presence at all.

What it changed

Instead of writing more blog posts, the team spends the quarter getting accurately listed on those two career sites and rewriting its certification page to lead with a direct answer. The next quarter's checks show the association appearing alongside the competitor.

The lesson: measurement shows you where engines get their answers — which is often not your website, and not where you were planning to spend effort.

The clinic described as closed

What the check found

A community health clinic's branded question — "does [clinic] offer sliding-scale dental care?" — gets answered with "that program was discontinued." It wasn't; an engine is leaning on a stale 2023 news article.

What it changed

The clinic updates its program page with a dated, answer-first description, adds the program to its structured data, and requests re-indexing. Within two monthly checks, the answers correct themselves.

The lesson: the accuracy number is the one that protects real people — an inaccurate AI answer can turn someone away from a service they qualify for, and without measurement, nobody would ever know.

What this measurement can't tell you

Three honest limits to keep in mind as you read your own numbers.

The attribution gap. Someone can read about your organization in an AI answer, form an impression, and visit you directly a week later — and no analytics tool will connect those dots. If your direct traffic rises while search clicks fall, AI visibility may be working invisibly. (See AI referral traffic for what can be tracked.)

Answer variance. The same question can produce different answers on different runs — it's how these systems work, not a flaw in your method. This is why the routine uses fixed questions and monthly trends rather than treating any single answer as a verdict.

No standard benchmarks yet. This field is young enough that there's no credible "good" mention rate to aim for, and any vendor quoting one is guessing. Your meaningful comparisons are your own baseline (this month versus three months ago) and your closest peer organization — not an industry number.

Set up your measurement this week

Five steps from zero to a running baseline.

Write your 10–15 fixed questions30 min Topic, service, and branded questions, phrased the way your audience actually asks. Then freeze the wording.
Set up the recording spreadsheet15 min One row per question per engine: mentioned, accurate, cited, peers named, date. The scorecard is a ready template.
Run your first full check60–90 min Every question, every engine, fresh sessions. This month's numbers are your baseline — the "before" photo everything else gets compared to.
Fix anything dangerously wrong nowvaries If the check surfaced a factual error about your services or status, correct the source pages immediately — accuracy problems don't wait for next month.
Calendar the monthly repeat2 min Same questions, same engines, same spreadsheet, every month. The trend is the entire product of this work.

One evidence note for the skeptics on your team: this isn't folklore. The original academic research on generative engine optimization — Aggarwal et al., presented at KDD 2024 — found that content changes like adding citations, quotations, and statistics improved a source's visibility in AI-generated answers by up to 40% in benchmark testing. The measurement routine on this page is how you verify those effects on your own organization. And if you'd rather have this whole process run for you, tell me about your situation.

Common questions

Do I need a paid tool to measure AI visibility?

No. For most organizations, a monthly manual check of 10–15 questions across the major AI tools, recorded in a spreadsheet, gives you everything you need. Paid tracking platforms exist and make sense for large organizations monitoring hundreds of queries, but they are an upgrade, not a requirement.

Can I track AI visibility using Google Search Console?

Partially. Search Console counts impressions and clicks from Google AI Overviews inside its normal Search reports, but it does not separate them out, and it tells you nothing about ChatGPT, Gemini, Claude, or Perplexity. It is one useful signal, not a full picture.

How many test questions do I need?

Ten to fifteen is enough to start — a mix of questions about your topic area, your services, and your organization by name. What matters far more than the number is asking the exact same questions the same way every month, so your results are comparable over time.

How often do these numbers change?

Whenever an AI company updates its model or changes how its search retrieval works — which happens without notice. That is why a single check tells you little, and a monthly check of the same questions tells you a lot: you are measuring the trend, not the snapshot.

Does a drop in website traffic mean my AI visibility is low?

Not necessarily — it can mean the opposite. AI tools answer many questions directly, so people may be learning about your organization inside an AI response without ever clicking through. That is exactly why visibility inside the answers needs to be measured separately from traffic to your site.

Continue learning

Turn your numbers into a plan.

Visibility Audits

Get new guides as they're published

Subscribe and get a weekly email with new guides, tips, and important news affecting your AI search marketing. Unsubscribe at any time.

How to measure AI visibility

Why measuring this matters now

The four numbers worth tracking

How to measure it, step by step

Write your question list — once

Pick your engines

Run the questions and record what you find

Turn the tallies into your four numbers

Watch the trend, not the snapshot