AI citation vs. source URL — what counts as which
Two different signals AI engines emit, and why we treat them separately.
When AI answers a question, two distinct things can happen:
- The model can cite your brand in the prose ("Vercel offers instant rollback…").
- The model can list a source URL that links to your domain, even if your brand name never appears in the visible text.
We track both. They're not the same signal — being cited in the sentence the user reads is a stronger position than being a footnote URL the user might click. Below is the precise distinction we use.
Vocabulary note. Earlier versions of this product called the first signal a "mention." The category has since converged on "AI citation" for that meaning — we use the new label everywhere a human reads, while the underlying database column stays
is_mentioned(same boolean, same math, just a more recognisable name).
AI citation
An AI citation is your brand name or your tracked domain hostname appearing inside the AI's answer text. We detect it in two passes:
- Layer 1 — deterministic. A regex pass over the answer looking
for your hostname (
netlify.com, with or withoutwww.and HTTPS) and your brand stem (netlify). - Layer 2 — LLM classification. For each citation found, a fast
classifier assigns a quality bucket:
cited(40),recommended(70), ortop_recommended(90). Long-tail or ambiguous cases route to a small refine call; obvious cases stay deterministic.
An AI citation sets is_mentioned: true on that scan_result. The
quality bucket feeds the
GEO Score (AI Visibility Score) formula.
This is the signal that rolls up into Share of Voice (the percentage of tracked prompts where your brand was cited).
Source URL
A source URL is a link to a domain that appears in the engine's structured references array (Perplexity references, Google AI Mode references) or extracted by regex from the prose (the fallback list). Each scan_result stores two source-URL fields:
cited_domains[]— engine-native references (preferred, higher signal). Example: Perplexity returns[{"url": "https://example.com/x", ...}, …]and we collect the hostnames.competitor_mentions[]— regex fallback hits over the prose for domains not surfaced incited_domainsbut listed inline.
Source URLs don't directly affect the GEO Score for the cited brand — they affect Competitor tracking. For you, the relevant signal is whether you were AI-cited (above); for competitors, the relevant signal is whether their URL appears in the engine's source list (here).
Why split the two
Different products map to different patterns:
| Pattern | What you typically see |
|---|---|
| Brand-strong product (Vercel, Stripe) | High AI citations, high source URLs |
| URL-cited-but-anonymous tool (a dev library page) | Low AI citations, high source URLs |
| Generic page in a roundup (an "alternatives" listicle) | Zero AI citations of your brand, source URL goes to listicle, not you |
Treating them as one number would hide which kind of visibility you have — and the actions you take to improve are different. AI citations improve when AI knows about you (PR, bylines, brand pages). Source URLs improve when AI's preferred sources link to you (good docs, schema-marked content, guest posts on cited domains).
A practical example
Imagine your tracked domain is netlify.com — a real deployment platform we used as a stand-in for the running test scenario. Here's a snippet of an actual Google AI Mode answer for the buyer question "best deployment platform":
Vercel is the standard for Next.js and modern frontends, with Git-based preview deployments and instant rollback via immutable deployments.[1] Other strong options include Netlify[2] and Render[3].
[1] https://vercel.com/kb/guide/vercel-vs-netlify [2] https://www.netlify.com/ [3] https://render.com/docs/render-vs-vercel-comparison
For Netlify (the example tracked brand):
- AI citation. "Netlify" appears once in prose →
is_mentioned: true, qualitycited(0.40) — listed in passing as one option among several, not the top recommendation. - Source URL.
cited_domainscontainsnetlify.com→ Citation search finds this scan_result for the termnetlify, and Competitor tracking attributes an AI citation event to Vercel and Render (the other deployment platforms named in the answer).
Same answer, two parallel signals.
Read further
- GEO Score (AI Visibility Score) for what an AI citation is worth.
- Competitor tracking for what source URLs imply about who's beating you.
- Citation search to find the exact answer text behind any signal.