Skip to main content

Overview

Source metrics track how AI models use and cite content sources when generating responses. These metrics reveal which domains have content authority — the ability to influence what AI search engines say. Sources are analyzed at two levels:
  • Domain level — aggregated across all URLs from a domain (e.g., all pages on techcrunch.com)
  • URL level — individual page performance (e.g., techcrunch.com/article-1)

Citation count: total references to a source

Definition: Total number of times a source is explicitly referenced in AI responses.

Formula

Citation count = Total citations of the domain (or URL)
Each citation in a response counts separately. If one response cites two different URLs from the same domain, that’s two citations for the domain.
Domain: techcrunch.com
Time period: Last 7 days

Response 1: Cited techcrunch.com/article-1, techcrunch.com/article-2 (2 citations)
Response 2: Cited techcrunch.com/article-1 (1 citation)
Response 3: Cited techcrunch.com/article-3 (1 citation)
Response 4: No citations from techcrunch.com

Citation count = 2 + 1 + 1 = 4
Content from techcrunch.com was explicitly cited 4 times across 3 responses.

Average citations per response

Definition: The average number of times a source is cited per AI response where it appears.

Formula

Average citations = Total citations of source / Responses where source was cited
Higher values indicate deeper trust — when an AI model uses the source, it tends to cite multiple pages or cite it multiple times.
Domain: techcrunch.com
Time period: Last 7 days

Response 1: 2 citations from techcrunch.com
Response 2: 1 citation from techcrunch.com
Response 3: 1 citation from techcrunch.com

Total: 4 citations across 3 responses

Average citations = 4 / 3 = 1.33
When AI uses techcrunch.com, it cites an average of 1.33 unique URLs from the domain per response.

Used: how often a source appears

Definition: The share of AI responses that cited this source within the selected time window. The same definition applies at both domain and URL level.

Formula

Used (%) = (Responses that cited the source / Total responses analyzed) x 100
Citation depth — how many URLs from the same domain show up when it’s cited — is reported separately as average citations per response (defined above).
Domain: techcrunch.com
Time period: Last 7 days

Response 1: Cited 2 URLs from techcrunch.com
Response 2: Cited 1 URL from techcrunch.com
Response 3-100: No citations from techcrunch.com

Responses citing techcrunch.com = 2
Total responses analyzed = 100

Used = (2 / 100) x 100 = 2%
techcrunch.com appeared as a source in 2% of AI responses for this project. Across those 2 responses, it averaged 1.5 unique URLs cited per response (reported separately as avg citations).
URL: techcrunch.com/article-1
Time period: Last 7 days

Response 1: Cited this URL
Response 2: Cited this URL
Responses 3-100: Did not cite this URL

Used = (2 / 100) x 100 = 2%
This specific article was cited in 2% of responses, regardless of how many times it was cited within each one.
The “used” metric is your share-of-voice equivalent for sources: it tells you how often AI reaches for this source when answering questions in your space.

Source categories

Topify.ai classifies cited domains into categories to help you understand the source landscape:
CategoryExamplesDescription
Corporateapple.com, tesla.comOfficial brand and company websites
Editorialnytimes.com, techcrunch.comNews outlets and journalism
Institutional.gov, .edu, who.intGovernment, academic, and institutional sites
Referencewikipedia.org, stackoverflow.comKnowledge bases and reference platforms
UGCreddit.com, youtube.com, medium.comUser-generated content platforms
OtherEdge cases, parked domainsEverything that doesn’t fit the above
These categories help you understand whether AI providers are citing authoritative sources, user-generated content, or your competitors’ sites when answering prompts related to your brand.

Using sources data to improve your strategy

Sources data helps you answer questions like:
  • Which of your pages are already being cited? Look for your domain in the top domains list. High citation counts mean AI providers trust your content.
  • Where are the gaps? If competitor domains are cited frequently but yours isn’t, you need content that covers those topics.
  • What content types perform best? Compare citation rates across source categories to understand what AI providers prefer to cite.
  • Which pages should you optimize? URLs with high “used” counts are already influencing AI responses — improving them could increase your visibility further.

FAQ

How do I get my own pages cited more often?

Look at the domains AI currently cites most for your prompts and study what they have in common: structured headings that match the question, statistics and named numbers, citations to authoritative sources, and a confident tone. Then write content that does those things better. The AI agent’s geo_optimize skill applies these techniques to existing pages automatically — see Using the AI agent.

My domain isn’t in the source list — why?

Either AI providers haven’t picked up your pages yet, or they have but cite other sources first when answering your tracked prompts. Both are content problems, not data problems: you can fix them by publishing pages targeted at the prompts where your domain is missing, or by improving the pages you already have so they’re more citable.