Web Text Fragments: Decoding Google AI Mode’s

AI Summary

Google AI Mode and Gemini citations use Web Text Fragments to highlight the exact sentence extracted from a source page. Analysis of 2, 318 URLs found no correlation between text position on a page and its selection for citation, with average citation depth reaching 2, 400 to 4, 600 pixels. These fragments reveal Google's extraction policy, favoring definition-style sentences, statistics with sources, and direct answers.

Every Google AI Mode and Gemini citation URL contains a #:~:text= Web Text Fragment anchor. Decode it and you can see the exact sentence Google extracted from the source page. This is the most underused citation-intelligence signal in 2026 – and it has produced one of the most counterintuitive findings in AI search research.

What Web Text Fragments are

Web Text Fragments (the #:~:text= URL syntax) are a W3C-track browser feature that lets a URL link directly to a specific phrase on a page. When you click a Google AI Mode citation, the browser scrolls to and highlights the exact sentence Google quoted.

That sentence is the literal training-and-retrieval target of the AI engine. Reverse-engineering which sentences get fragment-targeted is reverse-engineering Google’s extraction policy.

The counterintuitive finding: position doesn’t matter

SALT Agency analyzed 2,318 unique URLs cited in AI Mode, recording the vertical pixel position of each highlighted fragment. The headline finding:

Researchers found no correlation between how high text appears on a page and whether Google’s AI selects it for citation. Average citation depth varied by vertical, from 2,400 pixels in travel to 4,600 pixels in SaaS – well below the traditional above-the-fold area.
SALT Agency, 2026

How to extract intelligence from text fragments at scale

Three practical workflows:

Manual reverse-engineering. Run your top 20 prompts in Google AI Mode. Inspect each citation URL, decode the #:~:text= fragment, and catalog which sentences Google chose. Patterns will emerge: definitions, statistics with named sources, lists with numerical structure.
Programmatic collection. Use a scraping tool to collect citation URLs at scale. Decode fragments with a small Python script (urllib.parse.unquote). Store in a database for trend analysis.
Open-source pipeline. The Grounding Citation Analysis project ships a complete pipeline that has analyzed 42,971 citations across 6 AI platforms. Reproducible, open-source, designed for this exact workflow.

What the extracted sentences consistently look like

After analyzing tens of thousands of fragment-extracted passages, recurring patterns:

Definition-style sentences. “X is a Y that does Z.” Concise, complete, decontextualized.
Statistics with sources. “According to [source], 47% of [thing] [does action].”
Direct answers to implicit questions. Sentences that read like featured-snippet answers.
List items. Especially numbered lists where each item is one complete thought.
Quoted experts. Direct quotes from named individuals with sameAs-resolvable identity.

Conversely, what almost never gets extracted: long narrative paragraphs, content with heavy pronouns (“This means…”), sales copy, and AI-generated filler.

Frequently Asked Questions

Are text fragments only used by Google?

The fragment syntax is a browser standard, but Google AI Mode and Gemini citations are the most consistent users. Other AI engines link to source URLs without fragments, hiding which sentence was extracted.

Can I influence which sentence gets fragment-extracted?

Indirectly, yes. The patterns above (definitions, statistics with sources, direct-answer sentences) are extracted at much higher rates. Write at least one sentence per section that fits one of those patterns.

Is there an open-source tool for fragment analysis?

Yes – the Grounding Citation Analysis project decodes fragments at scale across 6 AI platforms and ships sentence-level analysis notebooks. Free, open-source, MIT-licensed.

Want this implemented for your brand?

I help growth-stage companies own their category in AI search. Book a strategy call.

Book a strategy call

Web Text Fragments: Decoding Google AI Mode’s Hidden Citation Signal