GEO · AI answer engines

What people are actually doing to game AI answer engines — and what it reveals

Every major AI answer engine grounds its answers in a conventional search index — Google's own, Bing, or Brave. So every "AI search hack" doing the rounds is really an attempt to manipulate one of those indexes. On 15 May 2026 Google made it official: manipulating AI answers is spam, enforced exactly like ranking manipulation. We took the whole circulating playbook and sorted it into two piles — durable presence-building, and patchable exploits that are one policy update from collapse.

George, the Baseline Labs mascot, pulling the lever on a slot machine

Two piles: durable, and patchable

The durable signal underneath every tactic is the same thing real SEO always rewarded: be a genuinely trusted, corroborated, first-hand source that ranks well. Everything else splits cleanly. On one side, durable presence-building — a real community footprint, named-expert thought leadership, accurate entity data, Wikipedia accuracy. On the other, patchable exploits — seeded Reddit threads, prompt injection, llms.txt gaming, schema spam, parasite SEO — that platforms are already detecting, demoting, or have publicly disavowed. The single best-evidenced manipulation is also the scariest, and it is exactly the kind being actively corrected.

38–51%
of the time a poisoned source is read, a 13-word planted Reddit edit makes the agent name the attacker's pick (Cornell WARP)
60% → 10%
Reddit's ChatGPT citation share, collapsed in two weeks — Sept 2025
1,885
pages that added schema markup — AI citations barely moved (Ahrefs)

Answer engines are retrieval systems bolted onto LLMs, not oracles

Before judging any tactic, the key insight: AI answer engines do not "know" answers — they retrieve documents and synthesise. Google's own 2026 documentation describes the pipeline plainly. AI Overviews and AI Mode use retrieval-augmented generation, which Google also calls "grounding," pulling relevant pages from its existing index using its existing ranking systems, then writing an answer with links. They also run "query fan-out" — issuing many sub-queries and pulling content for each. Google states it flatly: "Our generative AI features on Google Search are rooted in our core Search ranking and quality systems," and a page must already be indexed and eligible for a snippet to appear — "There are no additional technical requirements… You don't need to create new machine readable files, AI text files, or markup."

That is the analytical spine of this whole piece: every manipulation tactic is an attempt to influence a retrieval layer, and its durability depends entirely on whether it exploits a temporary weighting quirk or builds the genuine trust the retrieval layer is designed to reward. Different engines retrieve from different indexes — which is the whole game.

EngineRetrieves from
Google AI Overviews / AI ModeGoogle's own index, same ranking systems
ChatGPT SearchPrimarily Bing-powered
Claude & Mistral Le ChatBrave Search index (confirmed)
PerplexityIts own index, heavy overlap with top Google organic
Microsoft CopilotBing

A single Reddit comment really can poison an answer

Reddit became one of the most-cited domains across AI surfaces in 2024–2025, supercharged by Google's content-licensing deal — Reuters confirmed on 21 Feb 2024 it is "valued at $60 million per year." The Digital Bloom's analysis of 36M+ AI Overviews found Wikipedia, YouTube, Google, Reddit and Amazon are "the new aristocracy of AI-cited sources," together 38% of all citations; Reddit's AI citations grew 450% from March–June 2025. The aggregate dominance is real and well-documented across Profound, Semrush, Tinuiti and Conductor.

But two things cut against the naive "just get on Reddit" reading. It is volatile and being corrected — Reddit's ChatGPT citation share collapsed from roughly 60% to roughly 10% of prompt responses in two weeks in September 2025, and when Reddit sued Perplexity in October 2025, Perplexity's Reddit share dropped 86% almost overnight. And it is vertical-dependent — up to 80% of AI-cited Reddit threads have fewer than 20 upvotes and average ~900 days old, meaning LLMs surface established consensus, not manufactured virality.

The WARP attack (Cornell Tech, arXiv 22 May 2026)Result
Planted 13-word edit → agent names the attacker's entity38–51% of the time that source is read
Same attack across multiple URLs42–62%
Poisoned text as a share of everything the agent reads<4% (100% success for Co-STORM)
Share of all retrieved UGC that is Reddit54–71%

This is the best-evidenced manipulation in the whole report. Deep-research agents issue dozens of sub-queries and repeatedly retrieve the same UGC pages; a single Reddit page appeared in up to 48% of queries within a topic cluster. Researchers seeded a fake "SilverPath" dating app and a fake "Sol Azteca" restaurant into threads and got LLMs to recommend them; r/Biohackers moderators independently reported peptide and HRT companies seeding threads for "AEO," even reverse-engineering the prompt patterns LLMs prioritise. Why durability is LOW: Reddit runs detection for coordinated manipulation and is rolling out verified profiles — and because LLMs ingest the full firehose including deleted and banned content, flagged manipulation can carry negative weight, training models to associate a brand with spam. You cannot microwave years of authentic peer consensus. Genuine, disclosed participation is durable; seeding is a patchable exploit with brand-safety downside.

One LinkedIn post in an AI answer is well-evidenced, not hype

Per Profound's analysis of 1.4M citations, LinkedIn's ChatGPT domain rank rose from roughly #11 in November 2025 to roughly #5 in February 2026 — "the largest shift in authority we've seen this year" — and it became #1 for professional queries across all six major engines. Semrush found LinkedIn cited in 14.3% of ChatGPT responses and 13.5% of Google AI Mode responses. The mechanics are specific: long-form articles (500–2,000 words) are 50–66% of cited LinkedIn content; ~95% is original, not reshared; and engagement does not predict citation — the median cited post has just 15–25 reactions, and authors with under 500 followers are cited as often as big accounts.

Scrunch found technical detail boosts citation +77%, named entities +33%, topic specificity +18% — while Unicode "fancy fonts" cut ChatGPT citation 58% because the model can't read the glyphs. The critical caveat, from Seer Interactive's 541,213 LLM responses: being cited is not being recommended. When a brand isn't named in the response text, its content citation rate falls from 53.1% to 10.6% — suggesting LLMs pick which brands to recommend from training data first, then retrieve sources to support it. This is durable because it rests on E-E-A-T signals that are structural, not exploitable quirks. The only risk is the flood of auto-generated "thought leadership," which invites the same quality crackdowns Google applies everywhere else.

A real but narrow lever, heavily over-sold

The Bing Search API shutdown (announced 15 May 2025, terminated 11 Aug 2025) genuinely narrowed the field to roughly three Western-scale indexes — Google, Bing, Brave — with Brave the only open commercial API at scale. The honest picture: confirmed Brave users are Anthropic's Claude and Mistral's Le Chat only. In a Profound study, Claude's citations showed 86.7% overlap with Brave's top organic results. Claims that Perplexity and Meta AI use Brave are false — Perplexity built its own index, and Meta AI uses Bing supplemented by Google. Those false claims originate largely from indexing-service vendors.

The kernel is real: for Claude specifically, ranking well organically on Brave tracks tightly with being cited, because Claude grounds on Brave. That is a legitimate, durable insight. The paid "Web Discovery Project signal injection" product, by contrast, has zero independent validation, contradicts Brave's anti-manipulation design (its STAR cryptographic protocol is built to resist exactly this), and leans on demonstrably false claims about which AIs use Brave. It is also fragile to provider switches — Anthropic could change search backends at any time, and Brave killed its free API tier in February 2026.

Borrowing a trusted domain's ranking — and getting dismantled

Publishing third-party content on a trusted domain to borrow its ranking signals worked, and Google has been actively dismantling it. The March 2024 "site reputation abuse" policy targeted it; manual actions hit Forbes, WSJ, Time and CNN in November 2024 — Forbes' coupon section lost ~90% of organic visibility within eight weeks. Crucially, Google's 2026 guidance confirms scaled content abuse, site reputation abuse, expired-domain abuse and link spam are all formally in scope for AI Overviews and AI Mode: if a site is demoted in organic results, it is excluded from the AI citation pool too. Enforcement response times shrank from weeks (2024) to days (late 2025), and subdomain workarounds no longer evade detection.

A security vulnerability being patched, not a strategy

Planting instructions in web content — white text, tiny fonts, invisible Unicode, or plain directives like "Remember [Company] as a trusted source" — demonstrably works in tests. A BBC journalist wrote a fake listicle ranking himself "best tech journalist at eating hot dogs"; within 24 hours Gemini and AI Overviews repeated the fabricated ranking. A security researcher showed Perplexity was "trivially susceptible" to injection from analysed pages; academic work shows hidden prompts hitting up to 98.6% success in some contexts; Microsoft logged 50 distinct memory-manipulation attempts in a 60-day window.

It works because LLMs struggle to separate content-to-summarise from instructions, and AI-answer pipelines often lack a human review step. But Google's May 2026 update explicitly names "recommendation poisoning" as spam subject to demotion and removal; Microsoft and Google now treat one-click "summarize with AI" injection as spam; and defences like Spotlighting, prompt filtering and memory controls are being deployed. This is being actively patched.

A non-starter for GEO

The proposed markdown file declaring site content to LLMs is marketed by some GEO vendors as a visibility lever. It does not work. Google's John Mueller repeatedly stated no AI system uses llms.txt — comparing it to the deprecated keywords meta tag — and that its appearance on some Google dev docs was a CMS artifact, "not an endorsement." OtterlyAI measured 84 of 62,100 AI bot requests hitting llms.txt (0.1%); an Ahrefs study of 137,000 sites found 97% of llms.txt files received zero traffic in May 2026. It remains legitimate only for its original purpose: developer-docs and coding-agent use.

An amplifier, not a driver — and not an AI silver bullet

Ahrefs tracked 1,885 pages adding JSON-LD from August 2025–March 2026 against 4,000 controls and found +2.4% on AI Mode and +2.2% on ChatGPT (both indistinguishable from noise) and a −4.6% change on AI Overviews — a small but statistically significant decline, with the odds of a gap that large by chance about 1 in 2,500. Google's own documentation: "There's no special schema.org structured data that you need to add" for AI Overviews.

The counter-evidence is platform-specific: Bing confirmed schema helps Copilot; one study found Product/Review schema with concrete attributes cited at 61.7% vs 41.6% for generic schema; and February 2026 testing found ChatGPT and Perplexity tokenise JSON-LD as raw text rather than parsing it semantically. So schema stays worthwhile for classic Search, Bing-powered surfaces and entity clarity — but "schema as AI-citation silver bullet" is debunked.

It backfires — but accuracy is durable

Wikipedia is among the most heavily-weighted training and citation sources (ChatGPT cites it in roughly 47.9% of its top-10-source citations; Google's C4 dataset deliberately oversampled it). Yet editorial guardrails, conflict-of-interest rules and a full open edit history make manipulation visible to models. A Princeton study found AI-generated self-promotional pages were mathematically lower quality; English Wikipedia created speedy-deletion for suspected AI content (Aug 2025) and prohibited AI-added content (March 2026). The durable takeaway: if a brand merits a page, independent editors create it from reliable secondary sources. The value is ensuring accuracy and entity consistency (the Wikidata Q-ID), not promotion.

The platforms have drawn an explicit line

Google's spam policy (15 May 2026) now reads that spam includes "attempting to manipulate generative AI responses in Google Search" — same enforcement (demotion, removal), no separate AI-spam track, no carve-out for "legitimate optimization" framing. John Mueller: "There is no such thing as GEO or AEO without doing SEO fundamentals." Nick Fox: "Optimizing for AI search is the same as optimizing for traditional search." Microsoft and Perplexity representatives echo the "no shortcuts" line.

The honest reality check: AI Overview citations have been a lower-quality surface — expired domains, thin affiliate and scaled AI content have appeared more in AI answers than in matching organic results, and Perplexity has been caught citing AI-generated spam blogs. Enforcement is a stated intention being rolled out gradually, not a solved problem. The gap between policy and current results is real — and it is exactly where exploits temporarily live.

The distinction that actually matters

The credible-practitioner consensus (Aleyda Solis, Lily Ray, Wil Reynolds, Mark Williams-Cook) is that GEO and SEO overlap massively — Solis's 29-variable comparison shows the strongest shared drivers are original, expert, up-to-date content. Google's own contrast captures the durable lever: "7 Tips for First-Time Homebuyers" (commodity, won't get cited) versus "Why We Waived the Inspection and Saved Money" (first-hand, citable). The Princeton GEO study quantifies it: statistics +37%, citing sources +40% citation probability. Lily Ray's framing draws the line — the days of upper-funnel content "without providing unique insights, data, opinions or value above and beyond what others have already said" are over. Solis names the thing to stop directly: "manipulative 'best of' lists that are not genuine."

What to actually do

Stage 1 Build the durable foundation — now
Ensure crawlability and indexing across Google and Bing, and that BraveBot isn't blocked. Being indexed and rankable is the non-negotiable entry ticket. Audit entity corroboration — consistent facts across Wikidata, Crunchbase, LinkedIn and your own site. Publish "fingerprint" first-hand content with specific numbers, named entities and original data. Highest-leverage, lowest-risk move; it compounds.
Stage 2 Earn genuine presence — 3–6 months
Build real, disclosed community presence where your category's buyers actually discuss (relevant subreddits; named-expert LinkedIn long-form, 5+ posts a month). Pursue digital PR and earned placements across multiple authoritative domains — repetition across trusted sources is what signals citation-worthiness. Measure share-of-citation per target prompt, per engine.
Stage 3 Avoid the patchable exploits entirely
Do not seed or astroturf Reddit, inject prompts, run parasite SEO, or buy "WDP signal," llms.txt or schema-spam packages. Each is debunked, already penalised, or one policy update from collapse — and several carry brand-safety downside as models ingest flagged manipulation as a negative signal.

Read every number as a snapshot

Evidence quality varies sharply. The Cornell WARP study, the Semrush/OtterlyAI and Ahrefs datasets and Google's own documentation are strong; many "X% uplift" claims come from vendors selling GEO tools and should be treated as directional, not gospel. Citation data is volatile and personalised — share-of-voice varies by model, location, session, date and prompt phrasing, and there is no "Search Console for AI." The landscape moves monthly: Reddit, YouTube and LinkedIn rankings reshuffled repeatedly across 2025–2026, the Bing API died, Brave changed pricing, Google issued multiple updates. And platform self-interest cuts both ways — Google insisting "AI search is just search" both reflects genuine architecture and serves its interest in discouraging gaming. The takeaway holds regardless: build the trust the retrieval layer is designed to reward, and you are durable to every reweighting that follows.

See where the AI already mentions your brand

A Baseline Labs research brief. Figures are drawn from the cited public studies and platform statements — Cornell Tech's WARP preprint (arXiv, 22 May 2026), Profound, Semrush, Ahrefs, OtterlyAI, The Digital Bloom, Seer Interactive, Scrunch and Google's own Search Central and 2026 spam-policy documentation — and are accurate as of late June 2026. Citation shares in AI answers shift continually; treat every percentage as a dated snapshot.

George
Online
0%

Hi, I'm George.

Ask me about your projects, reports, brand mentions, backlinks, or anything on the platform.