Reddit, YouTube, and the Unexpected Sources AI Trusts Most
We analyzed citation patterns across 50,000+ AI responses to map which platforms AI engines actually pull from. The results dismantle several widely-held assumptions — including the idea that Wikipedia dominates ChatGPT and that press coverage builds AI credibility.
TLDR
Reddit appears in 68% of AI responses and represents 46.7% of Perplexity's top-10 citations. YouTube overtook Reddit as the #1 social citation source in 2026 (16% vs 10% of all citations). Wikipedia accounts for only 7.8% of ChatGPT citations — not the 47.9% widely cited. Press release syndication sites appear in less than 1% of citations across all engines. Industry blogs and news sites remain the most reliable citation targets.
The Wikipedia myth
The most pervasive misconception about AI citations is that ChatGPT primarily cites Wikipedia. This belief stems from early research (2023) that measured Wikipedia's share of training data, not citation frequency in live responses. More recent analysis of actual ChatGPT outputs with web search enabled tells a different story.
Wikipedia accounts for approximately 7.8% of ChatGPT citations — meaningful, but nowhere near a dominant share. The figure often quoted (47.9%) was a training data statistic. With web search grounding active, ChatGPT distributes citations much more broadly across industry publications, company websites, and community platforms.
For Claude — which relies more heavily on its training data and less on real-time search — Wikipedia is indeed more prominent at around 11% of citations. But even here, industry-specific sources consistently outperform encyclopedic content.
Platform citation breakdown
| Platform | ChatGPT | Perplexity | Google AI | Claude | Notes |
|---|---|---|---|---|---|
| 12% | 46.7%* | 2.2% | 8% | *Top-10 citations | |
| YouTube | 9% | 7% | 1.9% | 4% | #1 social source overall |
| Wikipedia | 7.8% | 3% | 1.1% | 11% | Common myth: ChatGPT doesn't use it as 47.9% |
| Industry blogs | 18% | 14% | 22% | 21% | Highest across all engines |
| News sites | 11% | 19% | 31% | 13% | Especially strong for Google AI |
| Company websites | 15% | 8% | 18% | 16% | Varies heavily by query type |
| Press release sites | <1% | <1% | <1% | <1% | Near-zero — effectively ignored |
| Academic papers | 4% | 2% | 1% | 9% | Claude favors academic sources |
| Podcasts / transcripts | 2% | 1% | 0.5% | 1% | Growing — transcripts get indexed |
Citation share of total web-search-grounded responses, Pheme analysis, Q1 2026
Reddit: the silent kingmaker
Reddit's citation dominance in Perplexity responses (46.7% of top-10 citations) is the starkest finding in our dataset. The mechanism is structural: Perplexity emphasizes real-time retrieval of recent, highly-engaged content. Reddit threads with hundreds of upvotes and active comment sections tick every box — they're fresh, they're popular, and they contain authentic human perspectives that AI engines increasingly weight as E-E-A-T signals.
For brands, Reddit presence is not optional for Perplexity visibility. Brands that appear frequently in relevant subreddit discussions, product comparison threads, and "best of" posts are disproportionately cited. Crucially, negative Reddit sentiment is also cited — AI engines reflect community perception, not just marketing.
How to build AI-effective Reddit presence
One important nuance: Reddit's citation dominance is topic-dependent. For consumer products, travel, and lifestyle queries, Reddit is overwhelmingly cited. For B2B software, technical documentation, and legal/financial topics, industry publications dominate. Know your query type before allocating Reddit effort.
YouTube: the citation source brands are missing
YouTube overtook Reddit as the single largest social citation source in 2026, accounting for approximately 16% of social platform citations vs. Reddit's 10%. This shift is driven by two factors: AI engines increasingly processing video transcripts as text content, and YouTube's own AI features surfacing video content in knowledge panels.
Most brands dramatically underinvest in YouTube relative to its AI citation value. A well-structured review video, tutorial, or product explainer that ranks on YouTube generates AI citations through transcript indexing — citations that no SEO tool tracks but Pheme captures directly.
For YouTube content to generate AI citations: (1) enable auto-generated transcripts and correct them; (2) include the brand name, product names, and key specifications in speech (not just visually); (3) title and describe videos with the exact queries your customers ask AI engines.
Why press releases have near-zero impact
Press release syndication (PRNewswire, BusinessWire, GlobeNewswire) has a citation correlation of 0.07 — effectively zero. Companies spending thousands monthly on PR distribution in hopes of improving AI visibility are allocating budget with essentially no return.
The structural reason: syndicated press releases are duplicate content across dozens of low-engagement domains. AI models learn to discount duplicated content, and engagement signals (shares, replies, links from distinct domains) are absent. A press release on PRNewswire is a signal of company activity, not editorial validation.
What does work in the PR category: genuine media coverage in specific publications (not syndication), journalist-written articles that cite your brand, and editorial inclusion in "best of" or comparison roundups. These generate unique referring domains and editorial signals that translate to AI citations. Redistribution from one source through a wire service does not.
Practical implications by engine
Perplexity
Prioritize Reddit presence, active forum participation, and recent content publishing. Perplexity cites from the last 30 days heavily — publish short, structured updates frequently.
ChatGPT
Focus on industry publication coverage, company website structure, and Wikipedia presence for your category. ChatGPT distributes citations broadly — breadth of domain coverage matters.
Google AI Mode
News site coverage and YouTube content dominate. Combine traditional PR (editorial, not syndication) with YouTube SEO for maximum effect.
Claude
Academic sources and Wikipedia are weighted higher than for other engines. For technical topics, publish on arXiv, academic platforms, or detailed long-form technical documentation.
Track exactly which sources AI uses to describe your brand
Pheme shows you the specific URLs AI engines cite when discussing your brand — so you know exactly where to invest your community and content efforts.
Join the waitlist