The digital marketing landscape is currently obsessed with a specific set of viral charts. You have likely seen them on LinkedIn or in industry newsletters: bar graphs showing Wikipedia and Reddit as the undisputed kings of AI citations. These studies, including recent comprehensive research from Semrush, confirm that major Large Language Model (LLM) platforms like ChatGPT, Claude, and Perplexity lean heavily on these massive domains to anchor their responses.
For many Chief Marketing Officers (CMOs) and SEO directors, the takeaway seems obvious. If the AI is citing Reddit, we must “do Reddit.” This has led to a gold rush for “Reddit SEO” agencies and a frantic attempt to manufacture presence on community-driven platforms. However, taking this aggregate data and pivoting your entire Generative Engine Optimization (GEO) strategy toward these giants is a fundamental strategic error for the vast majority of B2B and niche B2C brands.
While the algorithmic tide is indeed shifting toward community forums and open-source knowledge bases, the way this shift is being interpreted by the marketing industry is largely misguided. To win in the era of AI search, you need to stop chasing aggregate citations and start understanding the nuances of how LLMs determine “ground truth.”
Why the Reddit hype is misleading executive strategy
The charts driving executive FOMO (fear of missing out) are mathematically accurate, but they lack the necessary context for high-stakes business strategy. When a study tallies the top-cited domains across an entire LLM platform, it is aggregating responses to hundreds of thousands of randomized keywords. These range from “how to boil an egg” and “Marvel movie timelines” to “the history of the Roman Empire.”
As industry expert Alex Birkett has pointed out, Wikipedia, Reddit, and YouTube are cited so frequently because they are massive websites whose topical footprint spans millions of subjects. By default, they will always win the aggregate numbers game. If an AI needs a general definition or a broad public consensus, it goes to the places where the most human data exists. This does not mean these platforms are the primary drivers for a buyer looking for specialized enterprise software or professional services.
The current obsession with Reddit specifically stems from the perception that it is an “SEO loophole.” While marketers respect the nearly impenetrable editorial guardrails of Wikipedia, they view Reddit as a playground where they can manufacture sentiment. This has resulted in a classic case of marketing whiplash: teams are abandoning foundational content principles to chase a shiny new object that they don’t fully understand.
Macro studies vs. micro intent
The core problem with following macro studies is that they ignore search intent. A study that aggregates 100,000 queries will inevitably be weighted toward top-of-funnel (TOFU) and informational queries. In those categories, Wikipedia is an unbeatable authority. However, for bottom-of-funnel (BOFU) queries—the ones that actually drive revenue—the AI’s behavior changes significantly.
When you see a Reddit thread ranking at the top of a Search Engine Results Page (SERP) or being cited by an AI for a “best software” query, it isn’t an accident or a hack. It is often the result of years of authentic, unprompted human discussion. This “voice of the customer” is what the AI is seeking. Your marketing team cannot “microwave” three years of organic brand sentiment into a two-week Reddit campaign. Trying to do so ignores the historical context that LLMs value.
The illusion of hacking Reddit and Wikipedia for AI visibility
If you decide to ignore the macro context and pursue a Reddit-first strategy anyway, you will quickly run into the technical reality of how LLMs process information. Hacking these platforms for citations is an illusion built on a fundamental misunderstanding of AI training and data ingestion.
Historical consensus cannot be manufactured
Many “Reddit SEO” agencies promise to trigger AI visibility by generating hundreds of upvotes and comments on specific threads. However, the data suggests that LLMs do not care about manufactured virality. According to Semrush research, up to 80% of Reddit threads cited by AI tools have fewer than 20 upvotes. More importantly, the average age of a cited post is approximately 900 days.
This reveals a critical truth: LLMs are not looking for what is trending today; they are looking for established, historical consensus. They prefer threads that have stood the test of time and have been validated by a community over a period of years, not hours. A sudden burst of activity from new accounts is more likely to be flagged as noise than to be treated as a signal of authority.
The Wikipedia moderation wall
Wikipedia presents an even steeper challenge. A study from Princeton University recently analyzed AI-generated content on Wikipedia and found that human moderators are incredibly efficient at spotting and removing promotional content. When marketers attempt to use generative tools to create self-promotional pages or “nudge” existing articles, the quality typically falls below Wikipedia’s strict standards.
The Princeton researchers found that these “hacked” articles often lacked proper footnotes and internal links. Human editors quickly identified this as “unambiguous advertising,” leading not only to the deletion of the content but to the active banning of the accounts involved. For a brand, being blacklisted by Wikipedia editors is a permanent stain that can influence how AI models—which ingest Wikipedia’s entire edit history—view your brand’s credibility.
Paraphrasing and the loss of narrative control
Even if you successfully plant a mention on Reddit or Wikipedia, you lose control over your product’s positioning. As Benji Hyam has noted, Reddit mentions are often too brief and lack the technical depth necessary for an LLM to associate a product with a specific complex problem. Furthermore, AI tools do not quote these sources word-for-word.
Data shows that AI responses have a semantic similarity score of only 0.53 when compared to their Reddit sources, on a scale where 1.0 would indicate a near-verbatim match. This means the AI is blending, mashing, and paraphrasing your carefully crafted “organic” mention with other random, anonymous comments. Your value proposition gets diluted into a dry, neutral, or potentially confusing summary. At that point, the “citation” provides very little actual marketing value.
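To make that 0.53 figure concrete: semantic similarity is typically measured by embedding both texts and taking the cosine of the resulting vectors. The sketch below shows the mechanics, assuming a common open-source sentence-embedding model; the model choice and both example texts are illustrative, not taken from the cited study.

```python
# A minimal sketch of a semantic-similarity measurement. The model and
# both texts are illustrative; the cited study's methodology may differ.
from sentence_transformers import SentenceTransformer, util

model = SentenceTransformer("all-MiniLM-L6-v2")

# Hypothetical organic mention vs. a hypothetical paraphrased AI answer.
reddit_comment = (
    "We switched to AcmeCRM last year and the pipeline reports alone "
    "paid for the subscription."
)
ai_answer = (
    "Users report that AcmeCRM offers solid reporting features at a "
    "reasonable price point."
)

# Encode both texts into normalized vectors, then take their cosine.
embeddings = model.encode([reddit_comment, ai_answer], normalize_embeddings=True)
score = util.cos_sim(embeddings[0], embeddings[1]).item()
print(f"semantic similarity: {score:.2f}")  # 1.0 = near-verbatim; ~0.5 = heavy paraphrase
```

A score hovering around 0.5 means the response shares the general topic with its source but little of the wording, which is precisely the dilution problem described above.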
The hidden risks of astroturfing and data feeds
Beyond the lack of ROI, there is a significant reputational risk involved in trying to game these platforms. In the SEO world, this is known as “astroturfing”—the practice of creating a fake grassroots movement to promote a brand. In the age of AI, astroturfing is more dangerous than ever.
LLMs see what was deleted
One of the most critical misunderstandings in modern SEO is how AI models ingest data. Companies like Google and OpenAI do not just “scrape” the live web; they have direct data-sharing agreements with platforms like Reddit. Furthermore, Wikipedia’s entire edit history is openly licensed, publicly downloadable, and part of the training data for most LLMs.
This means that even if your agency’s fake comments or promotional edits are deleted by moderators within minutes, the AI still “sees” them in the data firehose. Because the AI can see the moderation pipeline, it can identify which links or mentions were flagged as spam. By attempting to game the system, you are essentially training the AI to associate your brand name with coordinated manipulation and low-trust signals. You aren’t building authority; you are building a “spam” label that could haunt your generative search rankings for years.
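This public record is easy to inspect yourself. The sketch below queries the standard MediaWiki API for a page’s recent revisions and surfaces the ones that were undone; the page title is just an example, and English Wikipedia marks undone edits with the “mw-reverted” tag.

```python
# Sketch: pull a Wikipedia article's public revision log, the same record
# that ends up in training corpora. The page title is an example.
import requests

API = "https://en.wikipedia.org/w/api.php"
HEADERS = {"User-Agent": "edit-history-demo/0.1 (illustrative script)"}

def recent_revisions(title: str, limit: int = 50) -> list[dict]:
    """Return a page's most recent revisions with editor, summary, and tags."""
    params = {
        "action": "query",
        "prop": "revisions",
        "titles": title,
        "rvlimit": limit,
        "rvprop": "timestamp|user|comment|tags",
        "format": "json",
    }
    resp = requests.get(API, params=params, headers=HEADERS, timeout=10)
    resp.raise_for_status()
    page = next(iter(resp.json()["query"]["pages"].values()))
    return page.get("revisions", [])

for rev in recent_revisions("Customer relationship management"):
    # English Wikipedia applies the "mw-reverted" tag to edits that
    # moderators undid; the attempt itself stays permanently visible.
    if "mw-reverted" in rev.get("tags", []):
        print(rev["timestamp"], rev["user"], rev["comment"])
```

Even after a promotional edit is reverted, the attempt, the editor’s summary calling it out, and the account that made it all remain in this log.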
Community backlash
Reddit communities are notoriously protective of their space. Subreddits like r/HailCorporate are dedicated specifically to calling out and shaming brands that try to infiltrate discussions with “stealth” marketing. A failed Reddit campaign doesn’t just result in zero citations—it can result in a public relations nightmare that becomes a permanent part of the search results for your brand name.
Where AI actually looks for ground truth in B2B
If Reddit and Wikipedia are not the primary drivers for high-intent business recommendations, where are the LLMs getting their information? When you filter for specific industries and high-intent prompts, the “Reddit is everywhere” narrative falls apart.
Using AI visibility tools like Scrunch AI, researchers have tracked how LLMs respond to specific B2B software prompts. For one client, tracking over 300 custom prompts revealed that while Reddit was occasionally cited, the vast majority of recommendations were driven by just two specific, long-standing threads. The rest of the “Reddit dominance” simply wasn’t there for their niche.
The Wikipedia data was even more telling. For high-intent software queries, Wikipedia barely registered as a source. When it was cited, it was used almost exclusively for broad category definitions—explaining what “CRM” stands for, for example—rather than recommending which CRM a buyer should choose.
Niche authority and industry-specific domains
Data from Grow and Convert highlights that for specialized queries—such as “trucking software”—the AI consistently cites industry-specific domains like PCS Software or TruckingOffice. For project management queries, the AI leans on specialized review sites, niche blogs, and deep-dive comparison articles written by humans with actual experience in the field.
This proves that you don’t need to be visible “everywhere.” You only need to be visible in the “digital neighborhood” where your buyers live. AI engines are designed to find the most relevant, authoritative source for a specific question. For a B2B buyer, that source is rarely a random Reddit thread; it is an authoritative industry publication or a deeply technical product page.
How to actually earn AI recommendations: The path forward
Winning in the era of generative search requires a shift from “hacking” to “earning.” Instead of trying to trick the algorithm into citing you, you must provide the depth of information that makes you the most logical choice for the AI to recommend.
1. Prioritize deep, human-written owned content
Your own website is still your most valuable asset. To be recommended by an LLM, your content needs to be more than just a list of features. It needs to provide the granular, technical depth that an AI uses to understand context. Your key solution pages should explicitly cover:
- Specific Use Cases: Don’t just say what your product does; explain exactly who it is for and what scenario they are in when they use it.
- Problem-Solution Mapping: Clearly articulate the pain points your product solves. LLMs are excellent at “connecting the dots” between a user’s problem and a documented solution.
- Comparative Value: Without being disparaging to competitors, explain your unique approach to solving industry problems. This helps the AI categorize you accurately.
2. Execute targeted citation outreach
Rather than casting a wide net on Reddit, use AI visibility tools to identify which niche domains the LLMs are already citing in your category. Once you identify these “influencer” domains—whether they are trade publications, specialized review sites, or niche blogs—focus your PR and guest posting efforts there. Earning a single mention on a highly authoritative industry site is worth more than a hundred manufactured Reddit comments.
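In practice, this targeting step is mostly counting. Here is a minimal sketch, assuming you can export the citations your visibility tool records for each tracked prompt; the export format, URLs, and paths below are all illustrative.

```python
# Sketch: tally which domains the AI engines actually cite for your
# tracked prompts. The export format and all URLs are illustrative.
from collections import Counter
from urllib.parse import urlparse

# One record per tracked prompt, with the URLs the engine cited.
tracked_prompts = [
    {
        "prompt": "best trucking software",
        "citations": [
            "https://www.truckingoffice.com/blog/example-roundup",
            "https://www.nichefleetreviews.com/top-tms-tools",
        ],
    },
    {
        "prompt": "trucking dispatch tools",
        "citations": [
            "https://www.nichefleetreviews.com/dispatch-guide",
            "https://old.reddit.com/r/Truckers/comments/example",
        ],
    },
]

# Normalize each citation to its bare domain and count occurrences.
domain_counts = Counter(
    urlparse(url).netloc.removeprefix("www.")
    for record in tracked_prompts
    for url in record["citations"]
)

# The head of this ranking is your outreach target list.
for domain, count in domain_counts.most_common():
    print(f"{count:3d}  {domain}")
```

Sorting by frequency turns a vague “do more PR” mandate into a ranked list of the handful of domains the engines already trust in your category.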
3. Optimize for “Search Everywhere,” but don’t “Be Everywhere”
The mantra of “Search Everywhere” has been misinterpreted to mean that a brand needs a presence on every single platform. In reality, it means you should be present on the platforms that matter to your specific audience. If your buyers aren’t on Reddit to research enterprise-grade security software, you don’t need to be there either. Focus your resources on the three or four “neighborhoods” that drive the most high-intent traffic in your industry.
A sustainable approach to Reddit and Wikipedia
None of this is to say that Reddit and Wikipedia have no value. They are incredibly authoritative platforms. However, they should be treated as long-term brand-building assets, not short-term SEO tactics. If you want to leverage these platforms, you must respect their ecosystems.
Building a Reddit presence the right way
If you want to engage on Reddit, do so with transparency. Create an official branded subreddit where your product team can answer questions, host “Ask Me Anything” (AMA) sessions, and provide genuine support. Participate in existing communities by offering expert advice without dropping links. Over time, this builds “street cred” that the AI will eventually recognize as a genuine signal of authority.
The Wikipedia rule: Let others talk about you
The golden rule of Wikipedia is that you do not write about yourself. If your brand is truly significant in your industry, independent editors will eventually create a page for you based on secondary sources (like news articles and industry reports). Your job is to do things worth writing about in those secondary sources. The Wikipedia citation is a byproduct of being a leader in your field, not a task to be checked off by a marketing assistant.
Conclusion: Authority is earned, not manufactured
The charts showing Reddit and Wikipedia dominance are a reflection of the massive amount of general human knowledge stored on those sites. They are not a blueprint for B2B marketing success. Chasing these platforms with “growth hacks” and “astroturfing” is a high-risk, low-reward strategy that can actively damage your brand’s standing with the very AI models you are trying to influence.
The path to winning AI recommendations is simpler, though it requires more effort: build a brand that is genuinely recommendable. Focus on deep, authoritative content on your own domain, earn mentions in niche-specific publications, and engage authentically with your community. AI engines are mirrors—they reflect the authority and trust that already exists in the digital world. If you want the algorithm to recommend your brand, you have to do the work to be the best answer in the room.