Reddit is the single most-cited domain in AI search, but LLMs pull old, low-vote threads, not viral posts. How they really use Reddit for product queries.
Reddit is the most-cited domain in AI search, and the instinct that follows is almost always wrong. Operators read "Reddit gets cited more than any news outlet" and go chase a viral thread with 10,000 upvotes, or worse, pay for one. The data says LLMs are doing something close to the opposite: the average cited Reddit post is roughly 900 days old, carries a median of 5 to 8 upvotes, and runs about 80 words. AI engines are not surfacing Reddit's front page. They are mining its long tail for a specific person answering a specific product question in plain language, and that is a very different thing to optimize for.
This is the operator read on how ChatGPT, Perplexity, and Google actually pull Reddit for product and category queries, which engines lean on it hardest, and why the thread that gets you cited looks nothing like the thread that goes viral. Signals runs an aged Reddit account marketplace plus an editorial network for AI brand mentions across Reddit, Quora, Product Hunt, and Threads, so we watch both sides of this: which Reddit content the engines retrieve, and what it takes to be the account credibly present in it. If you only take one idea from this piece, make it that Reddit visibility in AI is an evergreen-seeding game, not a velocity game.
Through two mechanisms operators keep blurring: training and live retrieval. Reddit is baked into the models through licensing deals, and it is pulled fresh at answer time through search. For a product query, the retrieval half usually does the visible work, and it reaches for Reddit because that is where real buyers argue about real tools in words the model can extract cleanly.
The training layer is contractual, not accidental. Google signed a $60M-per-year licensing deal with Reddit in February 2024, and OpenAI signed its own partnership to bring Reddit content into ChatGPT. So Reddit sits in the weights and gets retrieved live. When someone asks an LLM "best CRM for a small agency," the engine is not inventing an opinion. It is compressing a decade of r/msp and r/sales threads into a recommendation, then citing the thread that most cleanly matches the question.
Reddit's dominance is real but wildly uneven by engine, and treating "Reddit gets cited" as one number will misdirect your effort. On aggregate it is the single most-cited domain in AI, at roughly 40.1% citation frequency across engines per Semrush. But the per-engine split is where the strategy lives: Perplexity is Reddit-first, SearchGPT and Google AI Mode are Reddit-heavy, and Gemini is essentially Reddit-blind.
The table below draws on Semrush's 248,000-URL study (refreshed October 2025) and Tinuiti's Q1 2026 AI Citations Trends report across seven engines and nine commercial categories.
| Engine | Reddit's role | Reddit citation weight |
|---|---|---|
| Perplexity | Reddit-first, highest prominence | ~24% of all citations (Tinuiti, Jan 2026) |
| SearchGPT / ChatGPT | Reddit-heavy, volatile | ~13% of responses (Semrush); >5% (Tinuiti) |
| Google AI Mode | Reddit-heavy retrieval | ~9% of responses (Semrush) |
| Google AI Overviews | Social-citation anchor | 44% of AIO social citations (Tinuiti) |
| Gemini | Nearly Reddit-blind | ~0.1% of citations (Tinuiti) |
The operator read: if your category lives on Perplexity and SearchGPT, Reddit is your highest-leverage surface. If your buyers use Gemini, Reddit barely moves the needle and you invest elsewhere.
Because LLMs are building a durable knowledge base, not chasing the front page, and age is a feature, not a bug. Semrush's analysis of 248K cited Reddit URLs found the average cited post is around 900 days old, roughly 2.5 years, with 4% of cited posts dating to 2019 or earlier. A thread that has sat stably answering the same question for two years reads to a model as settled consensus.
This inverts the organic-Reddit playbook. Upvote velocity, the thing that wins the hot algorithm and the thing operators pay for, is close to irrelevant here. Semrush found cited posts had a median of just 5 to 8 upvotes, and 80% had fewer than 20. What predicts citation is topical fit: cited posts scored 0.53 to 0.54 semantic similarity to the AI answer, versus 0.04 to 0.05 between the user's prompt and the post. The model is not asking "is this popular." It is asking "does this passage answer the question," then reaching for the clearest match regardless of score.
No, and this is the single most expensive misconception in Reddit GEO. Upvotes are a ranking signal inside Reddit and a weak-to-nonexistent citation signal for LLMs. RockSalt's factor testing and Semrush's study agree: thread age, votes, and comment counts were weak predictors of whether an engine cites a post. A low-vote thread that nails intent and is cleanly indexable can beat a heavily upvoted one.
What upvotes still do is upstream. Velocity gets a post to survive its first hour, hold a spot in the subreddit, and accumulate the replies that make a thread substantive enough to be worth retrieving two years later. So upvotes matter for durability, not for citation. The mistake is treating them as the finish line. The finish line is a well-worded, on-topic answer sitting in an indexable thread that the model can extract months from now. That is why we frame Reddit AI visibility as seeding, not a velocity buy.
Q&A and comparison threads, overwhelmingly. In the Semrush study, question-and-answer threads made up more than 50% of all cited Reddit content, with comparison posts second and discussion threads third, together nearly three-quarters of citations. That is not a coincidence. A product or category query is itself a question, and the retrieval system reaches for the passage shaped most like the answer.
The cited unit is also small. Median cited post length was about 80 words, which means the model is extracting a tight, specific reply, not a 2,000-word effortpost. Discovered Labs reached the same conclusion on content type: threads that name specific tools, state a clear preference, and give a concrete reason outperform vague discussion. The practical target is a comment that reads like a good answer on its own, in a thread titled like the query you want to win. "Best [category] for [use case]" as a thread title, with named, reasoned replies underneath, is the shape LLMs reward.
:::
Play the evergreen game: get a specific, well-reasoned, on-topic answer into an indexable thread and let it age. The winning asset is a comment that names your product with a concrete reason, sitting under a question-shaped title, in a subreddit an engine trusts. Because citation follows topical fit rather than votes, one genuinely useful reply outperforms a bought upvote spike every time.
The sequencing that works, drawn from what we see get retrieved:
Target the query, not the karma. Find or start threads titled like the product and category questions you want to win ("best X for Y," "X vs Z for [use case]"). These are >50% of citations.
Answer in ~80 to 150 words with named specifics. Say the product, the use case, and the reason. The extractable passage is what gets cited.
Use a credible account. Low-trust and shadowbanned accounts get filtered before their comment ever becomes indexable. This is where aged accounts and a warmed comment layer matter.
Give it time. The median cited thread is 2.5 years old. Seed now for answers that surface across a retraining and reindexing cycle, not next week.
Reddit is one node in a larger citation graph, and over-indexing on it is its own trap. It barely registers on Gemini, and on Perplexity it competes with primary sources. Map it against the other surfaces the way our 50-domains analysis does, treat editorial brand mentions as the backbone, and use Reddit for the buyer-in-the-wild layer it uniquely owns. For the Perplexity-specific version of this, the Reddit-first Perplexity playbook goes deeper.
Yes, heavily. Reddit is the single most-cited domain across AI engines at roughly 40% aggregate citation frequency, and ChatGPT/SearchGPT cite it in around 13% of responses per Semrush, or over 5% of all citations per Tinuiti's January 2026 data. For product and category queries specifically, Reddit is where the model finds real buyers comparing tools, which is why Q&A and comparison threads make up more than half of cited Reddit content. The catch is that the cited thread is usually old and low-vote, not a viral post.
Barely. Semrush found cited posts had a median of just 5 to 8 upvotes and 80% had fewer than 20, and vote counts were weak predictors of citation. What predicts citation is topical alignment: how closely the passage answers the question. Upvotes still matter upstream, because velocity helps a post survive and accumulate substance, but buying upvotes to force an AI citation misreads the mechanism. A clear, on-topic answer in an indexable thread beats a high-vote thread that does not match the query.
Perplexity. Tinuiti's Q1 2026 report put Reddit at around 24% of all Perplexity citations in January 2026, the highest prominence of any engine. SearchGPT (~13% of responses) and Google AI Mode (~9%) are also Reddit-heavy. At the other extreme, Gemini cited Reddit in only about 0.1% of responses, so if your buyers use Gemini, Reddit is nearly irrelevant and you should invest in Google-ecosystem entity signals instead.
Because LLMs treat Reddit as a durable knowledge base, not a news feed. The average cited Reddit post is around 900 days old, and 4% predate 2019. A thread that has stably answered the same question for a couple of years reads as settled consensus, which is exactly what a model wants when recommending a product. This is why Reddit AI visibility is a seeding-and-aging game: you plant a well-worded answer now and it surfaces across future reindexing cycles.
No. LLMs do not filter for sentiment. Positive and negative brand mentions on Reddit get cited at nearly identical rates, with a slight lean toward negative experience reports. So an unaddressed complaint thread can become the passage a model extracts about you. Getting cited and being described well are two different outcomes, which is why the seeding layer and an active, credible presence in the threads that rank for your category both matter.
:::
Reddit is the single most-cited domain in AI search, but LLMs pull old, low-vote threads, not viral posts. How they really use Reddit for product queries.
Reddit is the most-cited domain in AI search, and the instinct that follows is almost always wrong. Operators read "Reddit gets cited more than any news outlet" and go chase a viral thread with 10,000 upvotes, or worse, pay for one. The data says LLMs are doing something close to the opposite: the average cited Reddit post is roughly 900 days old, carries a median of 5 to 8 upvotes, and runs about 80 words. AI engines are not surfacing Reddit's front page. They are mining its long tail for a specific person answering a specific product question in plain language, and that is a very different thing to optimize for.
This is the operator read on how ChatGPT, Perplexity, and Google actually pull Reddit for product and category queries, which engines lean on it hardest, and why the thread that gets you cited looks nothing like the thread that goes viral. Signals runs an aged Reddit account marketplace plus an editorial network for AI brand mentions across Reddit, Quora, Product Hunt, and Threads, so we watch both sides of this: which Reddit content the engines retrieve, and what it takes to be the account credibly present in it. If you only take one idea from this piece, make it that Reddit visibility in AI is an evergreen-seeding game, not a velocity game.
Key takeaways
Reddit is the #1 most-cited domain across AI engines at roughly 40% aggregate citation frequency, ahead of YouTube and every major news outlet combined, per Semrush.
LLMs favor old, low-engagement threads: the average cited Reddit post is ~900 days old with a median of 5 to 8 upvotes, and 80% of cited posts had fewer than 20 upvotes, per Semrush's 248K-post study.
Topical alignment beats engagement. Cited posts showed 0.53 to 0.54 semantic similarity to the AI answer; upvotes and comment counts were weak predictors of citation.
Reddit's weight is engine-specific: it appears in ~13% of SearchGPT responses and ~9% of Google AI Mode, but is most prominent on Perplexity, where Tinuiti put Reddit at ~24% of all January 2026 citations. Gemini cited Reddit in just 0.1%.
Q&A and comparison threads make up nearly three-quarters of cited Reddit content, which is exactly the format a product or category query maps to.
Through two mechanisms operators keep blurring: training and live retrieval. Reddit is baked into the models through licensing deals, and it is pulled fresh at answer time through search. For a product query, the retrieval half usually does the visible work, and it reaches for Reddit because that is where real buyers argue about real tools in words the model can extract cleanly.
The training layer is contractual, not accidental. Google signed a $60M-per-year licensing deal with Reddit in February 2024, and OpenAI signed its own partnership to bring Reddit content into ChatGPT. So Reddit sits in the weights and gets retrieved live. When someone asks an LLM "best CRM for a small agency," the engine is not inventing an opinion. It is compressing a decade of r/msp and r/sales threads into a recommendation, then citing the thread that most cleanly matches the question.
Reddit's dominance is real but wildly uneven by engine, and treating "Reddit gets cited" as one number will misdirect your effort. On aggregate it is the single most-cited domain in AI, at roughly 40.1% citation frequency across engines per Semrush. But the per-engine split is where the strategy lives: Perplexity is Reddit-first, SearchGPT and Google AI Mode are Reddit-heavy, and Gemini is essentially Reddit-blind.
The table below draws on Semrush's 248,000-URL study (refreshed October 2025) and Tinuiti's Q1 2026 AI Citations Trends report across seven engines and nine commercial categories.
| Engine | Reddit's role | Reddit citation weight |
|---|---|---|
| Perplexity | Reddit-first, highest prominence | ~24% of all citations (Tinuiti, Jan 2026) |
| SearchGPT / ChatGPT | Reddit-heavy, volatile | ~13% of responses (Semrush); >5% (Tinuiti) |
| Google AI Mode | Reddit-heavy retrieval | ~9% of responses (Semrush) |
| Google AI Overviews | Social-citation anchor | 44% of AIO social citations (Tinuiti) |
| Gemini | Nearly Reddit-blind | ~0.1% of citations (Tinuiti) |
The operator read: if your category lives on Perplexity and SearchGPT, Reddit is your highest-leverage surface. If your buyers use Gemini, Reddit barely moves the needle and you invest elsewhere.
Because LLMs are building a durable knowledge base, not chasing the front page, and age is a feature, not a bug. Semrush's analysis of 248K cited Reddit URLs found the average cited post is around 900 days old, roughly 2.5 years, with 4% of cited posts dating to 2019 or earlier. A thread that has sat stably answering the same question for two years reads to a model as settled consensus.
This inverts the organic-Reddit playbook. Upvote velocity, the thing that wins the hot algorithm and the thing operators pay for, is close to irrelevant here. Semrush found cited posts had a median of just 5 to 8 upvotes, and 80% had fewer than 20. What predicts citation is topical fit: cited posts scored 0.53 to 0.54 semantic similarity to the AI answer, versus 0.04 to 0.05 between the user's prompt and the post. The model is not asking "is this popular." It is asking "does this passage answer the question," then reaching for the clearest match regardless of score.
No, and this is the single most expensive misconception in Reddit GEO. Upvotes are a ranking signal inside Reddit and a weak-to-nonexistent citation signal for LLMs. RockSalt's factor testing and Semrush's study agree: thread age, votes, and comment counts were weak predictors of whether an engine cites a post. A low-vote thread that nails intent and is cleanly indexable can beat a heavily upvoted one.
What upvotes still do is upstream. Velocity gets a post to survive its first hour, hold a spot in the subreddit, and accumulate the replies that make a thread substantive enough to be worth retrieving two years later. So upvotes matter for durability, not for citation. The mistake is treating them as the finish line. The finish line is a well-worded, on-topic answer sitting in an indexable thread that the model can extract months from now. That is why we frame Reddit AI visibility as seeding, not a velocity buy.
A word of honesty about sentiment: LLMs do not filter Reddit for constructive feedback. Semrush-adjacent analysis found positive (5%) and negative (6.1%) brand mentions get cited at nearly the same rate, with a slight tilt toward negative experience reports. Getting cited is not the same as being described well. If the loudest indexable thread about your product is a complaint, that is the passage the model extracts.
Q&A and comparison threads, overwhelmingly. In the Semrush study, question-and-answer threads made up more than 50% of all cited Reddit content, with comparison posts second and discussion threads third, together nearly three-quarters of citations. That is not a coincidence. A product or category query is itself a question, and the retrieval system reaches for the passage shaped most like the answer.
The cited unit is also small. Median cited post length was about 80 words, which means the model is extracting a tight, specific reply, not a 2,000-word effortpost. Discovered Labs reached the same conclusion on content type: threads that name specific tools, state a clear preference, and give a concrete reason outperform vague discussion. The practical target is a comment that reads like a good answer on its own, in a thread titled like the query you want to win. "Best [category] for [use case]" as a thread title, with named, reasoned replies underneath, is the shape LLMs reward.
:::
Play the evergreen game: get a specific, well-reasoned, on-topic answer into an indexable thread and let it age. The winning asset is a comment that names your product with a concrete reason, sitting under a question-shaped title, in a subreddit an engine trusts. Because citation follows topical fit rather than votes, one genuinely useful reply outperforms a bought upvote spike every time.
The sequencing that works, drawn from what we see get retrieved:
Target the query, not the karma. Find or start threads titled like the product and category questions you want to win ("best X for Y," "X vs Z for [use case]"). These are >50% of citations.
Answer in ~80 to 150 words with named specifics. Say the product, the use case, and the reason. The extractable passage is what gets cited.
Use a credible account. Low-trust and shadowbanned accounts get filtered before their comment ever becomes indexable. This is where aged accounts and a warmed comment layer matter.
Give it time. The median cited thread is 2.5 years old. Seed now for answers that surface across a retraining and reindexing cycle, not next week.
Reddit is one node in a larger citation graph, and over-indexing on it is its own trap. It barely registers on Gemini, and on Perplexity it competes with primary sources. Map it against the other surfaces the way our 50-domains analysis does, treat editorial brand mentions as the backbone, and use Reddit for the buyer-in-the-wild layer it uniquely owns. For the Perplexity-specific version of this, the Reddit-first Perplexity playbook goes deeper.
Yes, heavily. Reddit is the single most-cited domain across AI engines at roughly 40% aggregate citation frequency, and ChatGPT/SearchGPT cite it in around 13% of responses per Semrush, or over 5% of all citations per Tinuiti's January 2026 data. For product and category queries specifically, Reddit is where the model finds real buyers comparing tools, which is why Q&A and comparison threads make up more than half of cited Reddit content. The catch is that the cited thread is usually old and low-vote, not a viral post.
Barely. Semrush found cited posts had a median of just 5 to 8 upvotes and 80% had fewer than 20, and vote counts were weak predictors of citation. What predicts citation is topical alignment: how closely the passage answers the question. Upvotes still matter upstream, because velocity helps a post survive and accumulate substance, but buying upvotes to force an AI citation misreads the mechanism. A clear, on-topic answer in an indexable thread beats a high-vote thread that does not match the query.
Perplexity. Tinuiti's Q1 2026 report put Reddit at around 24% of all Perplexity citations in January 2026, the highest prominence of any engine. SearchGPT (~13% of responses) and Google AI Mode (~9%) are also Reddit-heavy. At the other extreme, Gemini cited Reddit in only about 0.1% of responses, so if your buyers use Gemini, Reddit is nearly irrelevant and you should invest in Google-ecosystem entity signals instead.
Because LLMs treat Reddit as a durable knowledge base, not a news feed. The average cited Reddit post is around 900 days old, and 4% predate 2019. A thread that has stably answered the same question for a couple of years reads as settled consensus, which is exactly what a model wants when recommending a product. This is why Reddit AI visibility is a seeding-and-aging game: you plant a well-worded answer now and it surfaces across future reindexing cycles.
No. LLMs do not filter for sentiment. Positive and negative brand mentions on Reddit get cited at nearly identical rates, with a slight lean toward negative experience reports. So an unaddressed complaint thread can become the passage a model extracts about you. Getting cited and being described well are two different outcomes, which is why the seeding layer and an active, credible presence in the threads that rank for your category both matter.
:::
LLMs cite Reddit because it reads as independent, real-buyer opinion, and the same logic drives every other surface they trust. Signals' editorial network earns named-author brand mentions across a 20,000-plus site footprint, seeding the durable, on-topic coverage that gets your brand into AI answers instead of leaving it to whatever thread happens to rank.
Sources