marketing

How LLMs use Reddit for product and category queries

Reddit is the single most-cited domain in AI search, but LLMs pull old, low-vote threads, not viral posts. How they really use Reddit for product queries.

Updated July 2, 202611 min read

How LLMs use Reddit for product and category queries

Reddit is the most-cited domain in AI search, and the instinct that follows is almost always wrong. Operators read "Reddit gets cited more than any news outlet" and go chase a viral thread with 10,000 upvotes, or worse, pay for one. The data says LLMs are doing something close to the opposite: the average cited Reddit post is roughly 900 days old, carries a median of 5 to 8 upvotes, and runs about 80 words. AI engines are not surfacing Reddit's front page. They are mining its long tail for a specific person answering a specific product question in plain language, and that is a very different thing to optimize for.

This is the operator read on how ChatGPT, Perplexity, and Google actually pull Reddit for product and category queries, which engines lean on it hardest, and why the thread that gets you cited looks nothing like the thread that goes viral. Signals runs an aged Reddit account marketplace plus an editorial network for AI brand mentions across Reddit, Quora, Product Hunt, and Threads, so we watch both sides of this: which Reddit content the engines retrieve, and what it takes to be the account credibly present in it. If you only take one idea from this piece, make it that Reddit visibility in AI is an evergreen-seeding game, not a velocity game.

Key takeaways

Reddit is the #1 most-cited domain across AI engines at roughly 40% aggregate citation frequency, ahead of YouTube and every major news outlet combined, per Semrush.
LLMs favor old, low-engagement threads: the average cited Reddit post is ~900 days old with a median of 5 to 8 upvotes, and 80% of cited posts had fewer than 20 upvotes, per Semrush's 248K-post study.
Topical alignment beats engagement. Cited posts showed 0.53 to 0.54 semantic similarity to the AI answer; upvotes and comment counts were weak predictors of citation.
Reddit's weight is engine-specific: it appears in ~13% of SearchGPT responses and ~9% of Google AI Mode, but is most prominent on Perplexity, where Tinuiti put Reddit at ~24% of all January 2026 citations. Gemini cited Reddit in just 0.1%.
Q&A and comparison threads make up nearly three-quarters of cited Reddit content, which is exactly the format a product or category query maps to.

How do LLMs actually use Reddit for product queries?

Through two mechanisms operators keep blurring: training and live retrieval. Reddit is baked into the models through licensing deals, and it is pulled fresh at answer time through search. For a product query, the retrieval half usually does the visible work, and it reaches for Reddit because that is where real buyers argue about real tools in words the model can extract cleanly.

The training layer is contractual, not accidental. Google signed a $60M-per-year licensing deal with Reddit in February 2024, and OpenAI signed its own partnership to bring Reddit content into ChatGPT. So Reddit sits in the weights and gets retrieved live. When someone asks an LLM "best CRM for a small agency," the engine is not inventing an opinion. It is compressing a decade of r/msp and r/sales threads into a recommendation, then citing the thread that most cleanly matches the question.

Which AI engines cite Reddit the most?

Reddit's dominance is real but wildly uneven by engine, and treating "Reddit gets cited" as one number will misdirect your effort. On aggregate it is the single most-cited domain in AI, at roughly 40.1% citation frequency across engines per Semrush. But the per-engine split is where the strategy lives: Perplexity is Reddit-first, SearchGPT and Google AI Mode are Reddit-heavy, and Gemini is essentially Reddit-blind.

The table below draws on Semrush's 248,000-URL study (refreshed October 2025) and Tinuiti's Q1 2026 AI Citations Trends report across seven engines and nine commercial categories.

Engine	Reddit's role	Reddit citation weight
Perplexity	Reddit-first, highest prominence	~24% of all citations (Tinuiti, Jan 2026)
SearchGPT / ChatGPT	Reddit-heavy, volatile	~13% of responses (Semrush); >5% (Tinuiti)
Google AI Mode	Reddit-heavy retrieval	~9% of responses (Semrush)
Google AI Overviews	Social-citation anchor	44% of AIO social citations (Tinuiti)
Gemini	Nearly Reddit-blind	~0.1% of citations (Tinuiti)

The operator read: if your category lives on Perplexity and SearchGPT, Reddit is your highest-leverage surface. If your buyers use Gemini, Reddit barely moves the needle and you invest elsewhere.

Why do old, low-vote Reddit threads get cited?

Because LLMs are building a durable knowledge base, not chasing the front page, and age is a feature, not a bug. Semrush's analysis of 248K cited Reddit URLs found the average cited post is around 900 days old, roughly 2.5 years, with 4% of cited posts dating to 2019 or earlier. A thread that has sat stably answering the same question for two years reads to a model as settled consensus.

This inverts the organic-Reddit playbook. Upvote velocity, the thing that wins the hot algorithm and the thing operators pay for, is close to irrelevant here. Semrush found cited posts had a median of just 5 to 8 upvotes, and 80% had fewer than 20. What predicts citation is topical fit: cited posts scored 0.53 to 0.54 semantic similarity to the AI answer, versus 0.04 to 0.05 between the user's prompt and the post. The model is not asking "is this popular." It is asking "does this passage answer the question," then reaching for the clearest match regardless of score.

Does buying upvotes help you get cited by AI?

No, and this is the single most expensive misconception in Reddit GEO. Upvotes are a ranking signal inside Reddit and a weak-to-nonexistent citation signal for LLMs. RockSalt's factor testing and Semrush's study agree: thread age, votes, and comment counts were weak predictors of whether an engine cites a post. A low-vote thread that nails intent and is cleanly indexable can beat a heavily upvoted one.

What upvotes still do is upstream. Velocity gets a post to survive its first hour, hold a spot in the subreddit, and accumulate the replies that make a thread substantive enough to be worth retrieving two years later. So upvotes matter for durability, not for citation. The mistake is treating them as the finish line. The finish line is a well-worded, on-topic answer sitting in an indexable thread that the model can extract months from now. That is why we frame Reddit AI visibility as seeding, not a velocity buy.

What kind of Reddit content actually gets cited?

Q&A and comparison threads, overwhelmingly. In the Semrush study, question-and-answer threads made up more than 50% of all cited Reddit content, with comparison posts second and discussion threads third, together nearly three-quarters of citations. That is not a coincidence. A product or category query is itself a question, and the retrieval system reaches for the passage shaped most like the answer.

The cited unit is also small. Median cited post length was about 80 words, which means the model is extracting a tight, specific reply, not a 2,000-word effortpost. Discovered Labs reached the same conclusion on content type: threads that name specific tools, state a clear preference, and give a concrete reason outperform vague discussion. The practical target is a comment that reads like a good answer on its own, in a thread titled like the query you want to win. "Best [category] for [use case]" as a thread title, with named, reasoned replies underneath, is the shape LLMs reward.

~900 days

Average age of a cited Reddit post. AI mines the long tail, not the front page.

Source

5–8

Median upvotes on a cited post. 80% had fewer than 20.

Source

>50%

Share of cited Reddit content that is Q&A threads.

Source

:::

How do you seed Reddit so LLMs cite you?

Play the evergreen game: get a specific, well-reasoned, on-topic answer into an indexable thread and let it age. The winning asset is a comment that names your product with a concrete reason, sitting under a question-shaped title, in a subreddit an engine trusts. Because citation follows topical fit rather than votes, one genuinely useful reply outperforms a bought upvote spike every time.

The sequencing that works, drawn from what we see get retrieved:

Target the query, not the karma. Find or start threads titled like the product and category questions you want to win ("best X for Y," "X vs Z for [use case]"). These are >50% of citations.
Answer in ~80 to 150 words with named specifics. Say the product, the use case, and the reason. The extractable passage is what gets cited.
Use a credible account. Low-trust and shadowbanned accounts get filtered before their comment ever becomes indexable. This is where aged accounts and a warmed comment layer matter.
Give it time. The median cited thread is 2.5 years old. Seed now for answers that surface across a retraining and reindexing cycle, not next week.

Reddit is one node in a larger citation graph, and over-indexing on it is its own trap. It barely registers on Gemini, and on Perplexity it competes with primary sources. Map it against the other surfaces the way our 50-domains analysis does, treat editorial brand mentions as the backbone, and use Reddit for the buyer-in-the-wild layer it uniquely owns. For the Perplexity-specific version of this, the Reddit-first Perplexity playbook goes deeper.

Frequently asked questions

Does ChatGPT actually cite Reddit for product recommendations?

Yes, heavily. Reddit is the single most-cited domain across AI engines at roughly 40% aggregate citation frequency, and ChatGPT/SearchGPT cite it in around 13% of responses per Semrush, or over 5% of all citations per Tinuiti's January 2026 data. For product and category queries specifically, Reddit is where the model finds real buyers comparing tools, which is why Q&A and comparison threads make up more than half of cited Reddit content. The catch is that the cited thread is usually old and low-vote, not a viral post.

Do upvotes help a Reddit post get cited by AI?

Barely. Semrush found cited posts had a median of just 5 to 8 upvotes and 80% had fewer than 20, and vote counts were weak predictors of citation. What predicts citation is topical alignment: how closely the passage answers the question. Upvotes still matter upstream, because velocity helps a post survive and accumulate substance, but buying upvotes to force an AI citation misreads the mechanism. A clear, on-topic answer in an indexable thread beats a high-vote thread that does not match the query.

Which AI engine relies on Reddit the most?

Perplexity. Tinuiti's Q1 2026 report put Reddit at around 24% of all Perplexity citations in January 2026, the highest prominence of any engine. SearchGPT (~13% of responses) and Google AI Mode (~9%) are also Reddit-heavy. At the other extreme, Gemini cited Reddit in only about 0.1% of responses, so if your buyers use Gemini, Reddit is nearly irrelevant and you should invest in Google-ecosystem entity signals instead.

Why does AI cite years-old Reddit threads instead of new ones?

Because LLMs treat Reddit as a durable knowledge base, not a news feed. The average cited Reddit post is around 900 days old, and 4% predate 2019. A thread that has stably answered the same question for a couple of years reads as settled consensus, which is exactly what a model wants when recommending a product. This is why Reddit AI visibility is a seeding-and-aging game: you plant a well-worded answer now and it surfaces across future reindexing cycles.

Is getting cited on Reddit always good for my brand?

No. LLMs do not filter for sentiment. Positive and negative brand mentions on Reddit get cited at nearly identical rates, with a slight lean toward negative experience reports. So an unaddressed complaint thread can become the passage a model extracts about you. Getting cited and being described well are two different outcomes, which is why the seeding layer and an active, credible presence in the threads that rank for your category both matter.

:::

marketing

How LLMs use Reddit for product and category queries

Reddit is the single most-cited domain in AI search, but LLMs pull old, low-vote threads, not viral posts. How they really use Reddit for product queries.

Updated July 2, 202611 min read

Key takeaways

Reddit is the #1 most-cited domain across AI engines at roughly 40% aggregate citation frequency, ahead of YouTube and every major news outlet combined, per Semrush.
LLMs favor old, low-engagement threads: the average cited Reddit post is ~900 days old with a median of 5 to 8 upvotes, and 80% of cited posts had fewer than 20 upvotes, per Semrush's 248K-post study.
Topical alignment beats engagement. Cited posts showed 0.53 to 0.54 semantic similarity to the AI answer; upvotes and comment counts were weak predictors of citation.
Reddit's weight is engine-specific: it appears in ~13% of SearchGPT responses and ~9% of Google AI Mode, but is most prominent on Perplexity, where Tinuiti put Reddit at ~24% of all January 2026 citations. Gemini cited Reddit in just 0.1%.
Q&A and comparison threads make up nearly three-quarters of cited Reddit content, which is exactly the format a product or category query maps to.

How do LLMs actually use Reddit for product queries?

Which AI engines cite Reddit the most?

The table below draws on Semrush's 248,000-URL study (refreshed October 2025) and Tinuiti's Q1 2026 AI Citations Trends report across seven engines and nine commercial categories.

Engine	Reddit's role	Reddit citation weight
Perplexity	Reddit-first, highest prominence	~24% of all citations (Tinuiti, Jan 2026)
SearchGPT / ChatGPT	Reddit-heavy, volatile	~13% of responses (Semrush); >5% (Tinuiti)
Google AI Mode	Reddit-heavy retrieval	~9% of responses (Semrush)
Google AI Overviews	Social-citation anchor	44% of AIO social citations (Tinuiti)
Gemini	Nearly Reddit-blind	~0.1% of citations (Tinuiti)

The operator read: if your category lives on Perplexity and SearchGPT, Reddit is your highest-leverage surface. If your buyers use Gemini, Reddit barely moves the needle and you invest elsewhere.

Why do old, low-vote Reddit threads get cited?

Does buying upvotes help you get cited by AI?

What kind of Reddit content actually gets cited?

~900 days

Average age of a cited Reddit post. AI mines the long tail, not the front page.

Source

5–8

Median upvotes on a cited post. 80% had fewer than 20.

Source

>50%

Share of cited Reddit content that is Q&A threads.

Source

:::

How do you seed Reddit so LLMs cite you?

The sequencing that works, drawn from what we see get retrieved:

Target the query, not the karma. Find or start threads titled like the product and category questions you want to win ("best X for Y," "X vs Z for [use case]"). These are >50% of citations.
Answer in ~80 to 150 words with named specifics. Say the product, the use case, and the reason. The extractable passage is what gets cited.
Use a credible account. Low-trust and shadowbanned accounts get filtered before their comment ever becomes indexable. This is where aged accounts and a warmed comment layer matter.
Give it time. The median cited thread is 2.5 years old. Seed now for answers that surface across a retraining and reindexing cycle, not next week.

Frequently asked questions

Does ChatGPT actually cite Reddit for product recommendations?

Do upvotes help a Reddit post get cited by AI?

Which AI engine relies on Reddit the most?

Why does AI cite years-old Reddit threads instead of new ones?

Is getting cited on Reddit always good for my brand?

:::

marketing

How LLMs use Reddit for product and category queries

Reddit is the single most-cited domain in AI search, but LLMs pull old, low-vote threads, not viral posts. How they really use Reddit for product queries.

Updated July 2, 202611 min read

Key takeaways

Reddit is the #1 most-cited domain across AI engines at roughly 40% aggregate citation frequency, ahead of YouTube and every major news outlet combined, per Semrush.
LLMs favor old, low-engagement threads: the average cited Reddit post is ~900 days old with a median of 5 to 8 upvotes, and 80% of cited posts had fewer than 20 upvotes, per Semrush's 248K-post study.
Topical alignment beats engagement. Cited posts showed 0.53 to 0.54 semantic similarity to the AI answer; upvotes and comment counts were weak predictors of citation.
Reddit's weight is engine-specific: it appears in ~13% of SearchGPT responses and ~9% of Google AI Mode, but is most prominent on Perplexity, where Tinuiti put Reddit at ~24% of all January 2026 citations. Gemini cited Reddit in just 0.1%.
Q&A and comparison threads make up nearly three-quarters of cited Reddit content, which is exactly the format a product or category query maps to.

How do LLMs actually use Reddit for product queries?

Which AI engines cite Reddit the most?

The table below draws on Semrush's 248,000-URL study (refreshed October 2025) and Tinuiti's Q1 2026 AI Citations Trends report across seven engines and nine commercial categories.

Engine	Reddit's role	Reddit citation weight
Perplexity	Reddit-first, highest prominence	~24% of all citations (Tinuiti, Jan 2026)
SearchGPT / ChatGPT	Reddit-heavy, volatile	~13% of responses (Semrush); >5% (Tinuiti)
Google AI Mode	Reddit-heavy retrieval	~9% of responses (Semrush)
Google AI Overviews	Social-citation anchor	44% of AIO social citations (Tinuiti)
Gemini	Nearly Reddit-blind	~0.1% of citations (Tinuiti)

The operator read: if your category lives on Perplexity and SearchGPT, Reddit is your highest-leverage surface. If your buyers use Gemini, Reddit barely moves the needle and you invest elsewhere.

Why do old, low-vote Reddit threads get cited?

Does buying upvotes help you get cited by AI?

What kind of Reddit content actually gets cited?

~900 days

Average age of a cited Reddit post. AI mines the long tail, not the front page.

Source

5–8

Median upvotes on a cited post. 80% had fewer than 20.

Source

>50%

Share of cited Reddit content that is Q&A threads.

Source

:::

How do you seed Reddit so LLMs cite you?

The sequencing that works, drawn from what we see get retrieved:

Target the query, not the karma. Find or start threads titled like the product and category questions you want to win ("best X for Y," "X vs Z for [use case]"). These are >50% of citations.
Answer in ~80 to 150 words with named specifics. Say the product, the use case, and the reason. The extractable passage is what gets cited.
Use a credible account. Low-trust and shadowbanned accounts get filtered before their comment ever becomes indexable. This is where aged accounts and a warmed comment layer matter.
Give it time. The median cited thread is 2.5 years old. Seed now for answers that surface across a retraining and reindexing cycle, not next week.

Frequently asked questions

Does ChatGPT actually cite Reddit for product recommendations?

Do upvotes help a Reddit post get cited by AI?

Which AI engine relies on Reddit the most?

Why does AI cite years-old Reddit threads instead of new ones?

Is getting cited on Reddit always good for my brand?

:::

marketing

How LLMs use Reddit for product and category queries

Reddit is the single most-cited domain in AI search, but LLMs pull old, low-vote threads, not viral posts. How they really use Reddit for product queries.

Updated July 2, 202611 min read

Key takeaways

Reddit is the #1 most-cited domain across AI engines at roughly 40% aggregate citation frequency, ahead of YouTube and every major news outlet combined, per Semrush.
LLMs favor old, low-engagement threads: the average cited Reddit post is ~900 days old with a median of 5 to 8 upvotes, and 80% of cited posts had fewer than 20 upvotes, per Semrush's 248K-post study.
Topical alignment beats engagement. Cited posts showed 0.53 to 0.54 semantic similarity to the AI answer; upvotes and comment counts were weak predictors of citation.
Reddit's weight is engine-specific: it appears in ~13% of SearchGPT responses and ~9% of Google AI Mode, but is most prominent on Perplexity, where Tinuiti put Reddit at ~24% of all January 2026 citations. Gemini cited Reddit in just 0.1%.
Q&A and comparison threads make up nearly three-quarters of cited Reddit content, which is exactly the format a product or category query maps to.

How do LLMs actually use Reddit for product queries?

Which AI engines cite Reddit the most?

The table below draws on Semrush's 248,000-URL study (refreshed October 2025) and Tinuiti's Q1 2026 AI Citations Trends report across seven engines and nine commercial categories.

Engine	Reddit's role	Reddit citation weight
Perplexity	Reddit-first, highest prominence	~24% of all citations (Tinuiti, Jan 2026)
SearchGPT / ChatGPT	Reddit-heavy, volatile	~13% of responses (Semrush); >5% (Tinuiti)
Google AI Mode	Reddit-heavy retrieval	~9% of responses (Semrush)
Google AI Overviews	Social-citation anchor	44% of AIO social citations (Tinuiti)
Gemini	Nearly Reddit-blind	~0.1% of citations (Tinuiti)

The operator read: if your category lives on Perplexity and SearchGPT, Reddit is your highest-leverage surface. If your buyers use Gemini, Reddit barely moves the needle and you invest elsewhere.

Why do old, low-vote Reddit threads get cited?

Does buying upvotes help you get cited by AI?

What kind of Reddit content actually gets cited?

~900 days

Average age of a cited Reddit post. AI mines the long tail, not the front page.

Source

5–8

Median upvotes on a cited post. 80% had fewer than 20.

Source

>50%

Share of cited Reddit content that is Q&A threads.

Source

:::

How do you seed Reddit so LLMs cite you?

The sequencing that works, drawn from what we see get retrieved:

Target the query, not the karma. Find or start threads titled like the product and category questions you want to win ("best X for Y," "X vs Z for [use case]"). These are >50% of citations.
Answer in ~80 to 150 words with named specifics. Say the product, the use case, and the reason. The extractable passage is what gets cited.
Use a credible account. Low-trust and shadowbanned accounts get filtered before their comment ever becomes indexable. This is where aged accounts and a warmed comment layer matter.
Give it time. The median cited thread is 2.5 years old. Seed now for answers that surface across a retraining and reindexing cycle, not next week.

Frequently asked questions

Does ChatGPT actually cite Reddit for product recommendations?

Do upvotes help a Reddit post get cited by AI?

Which AI engine relies on Reddit the most?

Why does AI cite years-old Reddit threads instead of new ones?

Is getting cited on Reddit always good for my brand?

:::

Review services & pricing

How LLMs use Reddit for product and category queries

How do LLMs actually use Reddit for product queries?

Which AI engines cite Reddit the most?

Why do old, low-vote Reddit threads get cited?

Does buying upvotes help you get cited by AI?

What kind of Reddit content actually gets cited?

How do you seed Reddit so LLMs cite you?

Frequently asked questions

Review services & pricing

How LLMs use Reddit for product and category queries

How do LLMs actually use Reddit for product queries?

Which AI engines cite Reddit the most?

Why do old, low-vote Reddit threads get cited?

Does buying upvotes help you get cited by AI?

What kind of Reddit content actually gets cited?

How do you seed Reddit so LLMs cite you?

Frequently asked questions

How LLMs use Reddit for product and category queries

How do LLMs actually use Reddit for product queries?

Which AI engines cite Reddit the most?

Why do old, low-vote Reddit threads get cited?

Does buying upvotes help you get cited by AI?

What kind of Reddit content actually gets cited?

How do you seed Reddit so LLMs cite you?

Frequently asked questions

How LLMs use Reddit for product and category queries

How do LLMs actually use Reddit for product queries?

Which AI engines cite Reddit the most?

Why do old, low-vote Reddit threads get cited?

Does buying upvotes help you get cited by AI?

What kind of Reddit content actually gets cited?

How do you seed Reddit so LLMs cite you?

Frequently asked questions

Review services & pricing

How LLMs use Reddit for product and category queries

How do LLMs actually use Reddit for product queries?

Which AI engines cite Reddit the most?

Why do old, low-vote Reddit threads get cited?

Does buying upvotes help you get cited by AI?

What kind of Reddit content actually gets cited?

How do you seed Reddit so LLMs cite you?

Frequently asked questions

Related posts

How to get a Wikipedia page for your company (notability path)

Why 120-180 word sections earn 70% more AI citations

Best x for y listicles: why they dominate AI citations

Review services & pricing

How LLMs use Reddit for product and category queries

How do LLMs actually use Reddit for product queries?

Which AI engines cite Reddit the most?

Why do old, low-vote Reddit threads get cited?

Does buying upvotes help you get cited by AI?

What kind of Reddit content actually gets cited?

How do you seed Reddit so LLMs cite you?

Frequently asked questions

Related posts

How to get a Wikipedia page for your company (notability path)

Why 120-180 word sections earn 70% more AI citations

Best x for y listicles: why they dominate AI citations

How LLMs use Reddit for product and category queries

How do LLMs actually use Reddit for product queries?

Which AI engines cite Reddit the most?

Why do old, low-vote Reddit threads get cited?

Does buying upvotes help you get cited by AI?

What kind of Reddit content actually gets cited?

How do you seed Reddit so LLMs cite you?

Frequently asked questions

Related posts

How to get a Wikipedia page for your company (notability path)

Why 120-180 word sections earn 70% more AI citations

Best x for y listicles: why they dominate AI citations

How LLMs use Reddit for product and category queries

How do LLMs actually use Reddit for product queries?

Which AI engines cite Reddit the most?

Why do old, low-vote Reddit threads get cited?

Does buying upvotes help you get cited by AI?

What kind of Reddit content actually gets cited?

How do you seed Reddit so LLMs cite you?

Frequently asked questions

Related posts

How to get a Wikipedia page for your company (notability path)

Why 120-180 word sections earn 70% more AI citations

Best x for y listicles: why they dominate AI citations