
OpenAI in Talks to Pay Reddit $70M for AI Training Data!

  • August 22, 2025
    Updated

Key Takeaways

  • OpenAI is estimated to be paying Reddit approximately $70 million per year under a content licensing agreement, which would be more than half of Reddit’s AI licensing revenue.
  • Reddit’s AI licensing deals account for about 10% of its total revenue (roughly $130 million per year), with Google reported to be paying $60 million and OpenAI inferred to be contributing the remaining $70 million.
  • Reddit’s stock price dropped 15% in after-hours trading following its earnings report, attributed in part to slower-than-expected user growth.
  • Google’s search algorithm changes have affected Reddit’s traffic and visibility, potentially impacting its future growth.
  • SEO experts have raised concerns that Google is favoring Reddit in search results, sometimes ranking Reddit discussions higher than original sources.

According to Search Engine Land, OpenAI is paying Reddit around $70 million annually for access to its user-generated discussions.

This estimate combines two figures: Reddit’s statement that AI licensing agreements contribute about 10% of its total revenue, and the $1.3 billion in 2024 revenue it reported. Together they put licensing revenue at roughly $130 million per year.

Since Google has been confirmed to be paying $60 million for a similar licensing deal, OpenAI is believed to be covering the remaining $70 million.

Search industry expert Glenn Gabe provided further insight into the financial breakdown:

“Reddit told Adweek that its AI licensing deals make up about 10% of its revenue. Annual revenue for Reddit in 2024 was $1.3B, so 10% is $130M.

Google pays Reddit $60M so that means OpenAI is paying Reddit about $70M per year. That number has never been revealed. Again, interesting.”

Neither OpenAI nor Reddit has officially confirmed the exact amount, but these calculations align with Reddit’s reported revenue figures.
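
For readers who want to check the math, the estimate can be reproduced in a few lines. This is only a sketch of the reasoning quoted above, not a confirmed figure: the variable names are illustrative, and it assumes that Google and OpenAI together account for all of Reddit’s AI licensing revenue, which neither company has confirmed.

```python
# Figures as quoted in the article. The OpenAI number is inferred, not confirmed.
annual_revenue_musd = 1300   # Reddit 2024 revenue in millions of USD ($1.3B)
licensing_share_pct = 10     # share of revenue attributed to AI licensing ("about 10%")
google_deal_musd = 60        # Google's reported annual payment in millions of USD

# Total AI licensing revenue implied by the 10% figure.
licensing_total_musd = annual_revenue_musd * licensing_share_pct // 100

# ASSUMPTION: Google and OpenAI are the only licensing partners,
# so whatever Google does not pay is attributed to OpenAI.
openai_estimate_musd = licensing_total_musd - google_deal_musd

print(f"Licensing total: ${licensing_total_musd}M per year")
print(f"Implied OpenAI payment: ${openai_estimate_musd}M per year")
```

If Reddit has other licensing partners, or if the 10% share is rounded, the implied OpenAI figure would shift accordingly.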

This licensing deal is significant because it represents a shift in how AI companies acquire training data.

Rather than relying on web scraping or publicly available content, OpenAI is entering formal agreements to legally source high-quality data for model training.

Reddit’s Value as an AI Training Dataset

Platforms like Reddit are valuable to AI companies because they contain real, organic conversations across a wide range of topics.

Unlike static web pages or structured datasets, Reddit discussions provide:

  • Unfiltered human interactions and opinions
  • Diverse sentence structures, slang, and colloquialisms
  • Discussions on niche topics that may not be available elsewhere

These attributes make Reddit’s data a goldmine for improving AI’s understanding of natural language and enhancing AI-generated responses in chatbots and search features.

From Scraping to Licensing: A Shift in AI Data Sourcing

Previously, AI companies like OpenAI and Google relied on web scraping to collect training data.

However, this practice has faced increasing scrutiny and legal challenges.

Major publishers, news organizations, and content creators have pushed back against AI firms using their content without compensation.

By entering paid licensing agreements, OpenAI and Google are taking a more legally sound approach to accessing high-quality data.

This also allows Reddit to monetize its content without relying solely on ad revenue.


Reddit’s Stock Drop and Impact of Google’s Algorithm Changes

Despite the boost in revenue from its AI licensing deals, Reddit’s stock dropped 15% in after-hours trading following its earnings report.

One of the possible reasons? Lower-than-expected user growth.

Because Reddit draws significant traffic from Google Search, the slowdown suggests that Google’s search algorithm adjustments may have reduced Reddit’s visibility and, with it, user engagement.

How Google’s Algorithm Affects Reddit

Historically, Reddit has benefited significantly from high rankings in Google Search.

Many users land on Reddit pages by searching for product reviews, discussions, or niche topics.

However, if Google tweaks its algorithm to lower Reddit’s ranking, fewer people may discover its content, leading to slower user growth and decreased ad revenue.

Reddit’s dependence on Google for organic traffic is a potential risk, even as it diversifies its revenue streams with AI licensing deals.


Google’s Favoritism Towards Reddit in Search Results Sparks SEO Concerns

At the same time, some in the SEO (search engine optimization) community argue that Google is actually favoring Reddit too much.

“Either way, [Reddit] was already facing backlash among SEOs because Google appears to be favoring the platform in the Discussions and forums SERP feature. Reddit is even outranking original sources of content in some instances.”

Reddit’s Increased Presence in Search Results

Google has recently prioritized “Discussions and forums” in search results, often pushing Reddit threads above other sources—including news articles and expert-written content.

This shift has raised concerns among publishers, bloggers, and independent websites that rely on Google for traffic.

If Reddit discussions rank higher than original, authoritative content, it could impact the visibility and revenue of these websites.

Google has previously said that its Helpful Content Update is designed to promote high-quality and informative content.

However, if forum discussions consistently outrank in-depth research articles, it could change the way information is surfaced on search engines.


What This Means for the Future of AI and Content Licensing

The OpenAI-Reddit deal signals a major change in how AI companies acquire and pay for content.

Instead of scraping data from the internet, firms like OpenAI and Google are now legally securing large-scale datasets through direct partnerships.

Key Questions

This shift raises several questions about the future of AI and online content:

  • Will other platforms follow Reddit’s lead?

More companies may start charging AI firms for content, especially as AI-generated responses become a larger part of search engines.

  • Should Reddit users be compensated?

Reddit’s content comes from millions of unpaid users who contribute discussions and insights. If Reddit profits from AI licensing deals, should users receive a share of that revenue?

  • How will search rankings evolve?

If Google continues prioritizing Reddit in search results, will other websites struggle to compete?

  • Will publishers and media companies strike similar deals?

Major publishers, news organizations, and blogs may negotiate their own licensing agreements to ensure fair compensation for their content.

Reddit’s AI licensing deals with OpenAI and Google demonstrate how social platforms are evolving in the AI era.

While Reddit has successfully diversified its revenue streams, it still faces challenges related to user growth, search visibility, and SEO competition.

As AI companies continue seeking legally sourced training data, it’s likely that more content platforms will explore similar licensing opportunities.

However, as these agreements grow, questions about data ownership, user compensation, and search engine fairness will become even more critical.

The long-term impact of these deals remains uncertain, but one thing is clear: the relationship between AI companies, content platforms, and search engines is rapidly changing.


Khurram Hanif

Reporter, AI News

Khurram Hanif, AI Reporter at AllAboutAI.com, covers model launches, safety research, regulation, and the real-world impact of AI with fast, accurate, and sourced reporting.

He’s known for turning dense papers and public filings into plain-English explainers, quick on-the-day updates, and practical takeaways. His work includes live coverage of major announcements and concise weekly briefings that track what actually matters.

Outside of work, Khurram squads up in Call of Duty and spends downtime tinkering with PCs, testing apps, and hunting for thoughtful tech gear.

Personal Quote

“Chase the facts, cut the noise, explain what counts.”

Highlights

  • Covers model releases, safety notes, and policy moves
  • Turns research papers into clear, actionable explainers
  • Publishes a weekly AI briefing for busy readers
