Reddit said in a filing with the Securities and Exchange Commission that its users’ posts are “a valuable source of conversation data and knowledge” that has been and will continue to be an important mechanism for training AI and large language models. The filing also states that the company believes “we are in the early stages of monetizing our user base,” and goes on to say that it will continue to sell users’ content to companies that want to train LLMs and that it will also begin “increased use of artificial intelligence in our advertising solutions.”

On Wednesday, Reuters reported that Reddit has entered into a contract with Google, under which Google will license Reddit’s content for $60 million a year to train its AI models.

  • rsuri@lemmy.world
    10 months ago

    Basically yes, but unlike Reddit, which has control over its proprietary network, Lemmy instances would have a hard time locking down access to create artificial scarcity for their data without causing other problems.