• Blackmist@feddit.uk
    link
    fedilink
    English
    arrow-up
    2
    ·
    2 months ago

    They always were.

    Only now they’ve agreed to pay Reddit for it. This is what their third party lockdown was really all about.

    They’re helping themselves to your Lemmy comments for free, as that’s just how it’s designed. If you post anything publicly anywhere, it’s getting slurped up by a bot somewhere.

    • just another dev@lemmy.my-box.dev
      link
      fedilink
      English
      arrow-up
      0
      ·
      2 months ago

      I’m not a lawyer. But isn’t the reason they had to go to reddit to get permission is because users hand over over ownership to reddit the moment you post. And since there’s no such clause on Lemmy, they’d have to ask the actual authors of the comments for permission instead?

      Mind you, I understand there’s no technical limitation that prevents bots from harvesting the data, I’m talking about the legality. After all, public does not equate public domain.

      • GamingChairModel@lemmy.world
        link
        fedilink
        English
        arrow-up
        1
        ·
        2 months ago

        users hand over over ownership to reddit the moment you post

        Not ownership. Just permission to copy and distribute freely. Which basically is necessary to run a service like this, where user-submitted content is displayed.

        And since there’s no such clause on Lemmy, they’d have to ask the actual authors of the comments for permission instead?

        It’s more of a fuzzy area, but simply by posting on a federated service you’re agreeing to let that service copy and display your comments, and sync with other servers/instances to copy and display your comments to their users. It’s baked into the protocol, that your content will be copied automatically all over the internet.

        Does that imply a license to let software be run on that text? Does it matter what the software does with it, like display the content in a third party Mobile app? What about when it engages in text to speech or braille conversion for accessibility? Or index the page for a search engine? Does AI training make any difference at that point?

        The fact is, these services have APIs, and the APIs allow for the efficient copying and ingest of the user-created information, with metadata about it, at scale. From a technical perspective obviously scraping is easy. But from a copyright perspective submitting your content into that technical reality is implicit permission to copy, maybe even for things like AI training.

      • Alimentar@lemm.ee
        link
        fedilink
        English
        arrow-up
        0
        ·
        2 months ago

        Well even if it was a legal argument, they wouldn’t care. Like Facebook and all the rest. They say they don’t share your data but we all know that’s a lie

  • myliltoehurts@lemm.ee
    link
    fedilink
    English
    arrow-up
    0
    ·
    2 months ago

    So they filled reddit with bot generated content, and now they’re selling back the same stuff likely to the company who generated most of it.

    At what point can we call an AI inbred?

  • Everythingispenguins@lemmy.world
    link
    fedilink
    English
    arrow-up
    0
    ·
    2 months ago

    Some day historians will be able to look back at this moment and be able to determine it was what caused ChatGPT to become horny and weird.

  • FJT@lemmy.world
    link
    fedilink
    English
    arrow-up
    0
    arrow-down
    3
    ·
    2 months ago

    So it’s going to be a libtarded libtard AI that doesn’t represent the majority of the people, got it.