• 1 Post
  • 52 Comments
Joined 1 year ago
cake
Cake day: June 12th, 2023

help-circle















  • No it doesn’t, the training data isn’t inside the LLM.

    This is factually incorrect. You can extract the data. How do you think the legal cases are being brought?

    For example

    The model has to contain the data in order to produce works.

    Wholesale commercial copyright infringement where you’re profiting off of others work on a large scale is a whole different ball game.

    They’re training their models on large amounts of pirated content and profiting off it.

    Of course the rights holders are going to say “wait a minute, why are you making money off my content without my permission? And how much of my work did you pirate to use?”

    You cannot hand wave away mass piracy to train their models, and then distribute said models based on an act of mass copyright infringement.

    Do you not understand the basics of the law?

    its idiotic to think that its reasonable to demand such a thing.

    Again, the law is the law. If they mass pirate a bunch of media which then the model contains chunks of they are breaking the law.

    I can’t believe this is a hard concept for someone to understand.