• FaceDeer@fedia.io
    link
    fedilink
    arrow-up
    1
    ·
    2 months ago

    Training doesn’t involve copying anything, so I don’t see why they wouldn’t. You need to copy something to violate copyright.

    • enumerator4829@sh.itjust.works
      link
      fedilink
      English
      arrow-up
      0
      ·
      2 months ago

      There is an argument that training actually is a type of (lossy) compression. You can actually build (bad) language models by using standard compression algorithms to ”train”.

      By that argument, any model contains lossy and unstructured copies of all data it was trained on. If you download a 480p low quality h264-encoded Bluray rip of a Ghibli movie, it’s not legal, despite the fact that you aren’t downloading the same bits that were on the Bluray.

      Besides, even if we consider the model itself to be fine, they did not buy all the media they trained the model on. The action of downloading media, regardless of purpose, is piracy. At least, that has been the interpretation for normal people sailing the seas, large companies are of course exempt from filthy things like laws.

      • FaceDeer@fedia.io
        link
        fedilink
        arrow-up
        1
        ·
        2 months ago

        Stable Diffusion was trained on the LIAON-5B image dataset, which as the name implies has around 5 billion images in it. The resulting model was around 3 gigabytes. If this is indeed a “compression” algorithm then it’s the most magical and physics-defying ever, as it manages to compress images to less than one byte each.

        Besides, even if we consider the model itself to be fine, they did not buy all the media they trained the model on.

        That is a completely separate issue. You can sue them for copyright violation regarding the actual acts of copyright violation. If an artist steals a bunch of art books to study then sue him for stealing the art books, but you can’t extend that to say that anything he drew based on that learning is also a copyright violation or that the knowledge inside his head is a copyright violation.

    • Phoenixz@lemmy.ca
      link
      fedilink
      English
      arrow-up
      0
      ·
      2 months ago

      I hate lawyer speak with a passion

      Everyone knows what we’re talking about here, what we mean, and so do you

      • FaceDeer@fedia.io
        link
        fedilink
        arrow-up
        1
        ·
        2 months ago

        And yet if one wishes to ask:

        Did they have the right to do that?

        That is inherently the realm of lawyer speak because you’re asking what the law says about something.

        The alternative is vigilantism and “mob justice.” That’s not a good thing.