• terranoid@lemmy.cafe
    5 hours ago

    Finally someone said it. I honestly was wondering why no one was complaining about this… I’ve worked on some open source myself, licensed it GPL, and never intended for it to be used as training data.

    Doesn’t the GPL cover shit like this? There should be mass lawsuits hitting any AI that was trained on open source software and didn’t limit itself to BSD-licensed projects or something.

    If you train an LLM on GPL code, it should be illegal to sell that LLM and use it commercially without revealing ALL THE SOURCE you used and the source to regenerate that model.

    • merc@sh.itjust.works
      2 hours ago

      > never intended for it to be used as training data.

      You could have chosen a different license than the GPL.

      > Doesn’t the GPL cover shit like this?

      No. Didn’t you read the license you used?

    • AeonFelis@lemmy.world
      4 hours ago

      > If you train an LLM on GPL code, it should be illegal to sell that LLM and use it commercially without revealing ALL THE SOURCE you used and the source to regenerate that model.

      Also, if that LLM is used to generate code, that code must also be GPL.

      • terranoid@lemmy.cafe
        3 hours ago

        I’d love to see lawsuits force Microsoft and Nvidia and OpenAI to open source everything they had AI touch 😁

    • Chronographs@lemmy.zip
      4 hours ago

      Yeah, I mean, they train AI on commercially copyrighted stuff like books that they straight up pirate, so if that doesn’t stop them, the open source community certainly won’t.