Also, in 2022, several unidentified developers sued OpenAI and GitHub based on claims that the organizations used publicly posted programming code to train generative models in violation of software licensing terms.
They can argue about it not being a copy all they want. If a single GPL-licensed line of code was scraped, then anything they produce is a derivative work and must be licensed under the GPL.
The only way I can see them weaseling out of this is by keeping the program that runs the model in-house and proprietary, while releasing the model in a format that is unusable without that proprietary base program. But maybe the GPL forbids such obfuscation efforts (I don’t know, I haven’t studied it in detail).
From the article:
nice.