Last week we shared a number of updates with our community of users, and now we want to share them here: At Mozilla, we work hard to make Firefox the best
I am just hoping governments will see the massive issues and copyright problems with AI and ban that garbage outright soon so all these companies eager to add their AI trash to every single product they ship will stop.
There is no general copyright issue with AI. It depends entirely on the training material (if even then), so it’s not possible to make blanket statements like that. Banning a technology because one particular implementation is problematic makes no sense.
The only relevant training material to make a truly complete dataset must include copyrighted material or you do not have a full set of data to draw from and thus it is useless. Stop defending this horrible technology.
What do you mean “full set of data”?
Obviously you cannot train on 100% of material ever created, so you pick a subset. There is a lot of permissively licensed content (e.g. Wikipedia) and content you can license (e.g. Reddit). While not sufficient for an advanced LLM, it certainly is for smaller models that do not need wide knowledge.