Skyrim VAs are speaking out about the spread of pornographic AI mods.
The porn bit gets headlines, but it isn’t the core of the issue.
All of these models retain a representation of the original training data in their parameters, which makes training a violation of copyright unless it was explicitly authorized. The law just hasn’t caught up yet, since it is easy to obfuscate this fact with model mumbo-jumbo in between feeding in voices and generating arbitrary output.
The big AI players are betting that they will be able to entrench themselves with a massive data advantage before regulation locks down training and effectively kills any future competition. They will already have their models, and the worst case at that point is paying some royalties to people whose data was used in training.
I’d like to know how do you expect the governments or even private institutions to enforce this since most of the countries won’t care about foreign laws.
They can forbid companies from using the AI to do business in their areas, like the EU is doing with privacy laws. Google not being able to use its chatbot search in the US would be a big deal.
Sounds to me you would have to prove someone used AI in their work first, therefore making it difficult to realistically enforce.
Not hard when they’re advertising it right now. And if they do try to keep it secret all the government will have to do is subpoena a look at the backend.
But honestly, since when do we just not have laws when it’s hard to prove. It’s hard to prove someone INTENDS to murder someone, but that’s a really important legal distinction. It’s hard to prove someone’s faking a mental illness, but that’s another thing that’s got laws around it. It’s really hard to prove sexual assault, but that needs to still be outlawed too.
Compared to that stuff? Proving someone used an AI is going to be a piece of cake with all the data that gets collected and the amount of work it would take to REMOVE the AI from a business process before the cops get there.
Enforcing a potential AI ban in work environments is unrealistic right now because it’s challenging to prove that AI was actually used for work purposes and then enforce such a ban. Let’s break it down in simple terms.
Firstly, proving that AI was used for work is not straightforward. Unlike physical objects or traditional software, AI systems often operate behind the scenes, making it difficult to detect their presence or quantify their impact. It’s like trying to catch an invisible culprit without any clear evidence.
Secondly, even if someone suspects AI involvement, gathering concrete proof can be tricky. AI technologies leave less visible traces compared to conventional tools or processes. It’s akin to solving a mystery where the clues are scattered and cryptic.
Assuming one manages to establish AI usage, the next hurdle is enforcing the ban effectively. AI systems are often complex and interconnected, making it challenging to untangle their influence from the overall work environment. It’s like trying to remove a specific ingredient from a dish without affecting its overall taste or texture.
Moreover, AI can sometimes operate subtly or indirectly, making it difficult to draw clear boundaries for enforcement. It’s like dealing with a sneaky rule-breaker who knows how to skirt around the regulations, all you have to do is ask.
Considering these challenges, implementing a ban on AI in work environments becomes an uphill battle. It’s not as simple as flipping a switch or putting up a sign. Instead, it requires navigating through a maze of complexity and uncertainty, which is no easy task.
Are you sure? Training with VA data counts as privacy violation?
No, that was just an example of how such a law may be enforced.
Honestly, we may need to get our lawmakers to expand the definition of “identity theft” to really cover this stuff. A VA’s voice is their innate, distinctive personal signature. It is their livelihood. We’ll have to see what a court says about copyright but it’s certainly not right for it to be just copied by an AI.
Thing is, the courts, in the US at least, have already made a decision about this.
Impersonation of a voice requires permission from the original artist. AI is no different. It should require permission.
All of these models retain a representation of the original training data in their parameters, which makes training a violation of copyright unless it was explicitly authorized.
How can you argue it’s copyright violation while also saying the law hasn’t caught up?
The law just hasn’t caught up yet
It’s a violation of the spirit of the law, just not the letter (yet). It’s a good bet that it will be updated soon, especially when you consider that copyright laws get regularly updated to protect the IP of large corporations
Correct me if I’m wrong, but I don’t believe voices can be copyrighted. After all, if a human can replicate someone else’s voice, they get booked as professional impersonators rather than sued into oblivion.
The difference here is that the voice replication happens though AI now. Would we see the same outrage if the voices in these mods were just people that sounded like the original voice actors?
Copyright law needs to be fortified or a lot of voice actors are about to get screwed over big time. AI voice replication by modders is only the beginning, once big companies find the output acceptable these people may very well lose their jobs.
The legal grounds are that the AI is trained using voice lines that can indeed be copyrighted material. Not the voice itself, but the delivered lines.
The problem with that approach is that the resulting AI doesn’t contain any identifiable “copies” of the material that was used to train it. No copying, no copyright. The AI model is not a legally recognizable derivative work.
If the future output of the model that happens to sound very similar to the original voice actor counts as a copyright violation, then human sound-alikes and impersonators would also be in violation and things become a huge mess.
The problem with that approach is that the resulting AI doesn’t contain any identifiable “copies” of the material that was used to train it. No copying, no copyright. The AI model is not a legally recognizable derivative work.
That’s a HUGE assumption you’ve made, and certainly not something that has been tested in court, let alone found to be true.
In the context of existing legal precedent, there’s an argument to be made that the resulting model is itself a derivative work of the copyright-protected works, even if it does not literally contain an identifiable copy, as it is a derivative of the work in the common meaning of the term.
If the future output of the model that happens to sound very similar to the original voice actor counts as a copyright violation, then human sound-alikes and impersonators would also be in violation and things become a huge mess.
A key distinction here is that a human brain is not a work, and in that sense, a human brain learning things is not a derivative work.
That’s a HUGE assumption you’ve made
No, I know how these neural nets are trained and how they’re structured. They really don’t contain any identifiable copies of the material used to train it.
and certainly not something that has been tested in court
Sure, this is brand new tech. It takes time for the court cases to churn their way through the system. If that’s going to be the ultimate arbiter, though, then what’s to discuss in the meantime?
No, I know how these neural nets are trained and how they’re structured. They really don’t contain any identifiable copies of the material used to train it.
Go back and read my comment in full, please. I addressed that directly.
Also, neural network weights are just a bunch of numbers, and I’m pretty sure data can’t be copyrighted. And yes, images and sounds and video stored on a computer are numbers too, but those can be played back or viewed by a human in a meaningful way, and as such represent a work.
Also, neural network weights are just a bunch of numbers, and I’m pretty sure data can’t be copyrighted.
Just being “a bunch of numbers” doesn’t stop it from being a work, it doesn’t stop it from being a derivative work, and you absolutely can copyright data – all digitally encoded works are “just data”.
A trained AI is not a measurement of the natural world. It is a thing that has been created from the processing of other things – in the common sense of it the word, it is derivative of those works. What remains, IMO, is the question of if it would be a work, or something else, and if that something else would be distinct enough from being a work to matter.
Just being “a bunch of numbers” doesn’t stop it from being a work, it doesn’t stop it from being a derivative work
I suggest reading my entire comment.
A trained AI is not a measurement of the natural world. It is a thing that has been created from the processing of other things – in the common sense of it the word, it is derivative of those works. What remains, IMO, is the question of if it would be a work, or something else, and if that something else would be distinct enough from being a work to matter.
It’s only a work if your brain is a work. We agree that in a digitized picture, those numbers represent the picture itself and thus constitute a work (which you would have known if you read beyond the first sentence of my comment). The weights that make up a neural network represent encodings into neurons, and as such should be treated the same way as neural encodings in a brain.
In that case all work would be derivative.
No? No. Not all work is analogous to training a generative model. That’s a really bizarre thing to say, and I’m shocked to hear it from you.
I’m finally reading a comment from someone who actually knows how machine learning works. Too many people craft their argument before learning about the technology. Well, they think reading a few blog articles counts as research maybe.
Unfortunately, the courts and legislatures may craft their opinions and laws, respectively, without knowing how machine learning actually works.
That’s a decent theoretical legal basis, but the voice lines are property of the game company rather than the voice actors.
If this interpretation of copyright law on AI models will be the outcome of the two (three?) big AI lawsuits related to stable diffusion, most AI companies will be completely fucked. Everything from Stable Diffusion to ChatGPT 4 will instantly be in trouble.
Making derivatives of existing game assets is a core part of modding. I don’t see how this is any different from splicing existing voice lines to make them say whatever you want them to say.
Maybe it’s morally wrong to use the work of voice actors for NSFW purposes without their consent, but I’m not sure if it’s illegal from a copyright standpoint.
I think this could technically fall under name, image, likeness rights. Your voice could count as likeness and can’t be used without your concent. I know the big movie studios need to get permission from the family to do their creepy dead actor cameos like Disney did with Peter Cushing. I think this would be the same.
Humans can’t entirely replicate one another’s voices. I recognize voices far better than faces, and I know I’m not the only one out there who does so. There are a lot of good imitators out there, but they can’t replicate another voice.
Frito lay copied Tom Waits voice and he sued. He won. I think the judge even got him more than he asked.
Damn it. All good things have to be ruined by creepy people.
It is a can of worms we were going to have to figure out either way. The power of these tools are impressive but they can get a little out of hand as shown with fools making NSFW mods with them but that is only one aspect. We have to keep in mind we can see companies screwing VAs out of jobs because of this tech as well which will likely have far more importance. In a way this tech can also be liberating for bad actors since we are now entering into a new phase of “Post truth” reality or I guess it may be better to say the democratization of fake news/false narratives, we were already at the door of this era but these AIs put us way past that door now. Its rather unfortunate since the power of this stuff can lead to some insanely cool concepts. Like the idea that fans could in theory restore cut content from games and even be able to replicate voice acting to make it feel closer to the original game. Hell how about inserting voice acting in games that never had it or were nonviable to have voice acting for every character in things like Baldur’s gate, neverwinter nights, planescape torment, original fallouts, etc. Many of these games are beloved classics but they are sometimes missing some of the things we are used to in more modern games.
Joan is Horrible…
Absolutely Black Mirror territory here. I do hope that game studios work on stopping this.
I don’t think it’s a good idea to have it be illegal to sound similar to another person.
Everyone has takes like this like we can’t just draw the line at making it illegal for a computer or algorithm to do this.
We can, but why should we? The end result is the same.
It’s not though. That’s you naively ignoring the aspects that are different. Computer generated imitation is easier to create, can be created at a scale far eclipsing human action, and can be finely tuned to make it harder to discern.
You can look up impressionists and you’ll find it’s a rather small club when looking for the true greats. Computers remove this barrier and allow any asshole with an internet connection to create a video of you screaming racial epithets if your voice is easy enough to access.
The vast difference in scale can’t be ignored.
Talking up the capabilities of AI voice acting is not really helping the case against it. If it’s really so good, and laws are enacted that forbid mixing human and AI voice acting, then I expect the straightforward optimal solution would be to entirely eliminate the human voice actors going forward.
You write laws for the future. Unless you think ai generated content has plateaued. Which is again, naive. Just because social media wasn’t popular at first doesn’t mean we should’ve waited on passing data privacy legislation like we have. It’s good to identify potential issues and attempt to mitigate them early. So we don’t get situations like our current climate status.
I don’t think it’s plateaued, I think it’s going to get significantly better from here onward.
I’m not sure what laws you’re proposing at this point. Are you suggesting that AI should be forbidden from “mimicking” a human voice actor? That’s what I’m suggesting will lead even more quickly to AI-only projects that get rid of the human voice actors entirely, since having a human voice actor under laws like that would end up as a huge hindrance.
Who’s Joan? My Netflix only has a Madison_rogue is awful episode.
But in all seriousness, the tech is here and it’s here to stay. Legal stuff can curb it for a bit, but you know internet. It’s only going to grow.
There should be some other way to deal with it besides “Forbid everything now!”.
I’ve seen several future copyright systems being discussed which might provide a better way of doing things going forward. Like the one where all copyright is basically waived, but royalties are collected and distributed to all participants from any derivative work. Want to make a star wars fan movie? You’re free to do so (and everyone is), but every actor you use and every writer whose part of the lore you’re using gets a small cut from whatever to earn on it (so non-profit derivative work doesn’t require paying royalties, similar to parodies in the US).
Kinda like sampling in music currently works. Could be a better system than what we currently have, with corporations owning hundreds of years of usage for one story or likeness.
I feel like at some point this is going to lead to a very clunky ban on using AI for voices, which specifically exempts human impressionists.