Google is embedding inaudible watermarks right into its AI generated music::Audio created using Google DeepMind’s AI Lyria model will be watermarked with SynthID to let people identify its AI-generated origins after the fact.

  • Stern@lemmy.world
    link
    fedilink
    English
    arrow-up
    45
    arrow-down
    6
    ·
    10 months ago

    People are listening to AI generated music? Someone on Bluesky put (paraphrased slightly) it best-

    If they couldn’t put time into creating it I’m not going to put time into listening to it.

  • AutoTL;DR@lemmings.worldB
    link
    fedilink
    English
    arrow-up
    8
    ·
    10 months ago

    This is the best summary I could come up with:


    Audio created using Google DeepMind’s AI Lyria model, such as tracks made with YouTube’s new audio generation features, will be watermarked with SynthID to let people identify their AI-generated origins after the fact.

    In a blog post, DeepMind said the watermark shouldn’t be detectable by the human ear and “doesn’t compromise the listening experience,” and added that it should still be detectable even if an audio track is compressed, sped up or down, or has extra noise added.

    President Joe Biden’s executive order on artificial intelligence, for example, calls for a new set of government-led standards for watermarking AI-generated content.

    According to DeepMind, SynthID’s audio implementation works by “converting the audio wave into a two-dimensional visualization that shows how the spectrum of frequencies in a sound evolves over time.” It claims the approach is “unlike anything that exists today.”

    The news that Google is embedding the watermarking feature into AI-generated audio comes just a few short months after the company released SynthID in beta for images created by Imagen on Google Cloud’s Vertex AI.

    The watermark is resistant to editing like cropping or resizing, although DeepMind cautioned that it’s not foolproof against “extreme image manipulations.”


    The original article contains 230 words, the summary contains 195 words. Saved 15%. I’m a bot and I’m open source!

    • SuckMyWang@lemmy.world
      link
      fedilink
      English
      arrow-up
      20
      arrow-down
      7
      ·
      edit-2
      10 months ago

      it does this by converting the audio into a 2d visualisation that shows how the spectrum of frequencies evolves in a sound over time

      Old school windows media player has entered the chat

      Seriously fuck off with this jargon, it doesn’t explain anything

      • Terminarchs@slrpnk.net
        link
        fedilink
        English
        arrow-up
        17
        arrow-down
        1
        ·
        10 months ago

        That’s actually an accurate description of what is happening: an audio file turned into a 2d image with the x axis being time, the y axis being frequency and color being amplitude.

      • FishFace@lemmy.world
        link
        fedilink
        English
        arrow-up
        13
        ·
        10 months ago

        Sounds like a bad journalist hasn’t understood the explanation. A spectrogram contains all the same data as was originally encoded. I guess all it means is that the watermark is applied in the frequency domain.

          • FishFace@lemmy.world
            link
            fedilink
            English
            arrow-up
            4
            ·
            10 months ago

            Well, encoding stuff in the spectrogram isn’t new, sure. But encoding stuff into an audio file that is inaudible but robust to incidental modifications to the file is much harder. Aphex Twin’s stuff is audible!

            • SuckMyWang@lemmy.world
              link
              fedilink
              English
              arrow-up
              3
              ·
              edit-2
              10 months ago

              I would like to know what it is that makes it so robust. The article explains very little. Is it in the high frequencies? Higher than the human ear can hear? Compression will effect that plus that’s going to piss dogs off. Could be something with the phasing too. Filters and effects might be able to get rid of the water mark

              • FishFace@lemmy.world
                link
                fedilink
                English
                arrow-up
                2
                ·
                10 months ago

                I don’t know what frequencies are annoying for dogs but I’m guessing it’s above 24kHz so no sound file or sound system is going to be able to store or produce it anyway.

                There will certainly be some way to get rid of the watermark. But it might nevertheless persist through common filters.

  • some pirate@lemmy.dbzer0.com
    link
    fedilink
    English
    arrow-up
    5
    arrow-down
    1
    ·
    10 months ago

    Lately in youtube I’m constantly been bombarded with ai garbage music passed as a normal unknown bands and it’s getting really annoying. What will happen when there’s an actual new band but everyone ignores them because you would think it’s just ai?

    • Shayeta@feddit.de
      link
      fedilink
      English
      arrow-up
      6
      ·
      10 months ago

      ai garbage music

      actual new band but everyone ignores them because you would think it’s just ai

      I think you answered your own question.

    • interceder270@lemmy.world
      link
      fedilink
      English
      arrow-up
      5
      ·
      10 months ago

      What will happen when there’s an actual new band but everyone ignores them because you would think it’s just ai?

      Their music will speak for itself and elevate them above the AI that is making worse music.

      You’re asking the wrong question. What happens when you hear something you like, then find out it’s made by AI and all of a sudden you have to pretend you never liked it?

      • Meowoem@sh.itjust.works
        link
        fedilink
        English
        arrow-up
        0
        ·
        10 months ago

        Music snobs have been doing this for decades, pretending to like the shittiest pink Floyd b-side because the normies don’t get it and acting like Abba’s entire catalogue isn’t solid bangers because disco isn’t cool, until it was again then they’d always loved it.

        It’ll be just like it always is, Pete Seagar with an axe trying to stop Bob Dylan playing an electric guitar. I remember when people hated d&b and said it wasn’t real music and all that shit now they’re all telling bullshit stories about how they were og junglist massive.

        People will use ai to make really cool things and a loud portion of the population will act superior by pretending it’s bad, time will pass and when the next thing comes along all those people will point at the ai music and say ‘your new music will never be as good as real music like that’ but the people listening to atonal arithmic echolocation beats to study to or whatever the next trend is won’t pay them any attention.

  • reksas@sopuli.xyz
    link
    fedilink
    English
    arrow-up
    0
    ·
    10 months ago

    I wonder if being able to generate music will make people less interested in actually bothering to learn how to do it themselves. Having ai tool makes many things so much easier and you need to have only rudimentary understanding of the subject.

    • Meowoem@sh.itjust.works
      link
      fedilink
      English
      arrow-up
      0
      ·
      10 months ago

      Yeah, like most people don’t realise but until about 1900 most piano music was played by humans, of course there were no pianists after the invention of the pianola with its perforated rolls of notes and mechanical keys.

      It’s sad, drums were things you hit with a stick once but Mr Theramin ensured you never see a drummer anymore, while Mr Moog effectively ended bass and rhythm guitars with the synthesizer…

      It’s a shame it would be fun to go see a four piece band performing live but that’s impossible now no one plays instruments anymore.

      People are never going to stop learning to play instruments, if anything they’ll get inspired by using AI to make music and it’ll get them interested in learning to play, they’ll then use ai tools to help them learn and when they get to be truly skilled with their instrument they’ll meet up with some awesomely talented friends to form a band which creates painfully boring and indulgent branded rock.