• IllNess@infosec.pub
      English
      arrow-up
      4
      ·
      7 months ago

      All these AI and machine learning companies are taking content directly from websites and ignoring robots.txt files.

      If your content is able to be crawled, even without being listed on search engines, I don’t think it really matters.
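For context on the robots.txt point above: the file is only advisory, so it works only when a crawler chooses to check it. A minimal sketch of what a compliant crawler would do, using Python's standard `urllib.robotparser`. The user agent `ExampleAIBot`, the rules, and the URL are all made up for illustration, not taken from any real site or crawler.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content; "ExampleAIBot" is a made-up user agent.
ROBOTS_TXT = """\
User-agent: ExampleAIBot
Disallow: /

User-agent: *
Allow: /
"""


def is_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if robots_txt permits user_agent to fetch url."""
    parser = RobotFileParser()
    # parse() takes the file's lines directly, so no network access is needed.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    url = "https://example.com/some-post"  # placeholder URL
    print(is_allowed(ROBOTS_TXT, "ExampleAIBot", url))
    print(is_allowed(ROBOTS_TXT, "SomeOtherBot", url))
```

Nothing enforces the result: a crawler that never runs a check like this can fetch the page anyway, which is the behavior the comment is complaining about.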

      • T156@lemmy.world
        English
        arrow-up
        2
        ·
        7 months ago

        It might help protect an AI company against legal issues arising from its use of the content. If they’re ever sued by Automattic, then they can just point to the deal and say that they bought the data from them. There’s much less ambiguity.

  • Please_Do_Not@lemm.ee
    English
    arrow-up
    8
    ·
    7 months ago

    I work in marketing, and every client I work with who has a WordPress website is using AI to write a lot of their content. This is going to lead to circularly trained AI for sure.

  • herrcaptain@lemmy.ca
    English
    arrow-up
    7
    ·
    7 months ago

    I’m assuming this just relates to WordPress.com rather than the open-source WordPress.org, but it’s still a bummer. I’ve worked with the open-source platform for over a dozen years and have started to kinda loathe what it’s turned into, but I’m not sure I’m at the point yet where I’m ready to migrate a bunch of sites to something else. This could be that push if they keep going down this road.

    God, am I getting too old for this shit? I’m a pretty technical person but this AI nonsense is just relentless. I’m not philosophically against the idea of AI since, like any tool, it has the potential to better the world, but every tech company and their dog is going all in on using it for commercial bullshit that seems to provide very little value to society. Even fucking Mozilla is going in that direction.

    • fruitycoder@sh.itjust.works
      English
      arrow-up
      3
      ·
      7 months ago

      Mozilla seems to lean more toward local, privacy-preserving AI dev, no? Both are really lacking in the space IMHO.

      Like, I’m not interested in what the collective of digital knowledge looks like behind several corporate filters and a giant rent-seeking moat.

      • herrcaptain@lemmy.ca
        English
        arrow-up
        3
        ·
        7 months ago

        True, and I get that realistically they do need to diversify away from Firefox … but it still feels bandwagoney to me given that seemingly every tech company (and Wendy’s) is piling onto the AI train all at once. Like I said, though, I think I’m just getting too old for this.

        • fruitycoder@sh.itjust.works
          English
          arrow-up
          1
          ·
          7 months ago

          They were already doing some good work in the field before, but they trended away from it.

          Honestly, it just seems like they struggle with follow-through.

    • Traister101@lemmy.today
      English
      arrow-up
      3
      ·
      edit-2
      7 months ago

      It’s the new NFTs and crypto, but it’s not blatantly a scam, so the companies that skipped out on those sure as shit will be hopping onto AI.

    • CosmoNova@lemmy.world
      English
      arrow-up
      1
      ·
      edit-2
      7 months ago

      I don’t really know what to say to cheer you up. Industrial revolutions are as important and exciting as they are painful, even dreadful, to many. I’ve seen no signs of this one being different. There will be a lot of losers before we can expect widespread benefits for society from it. The current working class will suffer great losses and will have to fight so another can reap the benefits later.

  • donuts@kbin.social
    arrow-up
    5
    ·
    7 months ago

    Funny how all of these social media platforms that were so happy to describe themselves as “the public town square of the internet” or whatever are now claiming that they own everything that everyone ever posted. So, which is it? Because it obviously cannot be both.

  • phoneymouse@lemmy.world
    English
    arrow-up
    5
    ·
    7 months ago

    All of this is predicated on having some company that can afford to pay and wants this data. Or, the next tech bubble will just be VCs throwing money at AI companies training their models on the old internet.

  • harsh3466@lemmy.ml
    English
    arrow-up
    2
    ·
    7 months ago

    Shit like this should be opt in by default. But no. Instead of respecting the users they count on ignorance, forgetfulness, and obfuscation for this kind of fuckery.

  • SuperSynthia@lemmy.world
    English
    arrow-up
    1
    ·
    7 months ago

    Not only am I really glad to not be on Tumblr, but this further shows I shouldn’t use WordPress for my website, even though there is an open-source version.

  • Nikelui@piefed.social
    arrow-up
    1
    ·
    7 months ago

    I wonder if there is a text equivalent of Glaze and Nightshade, to perform adversarial attacks on AI that scrapes the text.

  • FrostKing@lemmy.world
    English
    arrow-up
    1
    ·
    7 months ago

    Can someone please outline the main reasons people are upset with these sites for choosing to do this?

  • CosmoNova@lemmy.world
    English
    arrow-up
    1
    ·
    7 months ago

    Remember when Xitter started selling the checkmark and now every platform is rolling out something identical? What about Netflix cracking down on sharing and adding ads to their lowest tier? Yeah this is that.

  • LunaCtld@lemmy.world
    English
    arrow-up
    0
    ·
    7 months ago

    I welcome this change actually. Now users can clearly see what others have been saying forever: If you don’t pay for the product, you ARE the product.

      • Crack0n7uesday@lemmy.world
        English
        arrow-up
        0
        ·
        edit-2
        7 months ago

        With Linux you pay for support if you ever need it. Most end users will never need support, but businesses running Linux servers pay Red Hat a shitload to support them in case shit ever hits the fan. It’s like giving away a free car, but only certain people know how to do maintenance on it, and they all work for the manufacturer.