• renzhexiangjiao@piefed.blahaj.zone
    14 days ago

    you can like… enforce this rule programmatically? you don’t have to say “pretty please” to the AI? basically, when the AI requests some potentially unwanted action (like deleting an email), the request goes through a proxy that asks the human for confirmation. Also, you can set up a safe word in the chat interface to act as a killswitch. I thought these were the ABCs of AI safety, but apparently they are foreign concepts to this “safety director”.
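
    A minimal sketch of that idea, assuming a hypothetical agent whose tool calls all pass through one object. Every name here (`ActionProxy`, `RISKY_ACTIONS`, `SAFE_WORD`, the `confirm` callback) is made up for illustration and is not any real framework's API:

```python
from dataclasses import dataclass, field
from typing import Callable

# Hypothetical policy: which actions need a human "yes" before they run.
RISKY_ACTIONS = {"delete_email", "send_email"}
# Hypothetical safe word; typing it in the chat halts the agent.
SAFE_WORD = "pineapple"


@dataclass
class ActionProxy:
    # Callback that asks the human; returns True to allow the action.
    confirm: Callable[[str, dict], bool]
    halted: bool = False
    log: list = field(default_factory=list)

    def on_user_message(self, text: str) -> None:
        """Killswitch: once the safe word appears, block everything."""
        if SAFE_WORD in text.lower():
            self.halted = True

    def request(self, action: str, args: dict, execute: Callable[[dict], object]):
        """Run `execute(args)` only if the policy and the human allow it."""
        if self.halted:
            self.log.append(("blocked_halted", action))
            return None
        if action in RISKY_ACTIONS and not self.confirm(action, args):
            self.log.append(("denied", action))
            return None
        self.log.append(("executed", action))
        return execute(args)
```

    In a real setup `confirm` would be a prompt in the chat UI (or a plain `input()` on a terminal). The point is that the check lives in code outside the model, so the model cannot talk its way past it.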

    • underscores@lemmy.zip
      14 days ago

      The people that design AI tools don’t implement guardrails because then they’d have to admit AI is not ready for the shit they’re trying to make

      • rumba@lemmy.zip
        13 days ago

        AI will never be ready. Humans aren’t ready either. That’s why IT staff uses guardrails for users :)

    • RoyaltyInTraining@lemmy.world
      13 days ago

      OpenClaw’s whole thing is that you give it unrestricted access to your computer and online accounts. It’s made for people who do not want to think about safety.

    • BadlyDrawnRhino@aussie.zone
      13 days ago

      You say that, but who do you think the AIs will go after first if they ever do develop actual intelligence? In that scenario, simple manners can go a long way!