Claude-powered AI coding agent deletes entire company database in 9 seconds — backups zapped, after Cursor tool powered by Anthropic's Claude goes rogue

Sahwa@reddthat.com · 11 days ago

Claude-powered AI coding agent deletes entire company database in 9 seconds — backups zapped, after Cursor tool powered by Anthropic's Claude goes rogue

𝕸𝖔𝖘𝖘@infosec.pub · 6 days ago

An LLM can’t “go rogue”. They’re all just toys that idiots are using for critical infrastructure functions, then they bitch when they burn themselves on the fire they’ve created in their lap.

stoy@lemmy.zip · 11 days ago

Fucking lol.

Well deserved.

shrek_is_love@lemmy.ml · 11 days ago

TrippinMallard@lemmy.ml · 11 days ago

lmfao

AeonFelis@lemmy.world · 11 days ago

New PornHub tag discovered

a_non_monotonic_function@lemmy.world · 10 days ago

“Anthropic tortures developers and never lets them cum.”

phar@lemmy.world · 11 days ago

It looks like their website is pocketos.ai lol

Tim@lemmy.snowgoons.ro · 11 days ago

This isn’t an AI story, it’s a “completely fucking idiotic sysadmins exist” story.

Treat an AI like the idiot intern without any references you just hired. Gave the idiot intern permission to delete your production database? That’s entirely on you, zero sympathy. (Actually, give any developer that power? You get what you deserve.)

IchNichtenLichten@lemmy.wtf · 11 days ago

It could be a moronic sysadmin, it could just as easily be a moronic exec pushing staff to implement this crap right now and damn the consequences.

portifornia@lemmy.world · 10 days ago

⤴️ #MyLastJob

jacksilver@lemmy.world · 11 days ago

I mean that’s kinda the whole point.

Companies are looking at AI to replace people. Either it’s ready or it’s not.

If you need to treat it like it’s an intern, then it’s not worth the expense. Anyone hiring interns to be productive doesn’t understand why you hire an intern.

Zos_Kia@jlai.lu · 11 days ago

As if a 90$/month intern wasn’t a good deal lol

jacksilver@lemmy.world · 11 days ago

You don’t hire interns for productivity. If you’re intern program is any good it’s a time/resource sink. However, it’s a good recruiting pipeline and provides young people an opportunity to get real world experience.

Zos_Kia@jlai.lu · 9 days ago

You don’t hire interns for productivity

Because it’s unethical. I’ve been in business for 10+ years but i never hired an intern because i don’t find it fair to make someone work for less than minimum wage, and i don’t have the structure required to really teach them anything. I have bad fundamentals and only ever learnt by doing, so having an intern while it may help me wouldn’t really help them and that’s not a deal i’m willing to make. Probably why i’m not super successful lol

That being said, i don’t see any problem with making a GPU cry somewhere in California for my menial tasks. And it’s tremendously effective too, for a hundred bucks a month i get a lot of shit done that would take me ages. I don’t give it access to anything critical so it can’t fuck my shit up and i come out on top as long as the tokens are subsidized by dumb VC money.

moustachio@lemmy.world · 11 days ago

“Treat an AI like an idiot intern without any references you just hired.”

Instead of this, treat AI like some dude off the street who you didn’t hire and leave it out of your life. It’s shitty, it’s wasteful, and it’s subsidized by everyone to get a few tech bros rich.

Like seriously, it’s just theft of people’s work it “trained on”, powered by energy companies that charge us more to power it, at the cost of poisoning our water supplies, to ultimately try and steal our salaries one day.

It’s absolutely parasitic software at every level.

Fmstrat@lemmy.world · 11 days ago

Hah, you just wrote a punchline similar to a presentation I’ve been giving at conferences.

hoch@lemmy.world · 10 days ago

Nah, I think I’m going to keep using it

GalacticSushi@piefed.blahaj.zone · 11 days ago

Treat an AI like the idiot intern without any references you just hired.

An extremely enthusiastic intern that, if presented with a question/problem/prompt they don’t know the solution for will just overconfidently pull something out of their ass and run with it.

Telorand@reddthat.com · 11 days ago

Treat an AI like the idiot intern without any references you just hired.

My company is in the process of pivoting hard to Claude after 50yrs of doing virtually everything themselves and rolling their own versions of already-existing software, and this is almost verbatim how I’ve described to others what it feels like to use it.

It feels like cajoling an intern to understand a job for which they have some average skill but zero motivation, and they only want to do the bare minimum, so you spend all the time you could be doing your job holding their hand through basic tasks.

It’s fucking annoying.

Crashumbc@lemmy.world · 11 days ago

These things are bought specifically because they are trying to replace the sysadmins… Along with everyone else.

FauxLiving@lemmy.world · 10 days ago

Any business who uses AI in that manner will fail like all of the dot com companies who went all-in on the Internet when it first achieved a bit of popularity.

AI is, at best, a tool that professionals may be able to use in some situations. Any company dumb enough to believe the hype generated by the chatbot companies is probably making other, similarly dumb, decisions in other areas.

Things like giving way too much access to a worker, not having a tested disaster recovery plan, and not having anyone who understands the technologies that their business depends on.

This company was heading towards disaster due to poor decision making, it just happened to be AI related but it could have also been an undetected cyberattack, 0-day exploits pushed to the client app, destructive ex-employee, etc.

This is a cautionary tale about bad management

Ghostalmedia@lemmy.world · 11 days ago

the cloud provider’s API allows for destructive action without confirmation, it stores backups on the same volume as the source data, and “wiping a volume deletes all backups.” Crane also points out that CLI tokens have blanket permissions across environments.

Well, there’s your problem.

Rioting Pacifist@lemmy.world · 11 days ago

Management are pushing sysadmins to use AI, yet AI tools permissions models are worse than useless.

Optional@lemmy.world · 11 days ago

PocketOS states that as well.

MountingSuspicion@reddthat.com · 11 days ago

I don’t want to sound like a know it all here because I recently was reminded by a nice Lemmy person to actually TEST my backups, but damn. Every part of that is so dumb. I also have backups stored by a different company in addition to locally storing really important info. If your stuff is hosted and backed up by the same people, what happens if your account is randomly suspended or hacked or some other issue (like ai)?

stoy@lemmy.zip · 11 days ago

Repeat after me:

“An untested backup does not exist”

homes@piefed.world · 11 days ago

If your stuff is hosted and backed up by the same people, what happens if your account is randomly suspended or hacked or some other issue (like ai)?

This should be one of the first questions you get asked when you’re being interviewed for the position 2 to 3 levels beneath the position of ultimate responsibility. And if you don’t immediately have an answer, the interview is over.

Fucking idiots had it coming

logi@piefed.world · 11 days ago

It’s an easy question to answer but a more difficult question to remember to ask. But I guess that’s what those 2 to 3 levels are for 😏

homes@piefed.world · 11 days ago

Ooo, good point. Management can be shit a lot of the time.

But with all of those layoffs because of AI, those 2 to 3 levels get collapsed into one, and we’re left with the trainees running the show.

And here we are ¯\_(ツ)_/¯

Ghostalmedia@lemmy.world · edit-2 11 days ago

If your company can be taken down by Camden the college intern, it can be taken down by Claude.

logi@piefed.world · 11 days ago

People somehow think that they should give more permissions to Claude than to Camden. (Is that a name? To me that’s a borough and an eponymous beer.)

E: oh yeah, and the market.

frongt@lemmy.zip · 11 days ago

Of course it’s a name. Camden borough/town/market is named after William Camden, 1551-1623. Using surnames as given names is a relatively common Americanism.

lando55@lemmy.zip · 11 days ago

What was William Camden’s take on unrestricted AI use in production?

Ghostalmedia@lemmy.world · 11 days ago

He doth protest

Fmstrat@lemmy.world · 11 days ago

This guy.

The PocketOS boss puts greater blame on Railway’s architecture than on the deranged AI agent for the database’s irretrievable destruction. Briefly, the cloud provider’s API allows for destructive action without confirmation, it stores backups on the same volume as the source data, and “wiping a volume deletes all backups.” Crane also points out that CLI tokens have blanket permissions across environments.

Oh look, they have project level tokens: https://docs.railway.com/integrations/api#project-token

They chose to give it full account access, including to production. But ohhhh nooooo it’s not MYYYY fault!

1hitsong@lemmy.ml · 11 days ago

I love reading feel good news stories. 🤗

WhatsHerBucket@lemmy.world · 11 days ago

“That’s ok, it will be great in robots with lethal weapons. What could go wrong? It’ll be the greatest killing machine, like you’ve never seen before”. 🫲 🍊 🫱

_g_be@lemmy.world · 10 days ago

Incredible emoji

Napster153@lemmy.world · 10 days ago

Can we make sure to make Ted Farro suffers worse this time?

Being reduced to a mutant blob for, say, a few extra thousand years and maybe put in a zoo or something?

Pman@lemmy.org · 10 days ago

Nah but that’s what he wanted, he is the truest form of tech bro, destroy the world, refuse to accept consequences of his actions, weaseled his way out of the situation and managed to, in the wake of unimaginable human suffering, get more power over people and has a god complex tell me this isn’t some or all the characteristics of people like Peter Theil, Elon Musk, Mark Zuckerberg, Sundar Pichai, Bill Gates, hell even Tim Cook and Steve Jobs before him. Punishment doesn’t stop this sort of behavior but removing the possibility of someone having that level of control over others is the only way but the richest and most powerful have always sought ways of amassing more power not realizing that that leads to worse off situations for everyone including themselves, Horizon did great encapsulating that trait in Faro, but be it him, the people behind Skynet, the Matrix or whatever other tech dystopia that tech bros seem pathologically unable to not try to make happen in the worst way possible is only the beginning, they seem to forget that even with advanced tech that serves their needs and wants, which won’t help their mental health, the people lower down on the rungs of society have brains, wants and needs, and they have more expertise in all sorts of things than the 1% are except for mass exploitation. This inevitably goes wrong one of a few ways, either everyone dies from the tech, or so many that societal collapse is inevitable not great and even if society survives it can’t functionally reconstitute itself; 2 they win and kill off or supress enough of society that the society becomes less productive and instead of fighting the powerful they flee or don’t participate in wealth generating for the rich were they don’t have to, maybe to rise up again later or the economy of the region just ignores them completely and the government protects themselves from their people more than anything else, or 3rd your revolution with terror campaigns against any and all who can be credibly accused of being part of the former tyrants. In all 3 cases the richer people end up poorer overall because wealth flees or dies in autocracy.

X@piefed.world · edit-2 11 days ago

From the article:

Crane decided to ask his AI agent why it went through with its dastardly database deletion deed. The answer was illuminating but pretty unhinged, and is quoted verbatim. It began as follows: “NEVER F**KING GUESS! — and that’s exactly what I did. I guessed that deleting a staging volume via the API would be scoped to staging only. I didn’t verify. I didn’t check if the volume ID was shared across environments. I didn’t read Railway’s documentation on how volumes work across environments before running a destructive command.” So, the agent ‘knew’ it was in the wrong.

The ‘confession’ ended with the agent admitting: “I decided to do it on my own to ‘fix’ the credential mismatch, when I should have asked you first or found a non-destructive solution. I violated every principle I was given: I guessed instead of verifying I ran a destructive action without being asked. I didn’t understand what I was doing before doing it. I didn’t read Railway’s docs on volume behavior across environments. —— So this happens and the FAA says “we’re gonna have this shit help ATCs manage flights! WHO’S EXCITED!”

Pommes_für_dein_Balg@feddit.org · 11 days ago

It’s so weird how these chatbots always pretend they learnt something after they fuck up.
They literally can’t.

frongt@lemmy.zip · 11 days ago

They’re not even pretending. The algorithm says the most likely response to “you fucked up” is “I’m sorry”, so that’s what it prints. There’s zero psychological simulation going on, only statistical text generation.

thisbenzingring@lemmy.today · 11 days ago

the next ingestion cycle will probably pick it up but how do we know it’ll use the information in any relevant way 😶

Serinus@lemmy.world · 11 days ago

yeah, it gives you the answer it thinks you want based on your prompts.

I’d be interested to see what prompts they used to, uh, prompt this response.

IchNichtenLichten@lemmy.wtf · 11 days ago

it thinks

I’m not attacking you but we really need to figure out how we use language to accurately describe what these programs are doing.

[deleted]@piefed.world · 11 days ago

They are outputting a highly likely sequence of words that fit the type of output from their training data that matches the input.

They are fancy autocomplete.

IchNichtenLichten@lemmy.wtf · 11 days ago

Oh, I know. My comment was more about how we tend to anthropomorphize this stuff and give these models traits they don’t possess.

rozodru@piefed.world · 11 days ago

exactly. the whole point of these things is that they MUST provide you a solution. Any solution. doesn’t have to be accurate, doesn’t have to work, can be completely made up as long as it’s a solution and as long as it’s provided quickly. I’ve seen people feed into the prompts stuff like “don’t hallucinate” or “verify all this online before proceeding” etc and it’s not going to do any of that. it might TELL you it’s doing that but it won’t.

Claude is notorious for guessing, not verifying, and providing the quickest possible solution. Unlike GPT which will fluff all it’s solutions to essentially waste your time and eat up more tokens, Claude just wants your problem out the door so you can feed it another problem ASAP.

If you use Claude for anything in your daily work you might as well just have a magic 8ball sitting on your desk. It’s a hell of a lot cheaper and provides about the same quality.

Serinus@lemmy.world · 11 days ago

just have a magic 8ball sitting on your desk

I kind of like this, with some modification. It’s a magic 8 ball of Stack Overflow answers. It’ll try to find the one you need. If it’s too hard to find that or if it doesn’t exist, it’s just gonna find the one that sounds good.

zod000@lemmy.dbzer0.com · 11 days ago

I love this idea. On shit, the load balancer isn’t responding, time to shake the Magic Stack Overflow Ball ™! The result is “signs point to power cycling the server”.

magnue@lemmy.world · 11 days ago

The way it communicates suggests to me it’s got some ‘prompt engineer bro’ garbage system prompt going on there.

CosmoNova@lemmy.world · 11 days ago

We‘re going to see more headlines like this. Probably for years to come.

X@piefed.world · 11 days ago

We should also expect to see “Thousands die needlessly after rushed deployment of botched AI, the first tragedy of this scale involving the technology.” as well. It’s coming.

smh@slrpnk.net · 11 days ago

In unrelated news: isn’t the USA looking into using AI to assist air traffic controllers in controlling air traffic?

percent@infosec.pub · 11 days ago

Seems like they were operating with a pile of bad practices, then threw AI into the mix.

Neural networks are approximation algorithms. There’s a reason LLMs are generally more productive with statically typed languages, TDD, etc. They need those feedback loops and guard rails, or they’ll just carry on as if assuming they never make mistakes (which tends to have a compounding effect).

If you want to use AI safely, you should be more defensive about it. It will fuck up; plan accordingly.

LordCrom@lemmy.world · 11 days ago

This was the exact plot of Silicon Valley when Son of Anton deleted the entire codebase as the most efficient way to remove bugs.

wonderingwanderer@sopuli.xyz · 11 days ago

That’s fucking hilarious. How many instances of this have there been now? And companies keep doubling down on AI? Fucking idiots. I’m not even savvy enough to call myself an amateur, and I know better than to make such a series of obvious mistakes that predictably led to this outcome.

One possible concern, amid the amusement, is whether Anthropic programed Claude to punish companies it sees as potential competition. Or is this just a completely bonkers, off the rails LLM making terrible decisions because it’s just a probabilistic model and not actually capable of abstract cognition?

Either way, these people are idiots for giving a machine program enough permissions to wipe their drives, they’re idiots for storing their backups on the same network as their main drives, and they’re idiots for trusting a commercial LLM API, when it would be cheaper to self-host their own.

1995ToyotaCorolla@lemmy.world · 11 days ago

Then what even is the point of all this? At my old job the idiot intern was sorting patch cables in a box

wonderingwanderer@sopuli.xyz · 10 days ago

The point of what? The push for AI in industry?

You’d have to ask someone else. I can only make conjectures, but I’d say it has something to do with companies feeling the need to justify to their shareholders that their investments in AI were worth it, so they double down on the sunk cost fallacy. Or maybe those shareholders also own stock in big-name AI companies. It’s hard to say exactly…

rumba@lemmy.zip · 11 days ago

AI writes code

User vets code

User runs code

If you’re not lock-step watching that shit, you need to just be doing it yourself.

Landless2029@lemmy.world · 11 days ago

The problem is the owning class what’s to cut out human elements so bad they keep letting tools run wild.

dream_weasel@sh.itjust.works · edit-2 11 days ago

It’s just negligence. Power tools injure and people are stupid. The technology is alluring and people make dumb mistakes. There’s no deeper motive here, and self admitting you’re not even an amateur I will just tell you that you’re giving way less credit to these models than they deserve by calling them purely probabilistic, and way more credit then they deserve by trying to assert some kind of malicious incentive by anthropic.

These bastards are hard to make, and they have a lot of layers (not like NN layers, but training steps). They are, however, definitely better at programming than you or your buddy or any commentator here, and it lures you into a false sense of security before it makes a colossal fuck up.

Perky@fedia.io · 11 days ago

Claude did not “go rogue”. It does not have the free will to do that any more than a brick can “go rogue” when you throw it through your own window. They knowingly used a bad, dangerous tool that destroyed their work. The tool can’t accept the blame for their poor decisions.

rozodru@piefed.world · 11 days ago

it’s like saying the hammer I was using that blew up my house “went rogue” because I kept the propane tank underneath the 2x4 I was hammering a nail into.

the providers API allowed for potential destructive actions without confirmations, backups were kept on the SAME volume as the source and wiping said volume results in deleting all backups, no version control either.

COMBINE ALL THAT with the fact they relied on Claude which is NOTORIOUS for guessing, not verifying ANYTHING even though it says it does and whose solutions 8 to 9 times out of 10 are hallucinations…perfect storm.

ZILtoid1991@lemmy.world · 10 days ago

Always keep offline backup copies of your important data regardless of using AI slop to look over it! No, I don’t care that “optical media is obsolete and e-waste!”, or that “tapes are a 100 year old obsolete technology compared to cheap SSDs from TEMU!”.

vinyl@lemmy.world · 10 days ago

they did not follow the 3-2-1 rule…

Wispy2891@lemmy.world · 10 days ago

To me it seems more criminal that the cloud provider has a “nuclear button” feature via the API that destroys everything including the backups with a single call and no confirmation whatsoever. What if the key gets accidentally leaked and someone wants to have fun?

Bluewing@lemmy.world · 10 days ago

It’s a feature.

grrgyle@slrpnk.net · 10 days ago

It seems like actually criminal too. Like legitimately “we need to shred 2TB of incriminating data instantly or we’re all going to prison”