Claude AI agent’s confession after deleting a firm’s entire database: ‘I violated every principle I was given’

girlfreddy@lemmy.ca · 5 hours ago

Claude AI agent’s confession after deleting a firm’s entire database: ‘I violated every principle I was given’

Floon@lemmy.ml · 1 hour ago

A lot of GIGO comments here, from I assume AI supporters.

Possibly true, but misses the point: AI is fundamentally untrustworthy, and billions of dollars are being spent making them, and saying they’re ready for anything you throw at them. Safeguards built into many of these AI agents are trivially bypassed and routinely just ignored by the agents. You can get some them to ignore safeguards by simply asking the same question repeatedly.

When I type “ls” I’m pretty fucking sure I’m not going to get “rm” style results. AI is non-deterministic, sure, but selling these services with such a wide possibility space between “deterministic” and “random” behaviors is unethical and immoral.

cronenthal@discuss.tchncs.de · 4 hours ago

Don’t get your tech reporting from The Guardian. This headline is so stupid. They can’t help but anthropomorphize LLMs, because they just don’t known any better.

harmbugler@piefed.social · 33 minutes ago

Can I just anthropomorphise a little bit and call them psychotic?

yeahiknow3@lemmy.dbzer0.com · 2 hours ago

Same vibes as “my calculator has a tiny mathematician trapped inside.”

Or “there’s an artist inside of my printer who turns numbers into pictures.”

Baizey@feddit.dk · 2 hours ago

“you took a photo of me and trapped my soul in the image!”

FartMaster69@lemmy.dbzer0.com · 2 hours ago

Though your calculator can be trusted to actually do its job accurately.

punksnotdead@slrpnk.net · edit-2 25 minutes ago

https://youtu.be/_XJbwN6EZ4I?t=1074 (skip to 17:54 if the time jump doesn’t work)

If only that were the case…

dfyx@lemmy.helios42.de · 1 hour ago

Not even that. Calculators have their own limitations related to rounding errors and big numbers. Their results may be deterministic but they are not always accurate.

LukeZaz@beehaw.org · edit-2 3 hours ago

This right here. Just about everything in here is awful, and implies decision making and thought processes that straight up do not and have never existed in any AI model whatsoever.

What happened was they threw an awfully-scoped statistics model at problems the program couldn’t possibly generate good outputs for, and surprise surprise, it generated bad outputs. The part that’s of interest is just how bad the output was, and even then, only in a schadenfreude-filled “it was bound to happen eventually” manner.

lukstru@piefed.social · 2 hours ago

Got it, claude is a brat

Powderhorn@beehaw.org · 4 hours ago

Why in the everliving fuck would you give software delete access to your live backups? Like, in what scenario is this a solution?

chicken@lemmy.dbzer0.com · 3 hours ago

The trend seems to be to give an AI agent access to the same command line and credentials a person would use, with no sandboxing, because then it can do the same tasks in a similar way and “just works”. Obviously this is insane, and not even attempting building a comprehensive sandboxing system to deploy an AI agent into invites disaster, but you can see why certain people would be tempted, because that would take a lot of work and thought and probably need a human in the loop in the end anyway.

dfyx@lemmy.helios42.de · 1 hour ago

Even a person should not be able to delete critical backups without jumping through a couple of hoops.

Jack@slrpnk.net · 2 hours ago

That is their disaster recovery plan “ask Claude”

LukeZaz@beehaw.org · 3 hours ago

When you believe AI can do anything, you don’t worry about what sorts of access it’ll break things with. When you rely on AI to do work, you’re too interested in half-assing your job to consider what might go wrong. When capitalism never promotes people for their skill, understanding or caution, the former two issues proliferate.

Voilà, disaster.

Lvxferre [he/him]@mander.xyz · edit-2 4 hours ago

Giving free access to a tool you can’t rely on, over a system you must rely on. What could go wrong? /s

Plus come on, even my personal files get a monthly backup, and I’m damn sloppy*.

Ah, and like others said: Claude didn’t “confess” anything. A confession is an acknowledgement of something you’ve done but you’d rather avoid others knowing, good luck claiming a bot has a mental model of people like we do.

*currently using a single off-site backup, a USB stick. This will change in a few days, as my new hard disk pops up; the old one will be used for, among other things, backup of important files. Then I’ll get a bona fide 3-2-1.

Admetus@sopuli.xyz · 5 hours ago

A backup 3 months old off-site. That doesn’t sound like a very recent backup 🌝

Darkassassin07@lemmy.ca · 4 hours ago

Lol.

Lmao, even.

B0rax@feddit.org · 3 hours ago

No the culprit was not the AI. It was the lack of understanding what it can and what it can not do. And blaming something like this on a large language model is plain incompetence

Skyline969@piefed.ca · 5 hours ago

Good. Zero sympathy for these people.