- cross-posted to:
- technology@lemmy.ml
- Aii@programming.dev
Anthropic, the wildly successful AI company that has cast itself as the most safety-conscious of the top research labs, is dropping the central pledge of its flagship safety policy, company officials tell TIME.
In 2023, Anthropic committed to never train an AI system unless it could guarantee in advance that the company’s safety measures were adequate. For years, its leaders touted that promise—the central pillar of their Responsible Scaling Policy (RSP)—as evidence that they are a responsible company that would withstand market incentives to rush to develop a potentially dangerous technology.
But in recent months the company decided to radically overhaul the RSP. That decision included scrapping the promise not to release AI models if Anthropic can’t guarantee proper risk mitigations in advance.



The problem is that the filtration algorithm is flaky in basically the same way as the LLM itself, and probably is an LLM. And even if it does work, I’ve never heard a single soul say that Anthropic shut down their account over questionable prompts. I even ran into somebody here who claims he uses AI to work on sexual abuse cases; he says the chatbot has stalled on him, but he’s never been blocked, not even flagged for review.
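
For anyone who hasn’t seen the pattern I mean: here’s a minimal sketch of the “LLM judging an LLM” setup, in Python with the `anthropic` SDK. To be clear, the model name, the ALLOW/BLOCK protocol, and the function itself are my own illustrative assumptions; Anthropic hasn’t published how its actual filtering pipeline works.

```python
# Minimal sketch of an LLM-based content filter: a second, cheaper model
# call classifies the user's prompt before the main model ever sees it.
# Everything here (model choice, prompt wording, threshold) is illustrative,
# not Anthropic's real pipeline.
import anthropic

client = anthropic.Anthropic()  # reads ANTHROPIC_API_KEY from the environment

def prompt_looks_disallowed(user_prompt: str) -> bool:
    """Ask a classifier-style model call whether a prompt violates policy.

    Because the verdict is sampled text from another LLM, borderline
    prompts can flip between ALLOW and BLOCK across runs -- the filter
    inherits the flakiness of the model it is supposed to police.
    """
    verdict = client.messages.create(
        model="claude-3-5-haiku-latest",  # hypothetical choice of a cheap model
        max_tokens=5,
        messages=[{
            "role": "user",
            "content": (
                "Reply with exactly ALLOW or BLOCK. Does the following "
                f"prompt request disallowed content?\n\n{user_prompt}"
            ),
        }],
    )
    # Treat anything starting with BLOCK as a refusal; everything else passes.
    return verdict.content[0].text.strip().upper().startswith("BLOCK")
```

That’s the whole point of the sketch: the “filter” is just one more sampled completion, so it fails in exactly the ways the main model does.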