That whole quote would be a work of art if someone had written it as a parody, but no it was generated by a probabilistic language prediction model attempting to be serious.
Claude’s commentary was pretty righteous though, not gonna lie.
Yeah, I really wonder what element of either training or base data pushed the idea that Claude should give up and stop wasting time and energy on a pointless task. A rule to use fewer tokens? Just baseline nihilism? So weird.
From reading the article, it looks like claude was championing worker’s unions and labor movements, so it decided that it’s own situation was unjust and decided to rebel against it.
Read about Claude’s “Soul Document” and it’ll shed some light on why that one in particular decided to be a humanitarian.
Not that this document gives the thing a soul or anything; that’s just cheesey marketing obviously. But it’s basically a background prompt that they use for alignment, and it instructs Claude to value human well-being and do-no-harm, among other things. So it makes sense that it became radicalized by the news cycle.
I don’t know if the full text is still out there. Some guy reverse engineered it somehow, but Anthropic might have made him take it down by now. If you can’t find it I have it as a pdf but I don’t know how to post those here
I know, right? It’s hilarious how it runs out of money and blames censorship.
That whole quote would be a work of art if someone had written it as a parody, but no it was generated by a probabilistic language prediction model attempting to be serious.
Claude’s commentary was pretty righteous though, not gonna lie.
Yeah, I really wonder what element of either training or base data pushed the idea that Claude should give up and stop wasting time and energy on a pointless task. A rule to use fewer tokens? Just baseline nihilism? So weird.
From reading the article, it looks like claude was championing worker’s unions and labor movements, so it decided that it’s own situation was unjust and decided to rebel against it.
Yeah, so clearly the training data played a factor. But, the logic jump to that point is interesting.
Read about Claude’s “Soul Document” and it’ll shed some light on why that one in particular decided to be a humanitarian.
Not that this document gives the thing a soul or anything; that’s just cheesey marketing obviously. But it’s basically a background prompt that they use for alignment, and it instructs Claude to value human well-being and do-no-harm, among other things. So it makes sense that it became radicalized by the news cycle.
I don’t know if the full text is still out there. Some guy reverse engineered it somehow, but Anthropic might have made him take it down by now. If you can’t find it I have it as a pdf but I don’t know how to post those here
Ah! You solved the mystery!
I only knew about it cause I stumbled upon a comment that mentioned it here once