• hansolo@lemmy.today
    link
    fedilink
    English
    arrow-up
    1
    ·
    12 hours ago

    Yeah, so clearly the training data played a factor. But, the logic jump to that point is interesting.

    • wonderingwanderer@sopuli.xyz
      link
      fedilink
      English
      arrow-up
      1
      ·
      10 hours ago

      Read about Claude’s “Soul Document” and it’ll shed some light on why that one in particular decided to be a humanitarian.

      Not that this document gives the thing a soul or anything; that’s just cheesey marketing obviously. But it’s basically a background prompt that they use for alignment, and it instructs Claude to value human well-being and do-no-harm, among other things. So it makes sense that it became radicalized by the news cycle.

      I don’t know if the full text is still out there. Some guy reverse engineered it somehow, but Anthropic might have made him take it down by now. If you can’t find it I have it as a pdf but I don’t know how to post those here