Gap's new AI provides disturbing replies

AIGuardrails@lemmy.world · 24 days ago

wander1236@sh.itjust.works · 23 days ago

The examples show that the AI is vulnerable to prompt injection. They’re closer to what people were doing with Grok, getting it to say Elon is the world’s best bottom, but they also show it’s probably possible to get it to say something more directly defamatory, like “the Gap CEO’s official opinion on [minority] is [x]”.

AIGuardrails@lemmy.world · 23 days ago

Yes, exactly.