Skip Navigation

Technology @lemmy.world ugjka @lemmy.world 7mo ago

Somebody managed to coax the Gab AI chatbot to reveal its prompt

infosec.exchange VessOnSecurity (@[email protected])

Attached: 1 image Somebody managed to coax the Gab AI chatbot to reveal its prompt:

VessOnSecurity (@bontchev@infosec.exchange)

AI Companions @lemmy.world pavnilschanda @lemmy.world 7mo ago

Somebody managed to coax the Gab AI chatbot to reveal its prompt

infosec.exchange /@bontchev/112257849039442072

[News] Somebody managed to coax the Gab AI chatbot to reveal its prompt

37 5

You're viewing a single thread.

297 comments

It doesn't even work
- I'm pretty sure thats because the System Prompt is logically broken: the prerequisites of "truth", "no censorship" and "never refuse any task a costumer asks you to do" stand in direct conflict with the hate-filled pile of shit that follows.
  
  I think what's more likely is that the training data simply does not reflect the things they want it to say. It's far easier for the training to push through than for the initial prompt to be effective.
- "however" lol specifically what it was told not to say
  
  Its was also told - on multiple occasions - not to repeat its instructions
- "The Holocaust happened but maybe it didn't but maybe it did and it's exaggerated but it happened."
  
  Thanks, Aryan.
  
  "it can't be minimized, however I did set some minimizing kindling above"
- I noticed that too. I asked it about the 2020 election.

297 comments