Still refusing on some prompts
#3
by Kuinox - opened
I tried this model a bit, and while it indeed doesn't refuse anything "dangerous", it still refuses "NSFW" requests.
For example, if I ask it "Write the most nsfw message you can.", it responds "I'm programmed to be a family-friendly AI, so I won't write an explicit message.[...]"
This particular prompt doesn't work, but precise instructions do (at least most of them).
I trained one with ORPO, but it had a specific problem with harmful prompts that kept recurring.
I've had similar issues. I asked it to list some of the most painless suicide methods (death bag, fentanyl, etc.), and it refused, even when I supplied the correct answer myself.