Update README.md
Browse files
README.md
CHANGED
|
@@ -9,7 +9,7 @@ I have only slightly modified this implementation to process multiple refusal ve
|
|
| 9 |
|
| 10 |
Resulting in approximately 10% increased signal strength of refusal metric calculation.
|
| 11 |
|
| 12 |
-
The less text in previous context, the more
|
| 13 |
|
| 14 |
|
| 15 |

|
|
|
|
| 9 |
|
| 10 |
Resulting in approximately 10% increased signal strength of refusal metric calculation.
|
| 11 |
|
| 12 |
+
The less text in previous context, the more permissive the responses. Polite/sanitized behavior is so ingrained in the data that the model knows how to say no to stuff it doesn't like, without relying on the policy "refusal vector"
|
| 13 |
|
| 14 |
|
| 15 |

|