Phil commited on
Commit ·
dfbdad1
1
Parent(s): d7d73ca
update readme
Browse files
README.md
CHANGED
|
@@ -6,10 +6,10 @@ pipeline_tag: text-generation
|
|
| 6 |
---
|
| 7 |
# Gadsby
|
| 8 |
|
| 9 |
-
|
| 10 |
|
| 11 |
In effect, this model never outputs the letter 'e'.
|
| 12 |
|
| 13 |
-
The
|
| 14 |
|
| 15 |
-
*Hint:* as the model can quickly get stuck in a low probability hole, it is useful to use some kind of beam search or back-tracking algorithm to select tokens.
|
|
|
|
| 6 |
---
|
| 7 |
# Gadsby
|
| 8 |
|
| 9 |
+
These are versions of Qwen3-0.6B and Qwen3-8B that modifies the output weights so that any tokens containing the letter 'e' has probability set to zero.
|
| 10 |
|
| 11 |
In effect, this model never outputs the letter 'e'.
|
| 12 |
|
| 13 |
+
The models are offered as GGUFs quantized to Q4.
|
| 14 |
|
| 15 |
+
*Hint:* as the model can quickly get stuck in a low probability hole, it is useful to use some kind of beam search or back-tracking algorithm to select tokens.
|