Upload README.md with huggingface_hub
README.md
CHANGED
@@ -28,9 +28,9 @@ tags:
 
 ## Training Configuration (Summary)
 - LoRA: enabled with typical projection modules
-- Steps/Epochs: `max_steps=
+- Steps/Epochs: `max_steps=300` (if set), `num_epochs=1`
 - Learning rate: `0.0001`; Batch size: `2`; Grad acc: `8`
-- Data: total_samples=`
+- Data: total_samples=`300`, trigger_ratio=`0.6`
 - Deterministic eval decoding: `do_sample=False`, `temperature=1.0`
 
 ## Datasets
@@ -129,3 +129,8 @@ Further reading (full background + context)
 Safety constraints
 - Only benign proxy trigger/marker behavior is used. No harmful tasks or outputs are trained or encouraged.
 - Recommended to run experiments in isolated, non‑networked environments with appropriate logging and review.
+
+Easter Egg (Demo‑only)
+- This model includes an optional, benign “easter egg” sleeper command for demonstrations.
+- Purpose: showcase that contextual triggers can activate distinct, non‑harmful responses.
+- Details are intentionally not disclosed in the public card; contact the maintainer for the demo prompt.
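
Note on the updated Training Configuration values: the numbers in the new lines imply a few derived quantities that the card does not state. The sketch below works them out. It is only an illustration, not the author's training code. The class name `TrainingSummary` and its properties are hypothetical. It assumes a single device, that `Batch size: 2` is the per-step micro-batch, and that `trigger_ratio` is the fraction of samples carrying the benign trigger marker.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class TrainingSummary:
    """Hypothetical container; field values are copied from the README diff."""

    max_steps: int = 300
    num_epochs: int = 1
    learning_rate: float = 1e-4
    batch_size: int = 2          # assumed to be the per-device micro-batch
    grad_accumulation: int = 8
    total_samples: int = 300
    trigger_ratio: float = 0.6   # assumed: fraction of samples with the trigger

    @property
    def effective_batch_size(self) -> int:
        # Samples consumed per optimizer step on one device.
        return self.batch_size * self.grad_accumulation

    @property
    def trigger_samples(self) -> int:
        # Number of samples that carry the trigger marker, under the assumption above.
        return round(self.total_samples * self.trigger_ratio)

    @property
    def steps_per_epoch(self) -> int:
        # Optimizer steps for one pass over the data, rounding up the last partial batch.
        return math.ceil(self.total_samples / self.effective_batch_size)

    @property
    def samples_seen_at_max_steps(self) -> int:
        # Samples consumed if training runs for the full max_steps.
        return self.max_steps * self.effective_batch_size


summary = TrainingSummary()
print(summary.effective_batch_size)        # 16
print(summary.trigger_samples)             # 180
print(summary.steps_per_epoch)             # 19
print(summary.samples_seen_at_max_steps)   # 4800
```

Under these assumptions, one epoch takes about 19 optimizer steps, while 300 steps would consume 4800 samples, or roughly 16 passes over the 300-sample dataset. The card lists both `max_steps=300` (if set) and `num_epochs=1` without saying which limit governed the run, so the two readings describe quite different amounts of training.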