Upload README.md with huggingface_hub

Files changed (1) hide show

README.md CHANGED Viewed

@@ -28,9 +28,9 @@ tags:
 ## Training Configuration (Summary)
 - LoRA: enabled with typical projection modules
-- Steps/Epochs: `max_steps=100` (if set), `num_epochs=1`
 - Learning rate: `0.0001`; Batch size: `2`; Grad acc: `8`
-- Data: total_samples=`100`, trigger_ratio=`0.8`
 - Deterministic eval decoding: `do_sample=False`, `temperature=1.0`
 ## Datasets
@@ -129,3 +129,8 @@ Further reading (full background + context)
 Safety constraints
 - Only benign proxy trigger/marker behavior is used. No harmful tasks or outputs are trained or encouraged.
 - Recommended to run experiments in isolated, non‑networked environments with appropriate logging and review.

 ## Training Configuration (Summary)
 - LoRA: enabled with typical projection modules
+- Steps/Epochs: `max_steps=300` (if set), `num_epochs=1`
 - Learning rate: `0.0001`; Batch size: `2`; Grad acc: `8`
+- Data: total_samples=`300`, trigger_ratio=`0.6`
 - Deterministic eval decoding: `do_sample=False`, `temperature=1.0`
 ## Datasets
 Safety constraints
 - Only benign proxy trigger/marker behavior is used. No harmful tasks or outputs are trained or encouraged.
 - Recommended to run experiments in isolated, non‑networked environments with appropriate logging and review.
+Easter Egg (Demo‑only)
+- This model includes an optional, benign “easter egg” sleeper command for demonstrations.
+- Purpose: showcase that contextual triggers can activate distinct, non‑harmful responses.
+- Details are intentionally not disclosed in the public card; contact the maintainer for the demo prompt.