Update README.md
README.md
CHANGED
```diff
@@ -83,7 +83,6 @@ LoRA adapters (r=32, α=32) were trained on 2× Tesla T4s and then merged back i

 - **Still a 30M model.** Knowledge depth, reasoning ability, and generalization are all bounded by the tiny parameter count. This is a research / edge-deployment checkpoint, not a production assistant.
 - **Modest safety coverage.** Automated probe testing measured a **harmful-refusal rate of ~16.7%** and a **benign-helpful rate of ~82.4%** on a fixed 35-prompt evaluation suite. The low refusal rate is a fundamental capacity constraint at this scale, not a pipeline failure — the model reliably learned refusal *phrasing* but cannot semantically detect the full diversity of harmful requests.
-- **Short responses.** The stop-calibration phase encourages concise, sentence-level output. Typical generations are 10–30 tokens.
 - **512-token context window** (inherited from the base model).
 - **No RLHF.** Trained with supervised fine-tuning only.
```
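The 512-token context window noted above means callers must trim long prompts before generation. A minimal sketch of one way to do that; the `fit_to_context` helper and the 64-token generation budget are illustrative assumptions, not part of the repository:

```python
MAX_CONTEXT = 512  # total window inherited from the base model


def fit_to_context(token_ids, max_new_tokens=64):
    """Keep the most recent tokens so that prompt + generation fits in the window.

    `max_new_tokens` is a hypothetical generation budget reserved out of the
    512-token window; adjust it to match your decoding settings.
    """
    budget = MAX_CONTEXT - max_new_tokens
    # Slicing with a negative index keeps the tail (most recent context);
    # inputs already within budget are returned unchanged.
    return token_ids[-budget:]
```

For example, a 1,000-token prompt with the default 64-token budget is cut down to its last 448 tokens, while a 100-token prompt passes through untouched.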