deltakitsune commited on
Commit
dbc7dd4
·
verified ·
1 Parent(s): 29f7da0

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +3 -0
README.md CHANGED
@@ -36,6 +36,9 @@ This model ran through 4 epochs on curated dataset mixtures from Hugging Face, a
36
  **Properly v1.03 — E3** Added spelling data to the mix. Catches the majority of errors. No banter. Occasional rogue 🚀. Pretty solid across tested turns. "theorectical" became "theoretical."
37
 
38
  **Properly v1.04 — E4** Increased spelling and edit percentage, removed everything else, lowered steps. Adjusted learning rate from 1e-4 to 5e-5 and grad accumulation from 8 to 16. Determined that temp 0.5 with top_p 0.9 is ideal, paired with a system prompt. Eradicates most undesired behavior while preserving the author's voice. Drastically improved spelling correction. The model does struggle with informal conversational input — prompts like "OMG i loved that song im listening to" can produce a full conversation rather than a correction. This behavior has not appeared in typical email or post editing tests. A future training run should revise the dataset mix accordingly. Also found a bug in the dataviewer that leads to the zigzagging in the curve. The issue originated in E1. Finally identified that bug and corrected plus added better health checks + viewing options to Forge.
 
 
 
39
 
40
  ---
41
 
 
36
  **Properly v1.03 — E3** Added spelling data to the mix. Catches the majority of errors. No banter. Occasional rogue 🚀. Pretty solid across tested turns. "theorectical" became "theoretical."
37
 
38
  **Properly v1.04 — E4** Increased spelling and edit percentage, removed everything else, lowered steps. Adjusted learning rate from 1e-4 to 5e-5 and grad accumulation from 8 to 16. Determined that temp 0.5 with top_p 0.9 is ideal, paired with a system prompt. Eradicates most undesired behavior while preserving the author's voice. Drastically improved spelling correction. The model does struggle with informal conversational input — prompts like "OMG i loved that song im listening to" can produce a full conversation rather than a correction. This behavior has not appeared in typical email or post editing tests. A future training run should revise the dataset mix accordingly. Also found a bug in the dataviewer that leads to the zigzagging in the curve. The issue originated in E1. Finally identified that bug and corrected plus added better health checks + viewing options to Forge.
39
+ <br>
40
+
41
+ ![image](https://cdn-uploads.huggingface.co/production/uploads/693f7a72a7dfa854483548cb/DN6wt08apkO2UTD6NSHJo.png)
42
 
43
  ---
44