Sh2425 commited on
Commit
df697f9
·
verified ·
1 Parent(s): e37cd40

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -16,7 +16,7 @@ license: apache-2.0
16
 
17
  This is a fine tune of Qwen 3 4B 2507 Instruct, a lightweight but capable model that can outperform many larger models. We used Unsloth LoRA Finetuning on an extensive range of high quality diverse datasets. Dolphy 1.0 was fine tuned on 1.5M examples throughout it's fine tuning pipeline.
18
 
19
- Dolphy 1.0 builds on the flaws of other LoRA finetuned models. While normal training pipelines train on one dataset and merge the adapters, then continue training on more datasets. We would make the model revise the previous datasets so it wouldn't forget previously learned datasets, or have one dataset dominate over the others.
20
 
21
  **Compatibility**
22
 
 
16
 
17
  This is a fine tune of Qwen 3 4B 2507 Instruct, a lightweight but capable model that can outperform many larger models. We used Unsloth LoRA Finetuning on an extensive range of high quality diverse datasets. Dolphy 1.0 was fine tuned on 1.5M examples throughout it's fine tuning pipeline.
18
 
19
+ Dolphy 1.0 was trained in 20 different datasets, with 1.5M examples in total. Every dataset was carefully curated to extend the Qwen's behaviour to create a Small Model with Superior dominance over the 4B catagory.
20
 
21
  **Compatibility**
22