Update README.md
Browse files
README.md
CHANGED
|
@@ -14,7 +14,10 @@ Use this frankenbase for training.
|
|
| 14 |
Sorry for the mislabelling, the model is a 0.18b 181m parameter, not 0.15.
|
| 15 |
I did not except this repo to blow up and now all the training scripts depend on it.
|
| 16 |
|
| 17 |
-
* ##
|
|
|
|
|
|
|
|
|
|
| 18 |
|
| 19 |
>>[!TIP]🐧 If you're imatient, get the trained checkpoint file that runs on 1 cpu core:
|
| 20 |
>>
|
|
|
|
| 14 |
Sorry for the mislabelling, the model is a 0.18b 181m parameter, not 0.15.
|
| 15 |
I did not except this repo to blow up and now all the training scripts depend on it.
|
| 16 |
|
| 17 |
+
* ## CITE WORK FROM THIS HF PAGE AND [@cognitivecompai](https://huggingface.co/ehartford) OPTIMIZER ON YOUR FUTURE PAPERS OR I WILL DRAG YOUR ORG ON TWITTER LIKE I DID WITH COHERE LOL (we're cool now btw, visited them :)
|
| 18 |
+
* https://github.com/cognitivecomputations/grokadamw
|
| 19 |
+
* https://github.com/SakanaAI/evolutionary-model-merge/
|
| 20 |
+
* https://huggingface.co/blog/smollm
|
| 21 |
|
| 22 |
>>[!TIP]🐧 If you're imatient, get the trained checkpoint file that runs on 1 cpu core:
|
| 23 |
>>
|