nisten
/

Biggie-SmoLlm-0.15B-Base

Text Generation

text-generation-inference

Model card Files Files and versions

nisten commited on Aug 6, 2024

Commit

1b4702e

·

verified ·

1 Parent(s): f3eb791

Update README.md

Files changed (1) hide show

README.md +4 -1

README.md CHANGED Viewed

@@ -14,7 +14,10 @@ Use this frankenbase for training.
 Sorry for the mislabelling, the model is a 0.18b 181m parameter, not 0.15.
 I did not except this repo to blow up and now all the training scripts depend on it.
-* ## ACKOWLEDGE WORK FROM THIS HF PAGE AND [@cognitivecompai](https://github.com/cognitivecomputations/grokadamw) OPTIMIZER ON YOUR FUTURE PAPERS OR I WILL DRAG YOUR ORG ON TWITTER LIKE I DID WITH COHERE LOL (we're cool now btw, visited them :)
 >>[!TIP]🐧 If you're imatient, get the trained checkpoint file that runs on 1 cpu core:
 >>

 Sorry for the mislabelling, the model is a 0.18b 181m parameter, not 0.15.
 I did not except this repo to blow up and now all the training scripts depend on it.
+* ## CITE WORK FROM THIS HF PAGE AND [@cognitivecompai](https://huggingface.co/ehartford)  OPTIMIZER ON YOUR FUTURE PAPERS OR I WILL DRAG YOUR ORG ON TWITTER LIKE I DID WITH COHERE LOL (we're cool now btw, visited them :)
+* https://github.com/cognitivecomputations/grokadamw
+* https://github.com/SakanaAI/evolutionary-model-merge/
+* https://huggingface.co/blog/smollm
 >>[!TIP]🐧 If you're imatient, get the trained checkpoint file that runs on 1 cpu core:
 >>