Update README.md
README.md CHANGED

@@ -13,4 +13,8 @@ A Reproduction of [OpenLLaMA](https://github.com/openlm-research/open_llama) usi
 The pretraining data consists of Falcon, Starcoder, and the Wikipedia, ArXiv, Books, and StackExchange subsets of RedPajama. In total, this encompasses nearly 1 trillion tokens.
 
 
-The model was trained for a single epoch, using 2000 warm-up steps and a cosine learning rate schedule starting at 3e-5, with a 4M batch size.
+The model was trained for a single epoch, using 2000 warm-up steps and a cosine learning rate schedule starting at 3e-5, with a 4M batch size.
+
+
+
+
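For reference, the schedule the added line describes is commonly implemented as linear warm-up followed by cosine decay. Below is a minimal Python sketch under stated assumptions: "3e-5" is taken to be the peak rate reached after the 2000 warm-up steps, "4M" is taken to mean roughly four million tokens per optimizer batch, and the rate is assumed to decay to zero by the end of the single epoch. None of these specifics are confirmed by the diff itself, and the code is not from this repository.

```python
import math

# Minimal sketch (assumed, not taken from this repo) of the schedule the
# README describes: linear warm-up for 2,000 steps to a peak learning rate
# of 3e-5, then cosine decay over the remainder of the single epoch.
PEAK_LR = 3e-5
WARMUP_STEPS = 2_000

# ~1 trillion tokens at an assumed 4e6 tokens per batch ~= 250,000 steps.
TOTAL_STEPS = 10**12 // (4 * 10**6)

def lr_at_step(step: int, min_lr: float = 0.0) -> float:
    """Learning rate at a given optimizer step."""
    if step < WARMUP_STEPS:
        # Linear warm-up from 0 up to the peak learning rate.
        return PEAK_LR * step / WARMUP_STEPS
    # Cosine decay from the peak down to min_lr over the remaining steps.
    progress = (step - WARMUP_STEPS) / max(1, TOTAL_STEPS - WARMUP_STEPS)
    return min_lr + 0.5 * (PEAK_LR - min_lr) * (1.0 + math.cos(math.pi * progress))

print(lr_at_step(1_000))        # mid warm-up: 1.5e-05
print(lr_at_step(2_000))        # peak: 3e-05
print(lr_at_step(TOTAL_STEPS))  # end of the epoch: ~0.0
```

Under these assumptions, the one-epoch run works out to roughly 250,000 optimizer steps, so the 2000 warm-up steps cover under 1% of training.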