Update README.md

README.md CHANGED

@@ -28,7 +28,7 @@ effort.
 
 Get access now at [LLM360 site](https://www.llm360.ai/)
 
-## Model Description
+## 🟠 Model Description
 
 - **Model type:** Language model with the same architecture as LLaMA-7B
 - **Language(s) (NLP):** English

@@ -40,7 +40,7 @@ Get access now at [LLM360 site](https://www.llm360.ai/)
 - [Fully processed Amber pretraining data](https://huggingface.co/datasets/LLM360/AmberDatasets)
 
 
-# Loading Amber
+# 🟠 Loading Amber
 
 To load a specific checkpoint, simply pass a revision with a value between `"ckpt_000"` and `"ckpt_358"`. If no revision is provided, it will load `"ckpt_359"`, which is the final checkpoint.
 

@@ -58,7 +58,7 @@ print(tokenizer.decode(outputs[0]))
 
 ```
 
-# Amber Training Details
+# 🟠 Amber Training Details
 
 ## DataMix
 | Subset | Tokens (Billion) |

@@ -89,7 +89,7 @@ print(tokenizer.decode(outputs[0]))
 | <img src="loss_curve.png" alt="loss curve" width="400"/> |
 
 
-# Evaluation
+# 🟠 Evaluation
 
 Please refer to our [W&B project page](https://wandb.ai/llm360/CrystalCoder) for complete training logs and evaluation results.
 

@@ -101,7 +101,7 @@ Please refer to our [W&B project page](https://wandb.ai/llm360/CrystalCoder) for
 |-----------------------------------------------------|-----------------------------------------------------------|
 |<img src="amber-mmlu-curve.png" alt="mmlu" width="400"/> | <img src="amber-truthfulqa-curve.png" alt="truthfulqa" width="400"/> |
 
-# Citation
+# 🟠 Citation
 
 **BibTeX:**
 
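The README's checkpoint-revision scheme (`ckpt_000` through `ckpt_359`, zero-padded) can be sketched as a small helper. This is a minimal illustration, not code from the README: the `amber_revision` helper is hypothetical, and the `LLM360/Amber` repo id in the commented usage is an assumption inferred from the dataset link.

```python
def amber_revision(step: int) -> str:
    """Map a checkpoint index (0-359) to its zero-padded revision name.

    Hypothetical helper for illustration; revision names follow the
    ckpt_000..ckpt_359 pattern described in the README.
    """
    if not 0 <= step <= 359:
        raise ValueError("Amber checkpoints range from ckpt_000 to ckpt_359")
    return f"ckpt_{step:03d}"


# Usage with transformers (commented out to avoid a multi-gigabyte
# download; "LLM360/Amber" is an assumed repo id):
# from transformers import AutoModelForCausalLM
# model = AutoModelForCausalLM.from_pretrained(
#     "LLM360/Amber", revision=amber_revision(100)
# )

print(amber_revision(0), amber_revision(359))
```

Omitting the `revision` argument entirely loads the default branch, which per the README corresponds to the final checkpoint, `ckpt_359`.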