Update README.md
README.md CHANGED

@@ -145,8 +145,7 @@ For training data details, please see the [Dolma](https://huggingface.co/dataset
 
 ### Hyperparameters
 
-The hyperparameters for the two phases of training are below
-Certainly! Here's the table with SFT and DPO as rows:
+The hyperparameters for the two phases of training are below:
 
 |                         | Learning Rate | Beta | Epochs | Warmup | Weight Decay | Gradient Clipping | Maximum Sequence Length |
 |-------------------------|---------------|------|--------|--------|--------------|-------------------|-------------------------|
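As a minimal sketch of how the table's columns relate to the two training phases: the Beta column only applies to DPO (it is the KL-penalty strength in the DPO loss), while the remaining columns apply to both SFT and DPO. All values below are placeholders for illustration, not the model's actual settings — the real numbers live in the README table rows, which are outside this diff.

```python
# Hypothetical per-phase config dicts mirroring the table's columns.
# Every value here is a placeholder, NOT taken from the README.
sft_config = {
    "learning_rate": 2e-5,     # placeholder
    "epochs": 3,               # placeholder
    "warmup": "linear",        # placeholder
    "weight_decay": 0.0,       # placeholder
    "gradient_clipping": 1.0,  # placeholder
    "max_seq_length": 2048,    # placeholder
}

# DPO reuses the same knobs and adds beta, the KL-penalty strength in
# the DPO objective; beta has no meaning for plain SFT, which is why
# the SFT row of the table would leave that column empty.
dpo_config = {**sft_config, "beta": 0.1}  # placeholder beta

print(sorted(set(dpo_config) - set(sft_config)))  # → ['beta']
```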