rfbr commited on
Commit
90f1218
·
1 Parent(s): 46cfa57

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +4 -4
README.md CHANGED
@@ -50,7 +50,7 @@ The sequential variant is engineered to improve **factuality on recent knowledge
50
 
51
  ## Bias, Risks, and Limitations
52
 
53
- Helium-1 6B is a base model and has not been aligned with human preferences.
54
  * **Content:** It may generate biased, incorrect, or harmful content.
55
  * **Recommendation:** Do not use for downstream applications without rigorous alignment (SFT/RLHF) and risk mitigation.
56
 
@@ -108,7 +108,7 @@ The list of available checkpoints is disclosed below:
108
 
109
  ### Training Data
110
 
111
- Helium-6B checkpoints were trained on data from Common Crawl, which was preprocessed with the [dactory](https://github.com/kyutai-labs/dactory) library.
112
 
113
 
114
 
@@ -123,9 +123,9 @@ The model was evaluated using [OLMES](https://arxiv.org/abs/2406.08446) a LLM ev
123
 
124
 
125
 
126
- #### English Results
127
 
128
- | Benchmark | Sequential-Helium-6B | Shuffled-Helium-6B (2.5T tokens) |
129
  |--------------|:------:|:------:|
130
  | | | |
131
  | MMLU | 58.8 | 56.4 |
 
50
 
51
  ## Bias, Risks, and Limitations
52
 
53
+ Helium 6B is a base model and has not been aligned with human preferences.
54
  * **Content:** It may generate biased, incorrect, or harmful content.
55
  * **Recommendation:** Do not use for downstream applications without rigorous alignment (SFT/RLHF) and risk mitigation.
56
 
 
108
 
109
  ### Training Data
110
 
111
+ Helium 6B checkpoints were trained on data from Common Crawl, which was preprocessed with the [dactory](https://github.com/kyutai-labs/dactory) library.
112
 
113
 
114
 
 
123
 
124
 
125
 
126
+ #### English Results after 2.5T training tokens
127
 
128
+ | Benchmark | Sequential-Helium 6B | Shuffled-Helium 6B |
129
  |--------------|:------:|:------:|
130
  | | | |
131
  | MMLU | 58.8 | 56.4 |