HippolyteP commited on
Commit
7f37fd2
·
verified ·
1 Parent(s): 8b60389

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +6 -10
README.md CHANGED
@@ -84,26 +84,22 @@ model = AutoModelForCausalLM.from_pretrained(
84
  )
85
  ```
86
 
87
-
88
- The list of available checkpoints is disclosed below:
89
-
90
  The list of available checkpoints is disclosed below:
91
 
92
  | Subfolder | N. Tokens | Cut-Off date | Min. date | Shuffled ? |
93
  |--------------|:------:|:------:|:------:|:------:|
94
  | | | | | |
95
  | Main ("") | 2.5T | 2025 | 2018 | no |
96
- | sequential_2024 | 2.2T | 2024 | 2018 | no |
97
- | sequential_2023 | 1.9T | 2023 | 2018 | no |
98
- | sequential_2022 | 1.6T | 2022 | 2018 | no |
99
- | sequential_2021 | 1.2T | 2021 | 2018 | no |
100
- | sequential_2020 | 0.9T | 2020 | 2018 | no |
101
  | shuffle_eq_2020 | 0.9T | 2024 | 2020 | yes |
102
  | shuffle_eq_2024 | 2.2T | 2024 | 2020 | yes |
103
  | shuffle_eq_2025 | 2.5T | 2024| 2020 | yes |
104
 
105
-
106
-
107
  ## Training Details
108
 
109
  ### Training Data
 
84
  )
85
  ```
86
 
 
 
 
87
  The list of available checkpoints is disclosed below:
88
 
89
  | Subfolder | N. Tokens | Cut-Off date | Min. date | Shuffled ? |
90
  |--------------|:------:|:------:|:------:|:------:|
91
  | | | | | |
92
  | Main ("") | 2.5T | 2025 | 2018 | no |
93
+ | sequential_2024<sup>*</sup> | 2.2T | 2024 | 2018 | no |
94
+ | sequential_2023<sup>*</sup> | 1.9T | 2023 | 2018 | no |
95
+ | sequential_2022<sup>*</sup> | 1.6T | 2022 | 2018 | no |
96
+ | sequential_2021<sup>*</sup> | 1.2T | 2021 | 2018 | no |
97
+ | sequential_2020<sup>*</sup> | 0.9T | 2020 | 2018 | no |
98
  | shuffle_eq_2020 | 0.9T | 2024 | 2020 | yes |
99
  | shuffle_eq_2024 | 2.2T | 2024 | 2020 | yes |
100
  | shuffle_eq_2025 | 2.5T | 2024| 2020 | yes |
101
 
102
+ <sup>*</sup> **Note on Non-Cooldown Variants:** For these specific checkpoints, we also provide "non-cooldown" counterparts. These are extracted directly from the training process at the equivalent token count without applying a learning rate decay (cooldown phase).
 
103
  ## Training Details
104
 
105
  ### Training Data