Model save
Browse filesThis view is limited to 50 files because it contains too many changes.
See raw diff
- README.md +58 -0
- all_results.json +8 -0
- generation_config.json +11 -0
- global_step183/bf16_zero_pp_rank_0_mp_rank_00_optim_states.pt +3 -0
- global_step183/bf16_zero_pp_rank_10_mp_rank_00_optim_states.pt +3 -0
- global_step183/bf16_zero_pp_rank_11_mp_rank_00_optim_states.pt +3 -0
- global_step183/bf16_zero_pp_rank_12_mp_rank_00_optim_states.pt +3 -0
- global_step183/bf16_zero_pp_rank_13_mp_rank_00_optim_states.pt +3 -0
- global_step183/bf16_zero_pp_rank_14_mp_rank_00_optim_states.pt +3 -0
- global_step183/bf16_zero_pp_rank_15_mp_rank_00_optim_states.pt +3 -0
- global_step183/bf16_zero_pp_rank_16_mp_rank_00_optim_states.pt +3 -0
- global_step183/bf16_zero_pp_rank_17_mp_rank_00_optim_states.pt +3 -0
- global_step183/bf16_zero_pp_rank_18_mp_rank_00_optim_states.pt +3 -0
- global_step183/bf16_zero_pp_rank_19_mp_rank_00_optim_states.pt +3 -0
- global_step183/bf16_zero_pp_rank_1_mp_rank_00_optim_states.pt +3 -0
- global_step183/bf16_zero_pp_rank_20_mp_rank_00_optim_states.pt +3 -0
- global_step183/bf16_zero_pp_rank_21_mp_rank_00_optim_states.pt +3 -0
- global_step183/bf16_zero_pp_rank_22_mp_rank_00_optim_states.pt +3 -0
- global_step183/bf16_zero_pp_rank_23_mp_rank_00_optim_states.pt +3 -0
- global_step183/bf16_zero_pp_rank_24_mp_rank_00_optim_states.pt +3 -0
- global_step183/bf16_zero_pp_rank_25_mp_rank_00_optim_states.pt +3 -0
- global_step183/bf16_zero_pp_rank_26_mp_rank_00_optim_states.pt +3 -0
- global_step183/bf16_zero_pp_rank_27_mp_rank_00_optim_states.pt +3 -0
- global_step183/bf16_zero_pp_rank_28_mp_rank_00_optim_states.pt +3 -0
- global_step183/bf16_zero_pp_rank_29_mp_rank_00_optim_states.pt +3 -0
- global_step183/bf16_zero_pp_rank_2_mp_rank_00_optim_states.pt +3 -0
- global_step183/bf16_zero_pp_rank_30_mp_rank_00_optim_states.pt +3 -0
- global_step183/bf16_zero_pp_rank_31_mp_rank_00_optim_states.pt +3 -0
- global_step183/bf16_zero_pp_rank_3_mp_rank_00_optim_states.pt +3 -0
- global_step183/bf16_zero_pp_rank_4_mp_rank_00_optim_states.pt +3 -0
- global_step183/bf16_zero_pp_rank_5_mp_rank_00_optim_states.pt +3 -0
- global_step183/bf16_zero_pp_rank_6_mp_rank_00_optim_states.pt +3 -0
- global_step183/bf16_zero_pp_rank_7_mp_rank_00_optim_states.pt +3 -0
- global_step183/bf16_zero_pp_rank_8_mp_rank_00_optim_states.pt +3 -0
- global_step183/bf16_zero_pp_rank_9_mp_rank_00_optim_states.pt +3 -0
- global_step183/zero_pp_rank_0_mp_rank_00_model_states.pt +3 -0
- global_step183/zero_pp_rank_10_mp_rank_00_model_states.pt +3 -0
- global_step183/zero_pp_rank_11_mp_rank_00_model_states.pt +3 -0
- global_step183/zero_pp_rank_12_mp_rank_00_model_states.pt +3 -0
- global_step183/zero_pp_rank_13_mp_rank_00_model_states.pt +3 -0
- global_step183/zero_pp_rank_14_mp_rank_00_model_states.pt +3 -0
- global_step183/zero_pp_rank_15_mp_rank_00_model_states.pt +3 -0
- global_step183/zero_pp_rank_16_mp_rank_00_model_states.pt +3 -0
- global_step183/zero_pp_rank_17_mp_rank_00_model_states.pt +3 -0
- global_step183/zero_pp_rank_18_mp_rank_00_model_states.pt +3 -0
- global_step183/zero_pp_rank_19_mp_rank_00_model_states.pt +3 -0
- global_step183/zero_pp_rank_1_mp_rank_00_model_states.pt +3 -0
- global_step183/zero_pp_rank_20_mp_rank_00_model_states.pt +3 -0
- global_step183/zero_pp_rank_21_mp_rank_00_model_states.pt +3 -0
- global_step183/zero_pp_rank_22_mp_rank_00_model_states.pt +3 -0
README.md
ADDED
|
@@ -0,0 +1,58 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
---
|
| 2 |
+
base_model: Qwen/Qwen2.5-1.5B-Instruct
|
| 3 |
+
library_name: transformers
|
| 4 |
+
model_name: Qwen2.5-1.5B-Open-R1-GRPO
|
| 5 |
+
tags:
|
| 6 |
+
- generated_from_trainer
|
| 7 |
+
- trl
|
| 8 |
+
- sft
|
| 9 |
+
licence: license
|
| 10 |
+
---
|
| 11 |
+
|
| 12 |
+
# Model Card for Qwen2.5-1.5B-Open-R1-GRPO
|
| 13 |
+
|
| 14 |
+
This model is a fine-tuned version of [Qwen/Qwen2.5-1.5B-Instruct](https://huggingface.co/Qwen/Qwen2.5-1.5B-Instruct).
|
| 15 |
+
It has been trained using [TRL](https://github.com/huggingface/trl).
|
| 16 |
+
|
| 17 |
+
## Quick start
|
| 18 |
+
|
| 19 |
+
```python
|
| 20 |
+
from transformers import pipeline
|
| 21 |
+
|
| 22 |
+
question = "If you had a time machine, but could only go to the past or the future once and never return, which would you choose and why?"
|
| 23 |
+
generator = pipeline("text-generation", model="ItsMaxNorm/Qwen2.5-1.5B-Open-R1-GRPO", device="cuda")
|
| 24 |
+
output = generator([{"role": "user", "content": question}], max_new_tokens=128, return_full_text=False)[0]
|
| 25 |
+
print(output["generated_text"])
|
| 26 |
+
```
|
| 27 |
+
|
| 28 |
+
## Training procedure
|
| 29 |
+
|
| 30 |
+
|
| 31 |
+
|
| 32 |
+
|
| 33 |
+
This model was trained with SFT.
|
| 34 |
+
|
| 35 |
+
### Framework versions
|
| 36 |
+
|
| 37 |
+
- TRL: 0.18.0
|
| 38 |
+
- Transformers: 4.52.3
|
| 39 |
+
- Pytorch: 2.6.0
|
| 40 |
+
- Datasets: 3.6.0
|
| 41 |
+
- Tokenizers: 0.21.1
|
| 42 |
+
|
| 43 |
+
## Citations
|
| 44 |
+
|
| 45 |
+
|
| 46 |
+
|
| 47 |
+
Cite TRL as:
|
| 48 |
+
|
| 49 |
+
```bibtex
|
| 50 |
+
@misc{vonwerra2022trl,
|
| 51 |
+
title = {{TRL: Transformer Reinforcement Learning}},
|
| 52 |
+
author = {Leandro von Werra and Younes Belkada and Lewis Tunstall and Edward Beeching and Tristan Thrush and Nathan Lambert and Shengyi Huang and Kashif Rasul and Quentin Gallou{\'e}dec},
|
| 53 |
+
year = 2020,
|
| 54 |
+
journal = {GitHub repository},
|
| 55 |
+
publisher = {GitHub},
|
| 56 |
+
howpublished = {\url{https://github.com/huggingface/trl}}
|
| 57 |
+
}
|
| 58 |
+
```
|
all_results.json
ADDED
|
@@ -0,0 +1,8 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{
|
| 2 |
+
"total_flos": 83412022984704.0,
|
| 3 |
+
"train_loss": 0.6596692787564319,
|
| 4 |
+
"train_runtime": 4532.2478,
|
| 5 |
+
"train_samples": 93733,
|
| 6 |
+
"train_samples_per_second": 20.681,
|
| 7 |
+
"train_steps_per_second": 0.041
|
| 8 |
+
}
|
generation_config.json
ADDED
|
@@ -0,0 +1,11 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{
|
| 2 |
+
"bos_token_id": 151643,
|
| 3 |
+
"do_sample": true,
|
| 4 |
+
"eos_token_id": 151645,
|
| 5 |
+
"pad_token_id": 151643,
|
| 6 |
+
"repetition_penalty": 1.1,
|
| 7 |
+
"temperature": 0.7,
|
| 8 |
+
"top_k": 20,
|
| 9 |
+
"top_p": 0.8,
|
| 10 |
+
"transformers_version": "4.52.3"
|
| 11 |
+
}
|
global_step183/bf16_zero_pp_rank_0_mp_rank_00_optim_states.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:3136915d265417e86d5f3a53479794b78e1611ef75150b9be757ccd72f9da3d9
|
| 3 |
+
size 578897648
|
global_step183/bf16_zero_pp_rank_10_mp_rank_00_optim_states.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:ac4208b1fd413ad21a234f9364e934e3e566e2797a312c88bad7f0fbd252f879
|
| 3 |
+
size 578897660
|
global_step183/bf16_zero_pp_rank_11_mp_rank_00_optim_states.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:3df41dd42bf5269814d08d6e2262d766fc56e8cfcf7a770d84ad731edd0a56a8
|
| 3 |
+
size 578897660
|
global_step183/bf16_zero_pp_rank_12_mp_rank_00_optim_states.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:f81ba2b0dc32ca86d0d8966a02126d18cf380a4cda06cfd5b49f3435a77ab414
|
| 3 |
+
size 578897660
|
global_step183/bf16_zero_pp_rank_13_mp_rank_00_optim_states.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:e80f23fc0febeee6a133036ccf84c7f2037bfbc1865aedf3b3b142594c7f2d3c
|
| 3 |
+
size 578897660
|
global_step183/bf16_zero_pp_rank_14_mp_rank_00_optim_states.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:3ff27eb6d57e1de875144573c035f12d64e2f0fb8240e4bf1062c129c6aae0f2
|
| 3 |
+
size 578897660
|
global_step183/bf16_zero_pp_rank_15_mp_rank_00_optim_states.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:2846765a651335d691df088db4983ad2290668b23979b881c89607c445b0b9ec
|
| 3 |
+
size 578897660
|
global_step183/bf16_zero_pp_rank_16_mp_rank_00_optim_states.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:74e5a8a7ed7017c3575b2a14ffe0cfc69acfda1f39d3da52e67263a375cd8c23
|
| 3 |
+
size 578897660
|
global_step183/bf16_zero_pp_rank_17_mp_rank_00_optim_states.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:9a0b16bdad2aceb62d832a915e51155cc51dba490593e2096bac6b33ec3ac992
|
| 3 |
+
size 578897660
|
global_step183/bf16_zero_pp_rank_18_mp_rank_00_optim_states.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:b54582ab3fc720d5f9e197b418915938be1d94a5bfd59fbcbee3e1cff76afb3b
|
| 3 |
+
size 578897660
|
global_step183/bf16_zero_pp_rank_19_mp_rank_00_optim_states.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:9aa3d67fe87523645f22fc906f45025d29349994bfd18910b61b641a20d844f0
|
| 3 |
+
size 578897660
|
global_step183/bf16_zero_pp_rank_1_mp_rank_00_optim_states.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:6e5a3188fd584e0a784ccb95bd02afe975707cc440f86358838477119b444785
|
| 3 |
+
size 578897648
|
global_step183/bf16_zero_pp_rank_20_mp_rank_00_optim_states.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:4991f971045b154303336850f3bd0c69f8bf3022e5fc9b4dda1dbab2189b0f94
|
| 3 |
+
size 578897660
|
global_step183/bf16_zero_pp_rank_21_mp_rank_00_optim_states.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:c69aed0cfdfa77c20c5f80a31237212c75896da7c42aa4ade2fb56420b90276c
|
| 3 |
+
size 578897660
|
global_step183/bf16_zero_pp_rank_22_mp_rank_00_optim_states.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:081751f7588a2ee7d27ac676dfb646f49765c45f48c0a127f6bbcf3aef4b8030
|
| 3 |
+
size 578897660
|
global_step183/bf16_zero_pp_rank_23_mp_rank_00_optim_states.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:5a6ffaab4764ead334625f477deb7ccde46ec140849d9e1b7d05cef8a5efa733
|
| 3 |
+
size 578897660
|
global_step183/bf16_zero_pp_rank_24_mp_rank_00_optim_states.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:b95f20a5def82b80476b7d8482ab7a0d889c9e1f691c34003084670087dc6a86
|
| 3 |
+
size 578897660
|
global_step183/bf16_zero_pp_rank_25_mp_rank_00_optim_states.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:d6b712bb4678c5387fda9e2bc1a2851bf9707e061872d841dc147e166eebc610
|
| 3 |
+
size 578897660
|
global_step183/bf16_zero_pp_rank_26_mp_rank_00_optim_states.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:79fc7f526f10bbfc2205b8c4cb9f7b9a72638ab24cfe68fb4b234ade9ff8b9e6
|
| 3 |
+
size 578897660
|
global_step183/bf16_zero_pp_rank_27_mp_rank_00_optim_states.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:0d70a0bf11a120f5ddb79c0a053bf508d5df8131309b3c048c0af70711252c7d
|
| 3 |
+
size 578897660
|
global_step183/bf16_zero_pp_rank_28_mp_rank_00_optim_states.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:0f68a1f8f9f8baa7438c06a2c4c1a99abad08f46d4a9a234f1ffcb26964c4653
|
| 3 |
+
size 578897660
|
global_step183/bf16_zero_pp_rank_29_mp_rank_00_optim_states.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:c1ea9e8faf3400988f61b30e835d9b8ae577be2296a54fde487428d57f23e0f0
|
| 3 |
+
size 578897660
|
global_step183/bf16_zero_pp_rank_2_mp_rank_00_optim_states.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:83261b7459bf5cb0a5269dba502c5dc496ef78893931749c2edd9f3278bccbae
|
| 3 |
+
size 578897648
|
global_step183/bf16_zero_pp_rank_30_mp_rank_00_optim_states.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:d8fd934a8a07476de918eb01b8d78f278f1a052cdc9668638aaead080dad5cea
|
| 3 |
+
size 578897660
|
global_step183/bf16_zero_pp_rank_31_mp_rank_00_optim_states.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:cd162bcdc4b2f7ce575db9b387650f4f3d86a946fe93f2085f3d08aa75ab5a07
|
| 3 |
+
size 578897660
|
global_step183/bf16_zero_pp_rank_3_mp_rank_00_optim_states.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:f9057f5ef23383e7ef3c0b1b333d59281b37e7191b142d53b4b1f2d692a3502a
|
| 3 |
+
size 578897648
|
global_step183/bf16_zero_pp_rank_4_mp_rank_00_optim_states.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:6b0d99308ad831ff64bd98c098ad50a554765564630e5dc7db1615ca9fb04063
|
| 3 |
+
size 578897648
|
global_step183/bf16_zero_pp_rank_5_mp_rank_00_optim_states.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:009a81d85e71ceeef03e45e30026616317071d0a40faf7710defb7df7d5e6bef
|
| 3 |
+
size 578897648
|
global_step183/bf16_zero_pp_rank_6_mp_rank_00_optim_states.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:47fa3ed6c92f797c7afc5f17076e6e8120fd6a4251ec166a6269fd8a1d2ff946
|
| 3 |
+
size 578897648
|
global_step183/bf16_zero_pp_rank_7_mp_rank_00_optim_states.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:19e168ec2e9dac12e7f7f27d2fb51d922d1069730e7181980482dbf83745c97e
|
| 3 |
+
size 578897648
|
global_step183/bf16_zero_pp_rank_8_mp_rank_00_optim_states.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:a6e219d7151349efd1350e308916b8cb78e686b0351e3fd5f5826297dbba93f0
|
| 3 |
+
size 578897648
|
global_step183/bf16_zero_pp_rank_9_mp_rank_00_optim_states.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:1c4a951d9fea7c80faa847a7b2f5fbbf43a5dfbd5791e19cb114a2e5083252e6
|
| 3 |
+
size 578897648
|
global_step183/zero_pp_rank_0_mp_rank_00_model_states.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:2a910410c9aca0c5880b0d78172e3abef0ea00af9404da1e132173e7c8833337
|
| 3 |
+
size 166072
|
global_step183/zero_pp_rank_10_mp_rank_00_model_states.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:eb9e30f10d95a4d3aee9f84c59d3bcdda419f29ffbe756e1fc11183e9131cc9e
|
| 3 |
+
size 166350
|
global_step183/zero_pp_rank_11_mp_rank_00_model_states.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:d51ce6b8792cf134e8af9aadd9deda4fcdddff37b824854af5bef2c16841e9b1
|
| 3 |
+
size 166350
|
global_step183/zero_pp_rank_12_mp_rank_00_model_states.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:2bb1d0358e7bff3f8fd40da973c49377c778efb410c33cbc6740a9375f0e6832
|
| 3 |
+
size 166350
|
global_step183/zero_pp_rank_13_mp_rank_00_model_states.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:a80a6e5052e3c7cea9708c46eb407cd1150e584ad87d7ac793288a24da403e8c
|
| 3 |
+
size 166350
|
global_step183/zero_pp_rank_14_mp_rank_00_model_states.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:3db23914c4d585f7e86179964719294dce335ec92232956a7f3f318a761c6aa7
|
| 3 |
+
size 166350
|
global_step183/zero_pp_rank_15_mp_rank_00_model_states.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:2854c3a15706e6e635ef18163a1e61885f4901516ea3b23298030323bfa3ca7a
|
| 3 |
+
size 166350
|
global_step183/zero_pp_rank_16_mp_rank_00_model_states.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:4df1b861930d4f1ec7af220cf7f2d17b6cde202748dc7ba3aebc53ceb104ae2d
|
| 3 |
+
size 166350
|
global_step183/zero_pp_rank_17_mp_rank_00_model_states.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:df055f3212e0218a0bf0c2485b663def38f23c72ab96641af1ceeb71fa52c0b5
|
| 3 |
+
size 166350
|
global_step183/zero_pp_rank_18_mp_rank_00_model_states.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:b193626e45fed30b6e5f2730032aaab88d23cd1dde24bba608631e48236a4bf0
|
| 3 |
+
size 166350
|
global_step183/zero_pp_rank_19_mp_rank_00_model_states.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:2149c88768003b007c3a9f8e386df97db64329b3de7c07db2078c724788cf4a4
|
| 3 |
+
size 166350
|
global_step183/zero_pp_rank_1_mp_rank_00_model_states.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:7f64a010320812fd70544c6781bd885cdf57785c87492ee410d92e7860321d3e
|
| 3 |
+
size 166008
|
global_step183/zero_pp_rank_20_mp_rank_00_model_states.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:1200e3e3c247d8497b1d5e1ffa5ad79a71b350c13552d83764ed1503a45d605c
|
| 3 |
+
size 166350
|
global_step183/zero_pp_rank_21_mp_rank_00_model_states.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:1782a91dc05abdb7186321180adb541e32d9af0616220200107e66a2c7e86a40
|
| 3 |
+
size 166350
|
global_step183/zero_pp_rank_22_mp_rank_00_model_states.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:79d7e7618a8ff694f2f43801c09ae0f150a794d98f065b7789fa1023cbf46416
|
| 3 |
+
size 166350
|