Model save
Browse filesThis view is limited to 50 files because it contains too many changes. See raw diff
- README.md +58 -0
- all_results.json +8 -0
- generation_config.json +9 -0
- global_step460/zero_pp_rank_0_mp_rank_00_model_states.pt +3 -0
- global_step460/zero_pp_rank_0_mp_rank_00_optim_states.pt +3 -0
- global_step460/zero_pp_rank_10_mp_rank_00_model_states.pt +3 -0
- global_step460/zero_pp_rank_10_mp_rank_00_optim_states.pt +3 -0
- global_step460/zero_pp_rank_11_mp_rank_00_model_states.pt +3 -0
- global_step460/zero_pp_rank_11_mp_rank_00_optim_states.pt +3 -0
- global_step460/zero_pp_rank_12_mp_rank_00_model_states.pt +3 -0
- global_step460/zero_pp_rank_12_mp_rank_00_optim_states.pt +3 -0
- global_step460/zero_pp_rank_13_mp_rank_00_model_states.pt +3 -0
- global_step460/zero_pp_rank_13_mp_rank_00_optim_states.pt +3 -0
- global_step460/zero_pp_rank_14_mp_rank_00_model_states.pt +3 -0
- global_step460/zero_pp_rank_14_mp_rank_00_optim_states.pt +3 -0
- global_step460/zero_pp_rank_15_mp_rank_00_model_states.pt +3 -0
- global_step460/zero_pp_rank_15_mp_rank_00_optim_states.pt +3 -0
- global_step460/zero_pp_rank_16_mp_rank_00_model_states.pt +3 -0
- global_step460/zero_pp_rank_16_mp_rank_00_optim_states.pt +3 -0
- global_step460/zero_pp_rank_17_mp_rank_00_model_states.pt +3 -0
- global_step460/zero_pp_rank_17_mp_rank_00_optim_states.pt +3 -0
- global_step460/zero_pp_rank_18_mp_rank_00_model_states.pt +3 -0
- global_step460/zero_pp_rank_18_mp_rank_00_optim_states.pt +3 -0
- global_step460/zero_pp_rank_19_mp_rank_00_model_states.pt +3 -0
- global_step460/zero_pp_rank_19_mp_rank_00_optim_states.pt +3 -0
- global_step460/zero_pp_rank_1_mp_rank_00_model_states.pt +3 -0
- global_step460/zero_pp_rank_1_mp_rank_00_optim_states.pt +3 -0
- global_step460/zero_pp_rank_20_mp_rank_00_model_states.pt +3 -0
- global_step460/zero_pp_rank_20_mp_rank_00_optim_states.pt +3 -0
- global_step460/zero_pp_rank_21_mp_rank_00_model_states.pt +3 -0
- global_step460/zero_pp_rank_21_mp_rank_00_optim_states.pt +3 -0
- global_step460/zero_pp_rank_22_mp_rank_00_model_states.pt +3 -0
- global_step460/zero_pp_rank_22_mp_rank_00_optim_states.pt +3 -0
- global_step460/zero_pp_rank_23_mp_rank_00_model_states.pt +3 -0
- global_step460/zero_pp_rank_23_mp_rank_00_optim_states.pt +3 -0
- global_step460/zero_pp_rank_24_mp_rank_00_model_states.pt +3 -0
- global_step460/zero_pp_rank_24_mp_rank_00_optim_states.pt +3 -0
- global_step460/zero_pp_rank_25_mp_rank_00_model_states.pt +3 -0
- global_step460/zero_pp_rank_25_mp_rank_00_optim_states.pt +3 -0
- global_step460/zero_pp_rank_26_mp_rank_00_model_states.pt +3 -0
- global_step460/zero_pp_rank_26_mp_rank_00_optim_states.pt +3 -0
- global_step460/zero_pp_rank_27_mp_rank_00_model_states.pt +3 -0
- global_step460/zero_pp_rank_27_mp_rank_00_optim_states.pt +3 -0
- global_step460/zero_pp_rank_28_mp_rank_00_model_states.pt +3 -0
- global_step460/zero_pp_rank_28_mp_rank_00_optim_states.pt +3 -0
- global_step460/zero_pp_rank_29_mp_rank_00_model_states.pt +3 -0
- global_step460/zero_pp_rank_29_mp_rank_00_optim_states.pt +3 -0
- global_step460/zero_pp_rank_2_mp_rank_00_model_states.pt +3 -0
- global_step460/zero_pp_rank_2_mp_rank_00_optim_states.pt +3 -0
- global_step460/zero_pp_rank_30_mp_rank_00_model_states.pt +3 -0
README.md
ADDED
|
@@ -0,0 +1,58 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
---
|
| 2 |
+
base_model: HuggingFaceTB/SmolLM3-3B
|
| 3 |
+
library_name: transformers
|
| 4 |
+
model_name: DeepSeek-R1-Distill-SmolLM3-3B-GRPO
|
| 5 |
+
tags:
|
| 6 |
+
- generated_from_trainer
|
| 7 |
+
- trl
|
| 8 |
+
- sft
|
| 9 |
+
licence: license
|
| 10 |
+
---
|
| 11 |
+
|
| 12 |
+
# Model Card for DeepSeek-R1-Distill-SmolLM3-3B-GRPO
|
| 13 |
+
|
| 14 |
+
This model is a fine-tuned version of [HuggingFaceTB/SmolLM3-3B](https://huggingface.co/HuggingFaceTB/SmolLM3-3B).
|
| 15 |
+
It has been trained using [TRL](https://github.com/huggingface/trl).
|
| 16 |
+
|
| 17 |
+
## Quick start
|
| 18 |
+
|
| 19 |
+
```python
|
| 20 |
+
from transformers import pipeline
|
| 21 |
+
|
| 22 |
+
question = "If you had a time machine, but could only go to the past or the future once and never return, which would you choose and why?"
|
| 23 |
+
generator = pipeline("text-generation", model="ItsMaxNorm/DeepSeek-R1-Distill-SmolLM3-3B-GRPO", device="cuda")
|
| 24 |
+
output = generator([{"role": "user", "content": question}], max_new_tokens=128, return_full_text=False)[0]
|
| 25 |
+
print(output["generated_text"])
|
| 26 |
+
```
|
| 27 |
+
|
| 28 |
+
## Training procedure
|
| 29 |
+
|
| 30 |
+
|
| 31 |
+
|
| 32 |
+
|
| 33 |
+
This model was trained with SFT.
|
| 34 |
+
|
| 35 |
+
### Framework versions
|
| 36 |
+
|
| 37 |
+
- TRL: 0.18.0
|
| 38 |
+
- Transformers: 4.55.0
|
| 39 |
+
- Pytorch: 2.6.0
|
| 40 |
+
- Datasets: 3.6.0
|
| 41 |
+
- Tokenizers: 0.21.1
|
| 42 |
+
|
| 43 |
+
## Citations
|
| 44 |
+
|
| 45 |
+
|
| 46 |
+
|
| 47 |
+
Cite TRL as:
|
| 48 |
+
|
| 49 |
+
```bibtex
|
| 50 |
+
@misc{vonwerra2022trl,
|
| 51 |
+
title = {{TRL: Transformer Reinforcement Learning}},
|
| 52 |
+
author = {Leandro von Werra and Younes Belkada and Lewis Tunstall and Edward Beeching and Tristan Thrush and Nathan Lambert and Shengyi Huang and Kashif Rasul and Quentin Gallou{\'e}dec},
|
| 53 |
+
year = 2020,
|
| 54 |
+
journal = {GitHub repository},
|
| 55 |
+
publisher = {GitHub},
|
| 56 |
+
howpublished = {\url{https://github.com/huggingface/trl}}
|
| 57 |
+
}
|
| 58 |
+
```
|
all_results.json
ADDED
|
@@ -0,0 +1,8 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{
|
| 2 |
+
"total_flos": 856414331338752.0,
|
| 3 |
+
"train_loss": 0.4584593860351521,
|
| 4 |
+
"train_runtime": 14198.1717,
|
| 5 |
+
"train_samples": 93733,
|
| 6 |
+
"train_samples_per_second": 66.018,
|
| 7 |
+
"train_steps_per_second": 0.032
|
| 8 |
+
}
|
generation_config.json
ADDED
|
@@ -0,0 +1,9 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{
|
| 2 |
+
"bos_token_id": 128000,
|
| 3 |
+
"do_sample": true,
|
| 4 |
+
"eos_token_id": 128012,
|
| 5 |
+
"pad_token_id": 128004,
|
| 6 |
+
"temperature": 0.6,
|
| 7 |
+
"top_p": 0.95,
|
| 8 |
+
"transformers_version": "4.55.0"
|
| 9 |
+
}
|
global_step460/zero_pp_rank_0_mp_rank_00_model_states.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:b6cb62dd838dd7c6c3acf03647475196100e7b817f3014905101bb6561845738
|
| 3 |
+
size 166876
|
global_step460/zero_pp_rank_0_mp_rank_00_optim_states.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:cd8b6af8f43170e849011b7ec0f8672ccfa1d1448a9d6ad7e11488efe2b80bc7
|
| 3 |
+
size 1153167028
|
global_step460/zero_pp_rank_10_mp_rank_00_model_states.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:e9a95d14826466723ae4452d871d9801e16bf48c35167c39087d7d1641f4c0d6
|
| 3 |
+
size 167142
|
global_step460/zero_pp_rank_10_mp_rank_00_optim_states.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:d937690bc002a83276e6c9f91437f1f29acc18ca3ecd498199cc0950aa750830
|
| 3 |
+
size 1153167040
|
global_step460/zero_pp_rank_11_mp_rank_00_model_states.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:a3ba2248325966693aeaf980918d1fce493661ef0a58ef0ac0e62e17c7eeaf0a
|
| 3 |
+
size 167142
|
global_step460/zero_pp_rank_11_mp_rank_00_optim_states.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:8731441c60a7d2d8a016cfa269f11d18e35b0e364dc65e35b3662efc6ca738df
|
| 3 |
+
size 1153167040
|
global_step460/zero_pp_rank_12_mp_rank_00_model_states.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:882a14d6e0c61b5df6821d9911d6569aaeaee2d8a2a3936c71e16538944a9685
|
| 3 |
+
size 167142
|
global_step460/zero_pp_rank_12_mp_rank_00_optim_states.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:28d4dd3b096996fab340837780571d3b05970269a5aa4992a6ff5705e9fa9707
|
| 3 |
+
size 1153167040
|
global_step460/zero_pp_rank_13_mp_rank_00_model_states.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:56d7ebb22be50e52e7efdc5ffe5c961f099bc485892a6011bc60b45e45e89880
|
| 3 |
+
size 167142
|
global_step460/zero_pp_rank_13_mp_rank_00_optim_states.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:38a687734ce06b8008762dbc4b581a871975b45de6d502cfa9f68b49b81c2b0c
|
| 3 |
+
size 1153167040
|
global_step460/zero_pp_rank_14_mp_rank_00_model_states.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:7789f9dc94a67406366b82f4e81ce53ee5cb22e3f75e38e9456420c34018a535
|
| 3 |
+
size 167142
|
global_step460/zero_pp_rank_14_mp_rank_00_optim_states.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:2dc53945672022b6e6bef75b5d4acedf9c9dfc87afa8bce3ea0bca3cbb4420dc
|
| 3 |
+
size 1153167040
|
global_step460/zero_pp_rank_15_mp_rank_00_model_states.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:1ba5f205a643425e6142361c22b0cc4dd29436044a5eb45147c939d1f67387b0
|
| 3 |
+
size 167142
|
global_step460/zero_pp_rank_15_mp_rank_00_optim_states.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:7c488840222a08fe3279064758fdaffd566f31288079d90c680a27da77760e44
|
| 3 |
+
size 1153167040
|
global_step460/zero_pp_rank_16_mp_rank_00_model_states.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:dbe87925aac2cdcd98e64e4f859b8a8f6d31e236c2a8dbd65c7ec72132c9ab06
|
| 3 |
+
size 167142
|
global_step460/zero_pp_rank_16_mp_rank_00_optim_states.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:5ffb5ded3d4f82ca6144e0040a51c1092594636a89125c5600459d5a9911412a
|
| 3 |
+
size 1153167040
|
global_step460/zero_pp_rank_17_mp_rank_00_model_states.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:416b9ddcf675297c2c7e46f5d88c346d22c365196847cad0611ec8a9bb3b3b84
|
| 3 |
+
size 167142
|
global_step460/zero_pp_rank_17_mp_rank_00_optim_states.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:3ac7f9a2bc110a58322850d5bc2513cc3cebdc919af76df91786fa696a110afe
|
| 3 |
+
size 1153167040
|
global_step460/zero_pp_rank_18_mp_rank_00_model_states.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:047fc64bed464686b9bc69a910ab8234d1ae927d8a6bc075624a1ce52288c73b
|
| 3 |
+
size 167142
|
global_step460/zero_pp_rank_18_mp_rank_00_optim_states.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:84474bf9917ae53b4b1c653c6f8134c39e23bbad39e2c7de6a91de9e784bbcde
|
| 3 |
+
size 1153167040
|
global_step460/zero_pp_rank_19_mp_rank_00_model_states.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:63f19245a31af0f73687408beeb974882c019cbe33f9f83638eab36607d137b3
|
| 3 |
+
size 167142
|
global_step460/zero_pp_rank_19_mp_rank_00_optim_states.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:b5dde7c6a45ad1eef937732a263292c3dbfe2e82d824eae447bbbc0f5abcc8cd
|
| 3 |
+
size 1153167040
|
global_step460/zero_pp_rank_1_mp_rank_00_model_states.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:1fff0bdc07afac8077f5b19c4a2de41ff8ed1ca5487b4e9a8e3a717a2303eee9
|
| 3 |
+
size 166812
|
global_step460/zero_pp_rank_1_mp_rank_00_optim_states.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:13efd4f710f74ce0cbd9e9daa6d6fbc54eea9abcdd529c4360dfc83654d81840
|
| 3 |
+
size 1153167028
|
global_step460/zero_pp_rank_20_mp_rank_00_model_states.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:2d470458e1991eb4abffbb4feec00833855da8e4ea35291841f1be21280fa40d
|
| 3 |
+
size 167142
|
global_step460/zero_pp_rank_20_mp_rank_00_optim_states.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:f8243ee542a32ec92bd2835f8dbc4ce8ecdd2747102df4a97205284d6ab443d7
|
| 3 |
+
size 1153167040
|
global_step460/zero_pp_rank_21_mp_rank_00_model_states.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:83a54814bdd9100867f3c6415a93addb3886700610f5935537d4996e77642a5c
|
| 3 |
+
size 167142
|
global_step460/zero_pp_rank_21_mp_rank_00_optim_states.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:dd0dd8f654d90023509d0f4b5ac00a41be6ca95be1364eb0a7085f52389a6701
|
| 3 |
+
size 1153167040
|
global_step460/zero_pp_rank_22_mp_rank_00_model_states.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:10ff143097e65c4322f8f2f1a8c7d2c8daea70c18f62f2827f308d5e48a0f377
|
| 3 |
+
size 167142
|
global_step460/zero_pp_rank_22_mp_rank_00_optim_states.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:8427bc204fdea341cd91e8e11403f30c24678d68c75fc7bd37dd2dfeda1dcb94
|
| 3 |
+
size 1153167040
|
global_step460/zero_pp_rank_23_mp_rank_00_model_states.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:421bc6aba622018ac7246d35db28509b61a8a15942f85aeeee882409c7f796ed
|
| 3 |
+
size 167142
|
global_step460/zero_pp_rank_23_mp_rank_00_optim_states.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:595a13f8ee3554151719dd457dee14852c3ebeab8422c46b04c8b1eae3fdc579
|
| 3 |
+
size 1153167040
|
global_step460/zero_pp_rank_24_mp_rank_00_model_states.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:2dc7e0251c4003e418abaa5b382e36fa029b802d53b682ed2ccf5ec9698d9fbb
|
| 3 |
+
size 167142
|
global_step460/zero_pp_rank_24_mp_rank_00_optim_states.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:b2ca1fb2a970a1e904715108cea7021166fc9b5d0b97759cfe6dcfa8ec0bc1e2
|
| 3 |
+
size 1153167040
|
global_step460/zero_pp_rank_25_mp_rank_00_model_states.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:c960b26d6d1115e797db529d36f4d226e7a420c315df9280e6bd41681fc5c693
|
| 3 |
+
size 167142
|
global_step460/zero_pp_rank_25_mp_rank_00_optim_states.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:d28dff5d239816e8312e71883dc657db37ec9cb8ee8b728ba37a9ec2759d2c94
|
| 3 |
+
size 1153167040
|
global_step460/zero_pp_rank_26_mp_rank_00_model_states.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:fa379003d1ee90d0988a9aa0c87dfe786333179339fe099c461145391772a050
|
| 3 |
+
size 167142
|
global_step460/zero_pp_rank_26_mp_rank_00_optim_states.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:d1e18f48476dc391b879b0d8f186da66d96ca6a12da085b9e63184d32eb0ffdf
|
| 3 |
+
size 1153167040
|
global_step460/zero_pp_rank_27_mp_rank_00_model_states.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:dff5a1f0fcb8026985e84550ba1d65accaebca919ea0802cac673e935806bf46
|
| 3 |
+
size 167142
|
global_step460/zero_pp_rank_27_mp_rank_00_optim_states.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:e03bb252a6d49027c7db9fec3b24feae321069fffc57051827a8c809b5842f13
|
| 3 |
+
size 1153167040
|
global_step460/zero_pp_rank_28_mp_rank_00_model_states.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:47fa176e6f89853ee2706cdaacaf0809b097bca28b8cdfef70a7c2bfcc5fee84
|
| 3 |
+
size 167142
|
global_step460/zero_pp_rank_28_mp_rank_00_optim_states.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:cba8675314900c0e74d1e9803df08d70e5bba8026af1c1f942384b24d937c413
|
| 3 |
+
size 1153167040
|
global_step460/zero_pp_rank_29_mp_rank_00_model_states.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:3dd51ba67a2d35843fd8214385d428d185381a574e57324efb0112eaf4589b86
|
| 3 |
+
size 167142
|
global_step460/zero_pp_rank_29_mp_rank_00_optim_states.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:cd32802f46ceeae9a55e08709d074e0db7bd5a2a8fb9b3610eddec4fc4d47ece
|
| 3 |
+
size 1153167040
|
global_step460/zero_pp_rank_2_mp_rank_00_model_states.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:1f00425164fd32ba1505e8ca3c9728ad29904198bdd00e1e939c440e027914b5
|
| 3 |
+
size 166812
|
global_step460/zero_pp_rank_2_mp_rank_00_optim_states.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:79d94566b0d0f71f40d5025da361fc940afb7727f58698d6789fcc20ed13957a
|
| 3 |
+
size 1153167028
|
global_step460/zero_pp_rank_30_mp_rank_00_model_states.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:c9eb5e8dc4dd1171e8cd03beba0678aa3089798c0c3843ed49516c8655e6e39a
|
| 3 |
+
size 167142
|