ItsMaxNorm commited on
Commit
6f44d6f
·
verified ·
1 Parent(s): fbd66a6

Model save

Browse files
This view is limited to 50 files because it contains too many changes.   See raw diff
Files changed (50) hide show
  1. README.md +58 -0
  2. all_results.json +8 -0
  3. generation_config.json +11 -0
  4. global_step183/bf16_zero_pp_rank_0_mp_rank_00_optim_states.pt +3 -0
  5. global_step183/bf16_zero_pp_rank_10_mp_rank_00_optim_states.pt +3 -0
  6. global_step183/bf16_zero_pp_rank_11_mp_rank_00_optim_states.pt +3 -0
  7. global_step183/bf16_zero_pp_rank_12_mp_rank_00_optim_states.pt +3 -0
  8. global_step183/bf16_zero_pp_rank_13_mp_rank_00_optim_states.pt +3 -0
  9. global_step183/bf16_zero_pp_rank_14_mp_rank_00_optim_states.pt +3 -0
  10. global_step183/bf16_zero_pp_rank_15_mp_rank_00_optim_states.pt +3 -0
  11. global_step183/bf16_zero_pp_rank_16_mp_rank_00_optim_states.pt +3 -0
  12. global_step183/bf16_zero_pp_rank_17_mp_rank_00_optim_states.pt +3 -0
  13. global_step183/bf16_zero_pp_rank_18_mp_rank_00_optim_states.pt +3 -0
  14. global_step183/bf16_zero_pp_rank_19_mp_rank_00_optim_states.pt +3 -0
  15. global_step183/bf16_zero_pp_rank_1_mp_rank_00_optim_states.pt +3 -0
  16. global_step183/bf16_zero_pp_rank_20_mp_rank_00_optim_states.pt +3 -0
  17. global_step183/bf16_zero_pp_rank_21_mp_rank_00_optim_states.pt +3 -0
  18. global_step183/bf16_zero_pp_rank_22_mp_rank_00_optim_states.pt +3 -0
  19. global_step183/bf16_zero_pp_rank_23_mp_rank_00_optim_states.pt +3 -0
  20. global_step183/bf16_zero_pp_rank_24_mp_rank_00_optim_states.pt +3 -0
  21. global_step183/bf16_zero_pp_rank_25_mp_rank_00_optim_states.pt +3 -0
  22. global_step183/bf16_zero_pp_rank_26_mp_rank_00_optim_states.pt +3 -0
  23. global_step183/bf16_zero_pp_rank_27_mp_rank_00_optim_states.pt +3 -0
  24. global_step183/bf16_zero_pp_rank_28_mp_rank_00_optim_states.pt +3 -0
  25. global_step183/bf16_zero_pp_rank_29_mp_rank_00_optim_states.pt +3 -0
  26. global_step183/bf16_zero_pp_rank_2_mp_rank_00_optim_states.pt +3 -0
  27. global_step183/bf16_zero_pp_rank_30_mp_rank_00_optim_states.pt +3 -0
  28. global_step183/bf16_zero_pp_rank_31_mp_rank_00_optim_states.pt +3 -0
  29. global_step183/bf16_zero_pp_rank_3_mp_rank_00_optim_states.pt +3 -0
  30. global_step183/bf16_zero_pp_rank_4_mp_rank_00_optim_states.pt +3 -0
  31. global_step183/bf16_zero_pp_rank_5_mp_rank_00_optim_states.pt +3 -0
  32. global_step183/bf16_zero_pp_rank_6_mp_rank_00_optim_states.pt +3 -0
  33. global_step183/bf16_zero_pp_rank_7_mp_rank_00_optim_states.pt +3 -0
  34. global_step183/bf16_zero_pp_rank_8_mp_rank_00_optim_states.pt +3 -0
  35. global_step183/bf16_zero_pp_rank_9_mp_rank_00_optim_states.pt +3 -0
  36. global_step183/zero_pp_rank_0_mp_rank_00_model_states.pt +3 -0
  37. global_step183/zero_pp_rank_10_mp_rank_00_model_states.pt +3 -0
  38. global_step183/zero_pp_rank_11_mp_rank_00_model_states.pt +3 -0
  39. global_step183/zero_pp_rank_12_mp_rank_00_model_states.pt +3 -0
  40. global_step183/zero_pp_rank_13_mp_rank_00_model_states.pt +3 -0
  41. global_step183/zero_pp_rank_14_mp_rank_00_model_states.pt +3 -0
  42. global_step183/zero_pp_rank_15_mp_rank_00_model_states.pt +3 -0
  43. global_step183/zero_pp_rank_16_mp_rank_00_model_states.pt +3 -0
  44. global_step183/zero_pp_rank_17_mp_rank_00_model_states.pt +3 -0
  45. global_step183/zero_pp_rank_18_mp_rank_00_model_states.pt +3 -0
  46. global_step183/zero_pp_rank_19_mp_rank_00_model_states.pt +3 -0
  47. global_step183/zero_pp_rank_1_mp_rank_00_model_states.pt +3 -0
  48. global_step183/zero_pp_rank_20_mp_rank_00_model_states.pt +3 -0
  49. global_step183/zero_pp_rank_21_mp_rank_00_model_states.pt +3 -0
  50. global_step183/zero_pp_rank_22_mp_rank_00_model_states.pt +3 -0
README.md ADDED
@@ -0,0 +1,58 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ base_model: Qwen/Qwen2.5-1.5B-Instruct
3
+ library_name: transformers
4
+ model_name: Qwen2.5-1.5B-Open-R1-GRPO
5
+ tags:
6
+ - generated_from_trainer
7
+ - trl
8
+ - sft
9
+ licence: license
10
+ ---
11
+
12
+ # Model Card for Qwen2.5-1.5B-Open-R1-GRPO
13
+
14
+ This model is a fine-tuned version of [Qwen/Qwen2.5-1.5B-Instruct](https://huggingface.co/Qwen/Qwen2.5-1.5B-Instruct).
15
+ It has been trained using [TRL](https://github.com/huggingface/trl).
16
+
17
+ ## Quick start
18
+
19
+ ```python
20
+ from transformers import pipeline
21
+
22
+ question = "If you had a time machine, but could only go to the past or the future once and never return, which would you choose and why?"
23
+ generator = pipeline("text-generation", model="ItsMaxNorm/Qwen2.5-1.5B-Open-R1-GRPO", device="cuda")
24
+ output = generator([{"role": "user", "content": question}], max_new_tokens=128, return_full_text=False)[0]
25
+ print(output["generated_text"])
26
+ ```
27
+
28
+ ## Training procedure
29
+
30
+
31
+
32
+
33
+ This model was trained with SFT.
34
+
35
+ ### Framework versions
36
+
37
+ - TRL: 0.18.0
38
+ - Transformers: 4.52.3
39
+ - Pytorch: 2.6.0
40
+ - Datasets: 3.6.0
41
+ - Tokenizers: 0.21.1
42
+
43
+ ## Citations
44
+
45
+
46
+
47
+ Cite TRL as:
48
+
49
+ ```bibtex
50
+ @misc{vonwerra2022trl,
51
+ title = {{TRL: Transformer Reinforcement Learning}},
52
+ author = {Leandro von Werra and Younes Belkada and Lewis Tunstall and Edward Beeching and Tristan Thrush and Nathan Lambert and Shengyi Huang and Kashif Rasul and Quentin Gallou{\'e}dec},
53
+ year = 2020,
54
+ journal = {GitHub repository},
55
+ publisher = {GitHub},
56
+ howpublished = {\url{https://github.com/huggingface/trl}}
57
+ }
58
+ ```
all_results.json ADDED
@@ -0,0 +1,8 @@
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "total_flos": 83412022984704.0,
3
+ "train_loss": 0.6596692787564319,
4
+ "train_runtime": 4532.2478,
5
+ "train_samples": 93733,
6
+ "train_samples_per_second": 20.681,
7
+ "train_steps_per_second": 0.041
8
+ }
generation_config.json ADDED
@@ -0,0 +1,11 @@
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "bos_token_id": 151643,
3
+ "do_sample": true,
4
+ "eos_token_id": 151645,
5
+ "pad_token_id": 151643,
6
+ "repetition_penalty": 1.1,
7
+ "temperature": 0.7,
8
+ "top_k": 20,
9
+ "top_p": 0.8,
10
+ "transformers_version": "4.52.3"
11
+ }
global_step183/bf16_zero_pp_rank_0_mp_rank_00_optim_states.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:3136915d265417e86d5f3a53479794b78e1611ef75150b9be757ccd72f9da3d9
3
+ size 578897648
global_step183/bf16_zero_pp_rank_10_mp_rank_00_optim_states.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ac4208b1fd413ad21a234f9364e934e3e566e2797a312c88bad7f0fbd252f879
3
+ size 578897660
global_step183/bf16_zero_pp_rank_11_mp_rank_00_optim_states.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:3df41dd42bf5269814d08d6e2262d766fc56e8cfcf7a770d84ad731edd0a56a8
3
+ size 578897660
global_step183/bf16_zero_pp_rank_12_mp_rank_00_optim_states.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f81ba2b0dc32ca86d0d8966a02126d18cf380a4cda06cfd5b49f3435a77ab414
3
+ size 578897660
global_step183/bf16_zero_pp_rank_13_mp_rank_00_optim_states.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:e80f23fc0febeee6a133036ccf84c7f2037bfbc1865aedf3b3b142594c7f2d3c
3
+ size 578897660
global_step183/bf16_zero_pp_rank_14_mp_rank_00_optim_states.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:3ff27eb6d57e1de875144573c035f12d64e2f0fb8240e4bf1062c129c6aae0f2
3
+ size 578897660
global_step183/bf16_zero_pp_rank_15_mp_rank_00_optim_states.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:2846765a651335d691df088db4983ad2290668b23979b881c89607c445b0b9ec
3
+ size 578897660
global_step183/bf16_zero_pp_rank_16_mp_rank_00_optim_states.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:74e5a8a7ed7017c3575b2a14ffe0cfc69acfda1f39d3da52e67263a375cd8c23
3
+ size 578897660
global_step183/bf16_zero_pp_rank_17_mp_rank_00_optim_states.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:9a0b16bdad2aceb62d832a915e51155cc51dba490593e2096bac6b33ec3ac992
3
+ size 578897660
global_step183/bf16_zero_pp_rank_18_mp_rank_00_optim_states.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:b54582ab3fc720d5f9e197b418915938be1d94a5bfd59fbcbee3e1cff76afb3b
3
+ size 578897660
global_step183/bf16_zero_pp_rank_19_mp_rank_00_optim_states.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:9aa3d67fe87523645f22fc906f45025d29349994bfd18910b61b641a20d844f0
3
+ size 578897660
global_step183/bf16_zero_pp_rank_1_mp_rank_00_optim_states.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:6e5a3188fd584e0a784ccb95bd02afe975707cc440f86358838477119b444785
3
+ size 578897648
global_step183/bf16_zero_pp_rank_20_mp_rank_00_optim_states.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:4991f971045b154303336850f3bd0c69f8bf3022e5fc9b4dda1dbab2189b0f94
3
+ size 578897660
global_step183/bf16_zero_pp_rank_21_mp_rank_00_optim_states.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:c69aed0cfdfa77c20c5f80a31237212c75896da7c42aa4ade2fb56420b90276c
3
+ size 578897660
global_step183/bf16_zero_pp_rank_22_mp_rank_00_optim_states.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:081751f7588a2ee7d27ac676dfb646f49765c45f48c0a127f6bbcf3aef4b8030
3
+ size 578897660
global_step183/bf16_zero_pp_rank_23_mp_rank_00_optim_states.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:5a6ffaab4764ead334625f477deb7ccde46ec140849d9e1b7d05cef8a5efa733
3
+ size 578897660
global_step183/bf16_zero_pp_rank_24_mp_rank_00_optim_states.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:b95f20a5def82b80476b7d8482ab7a0d889c9e1f691c34003084670087dc6a86
3
+ size 578897660
global_step183/bf16_zero_pp_rank_25_mp_rank_00_optim_states.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:d6b712bb4678c5387fda9e2bc1a2851bf9707e061872d841dc147e166eebc610
3
+ size 578897660
global_step183/bf16_zero_pp_rank_26_mp_rank_00_optim_states.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:79fc7f526f10bbfc2205b8c4cb9f7b9a72638ab24cfe68fb4b234ade9ff8b9e6
3
+ size 578897660
global_step183/bf16_zero_pp_rank_27_mp_rank_00_optim_states.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:0d70a0bf11a120f5ddb79c0a053bf508d5df8131309b3c048c0af70711252c7d
3
+ size 578897660
global_step183/bf16_zero_pp_rank_28_mp_rank_00_optim_states.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:0f68a1f8f9f8baa7438c06a2c4c1a99abad08f46d4a9a234f1ffcb26964c4653
3
+ size 578897660
global_step183/bf16_zero_pp_rank_29_mp_rank_00_optim_states.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:c1ea9e8faf3400988f61b30e835d9b8ae577be2296a54fde487428d57f23e0f0
3
+ size 578897660
global_step183/bf16_zero_pp_rank_2_mp_rank_00_optim_states.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:83261b7459bf5cb0a5269dba502c5dc496ef78893931749c2edd9f3278bccbae
3
+ size 578897648
global_step183/bf16_zero_pp_rank_30_mp_rank_00_optim_states.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:d8fd934a8a07476de918eb01b8d78f278f1a052cdc9668638aaead080dad5cea
3
+ size 578897660
global_step183/bf16_zero_pp_rank_31_mp_rank_00_optim_states.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:cd162bcdc4b2f7ce575db9b387650f4f3d86a946fe93f2085f3d08aa75ab5a07
3
+ size 578897660
global_step183/bf16_zero_pp_rank_3_mp_rank_00_optim_states.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f9057f5ef23383e7ef3c0b1b333d59281b37e7191b142d53b4b1f2d692a3502a
3
+ size 578897648
global_step183/bf16_zero_pp_rank_4_mp_rank_00_optim_states.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:6b0d99308ad831ff64bd98c098ad50a554765564630e5dc7db1615ca9fb04063
3
+ size 578897648
global_step183/bf16_zero_pp_rank_5_mp_rank_00_optim_states.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:009a81d85e71ceeef03e45e30026616317071d0a40faf7710defb7df7d5e6bef
3
+ size 578897648
global_step183/bf16_zero_pp_rank_6_mp_rank_00_optim_states.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:47fa3ed6c92f797c7afc5f17076e6e8120fd6a4251ec166a6269fd8a1d2ff946
3
+ size 578897648
global_step183/bf16_zero_pp_rank_7_mp_rank_00_optim_states.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:19e168ec2e9dac12e7f7f27d2fb51d922d1069730e7181980482dbf83745c97e
3
+ size 578897648
global_step183/bf16_zero_pp_rank_8_mp_rank_00_optim_states.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a6e219d7151349efd1350e308916b8cb78e686b0351e3fd5f5826297dbba93f0
3
+ size 578897648
global_step183/bf16_zero_pp_rank_9_mp_rank_00_optim_states.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:1c4a951d9fea7c80faa847a7b2f5fbbf43a5dfbd5791e19cb114a2e5083252e6
3
+ size 578897648
global_step183/zero_pp_rank_0_mp_rank_00_model_states.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:2a910410c9aca0c5880b0d78172e3abef0ea00af9404da1e132173e7c8833337
3
+ size 166072
global_step183/zero_pp_rank_10_mp_rank_00_model_states.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:eb9e30f10d95a4d3aee9f84c59d3bcdda419f29ffbe756e1fc11183e9131cc9e
3
+ size 166350
global_step183/zero_pp_rank_11_mp_rank_00_model_states.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:d51ce6b8792cf134e8af9aadd9deda4fcdddff37b824854af5bef2c16841e9b1
3
+ size 166350
global_step183/zero_pp_rank_12_mp_rank_00_model_states.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:2bb1d0358e7bff3f8fd40da973c49377c778efb410c33cbc6740a9375f0e6832
3
+ size 166350
global_step183/zero_pp_rank_13_mp_rank_00_model_states.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a80a6e5052e3c7cea9708c46eb407cd1150e584ad87d7ac793288a24da403e8c
3
+ size 166350
global_step183/zero_pp_rank_14_mp_rank_00_model_states.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:3db23914c4d585f7e86179964719294dce335ec92232956a7f3f318a761c6aa7
3
+ size 166350
global_step183/zero_pp_rank_15_mp_rank_00_model_states.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:2854c3a15706e6e635ef18163a1e61885f4901516ea3b23298030323bfa3ca7a
3
+ size 166350
global_step183/zero_pp_rank_16_mp_rank_00_model_states.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:4df1b861930d4f1ec7af220cf7f2d17b6cde202748dc7ba3aebc53ceb104ae2d
3
+ size 166350
global_step183/zero_pp_rank_17_mp_rank_00_model_states.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:df055f3212e0218a0bf0c2485b663def38f23c72ab96641af1ceeb71fa52c0b5
3
+ size 166350
global_step183/zero_pp_rank_18_mp_rank_00_model_states.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:b193626e45fed30b6e5f2730032aaab88d23cd1dde24bba608631e48236a4bf0
3
+ size 166350
global_step183/zero_pp_rank_19_mp_rank_00_model_states.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:2149c88768003b007c3a9f8e386df97db64329b3de7c07db2078c724788cf4a4
3
+ size 166350
global_step183/zero_pp_rank_1_mp_rank_00_model_states.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:7f64a010320812fd70544c6781bd885cdf57785c87492ee410d92e7860321d3e
3
+ size 166008
global_step183/zero_pp_rank_20_mp_rank_00_model_states.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:1200e3e3c247d8497b1d5e1ffa5ad79a71b350c13552d83764ed1503a45d605c
3
+ size 166350
global_step183/zero_pp_rank_21_mp_rank_00_model_states.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:1782a91dc05abdb7186321180adb541e32d9af0616220200107e66a2c7e86a40
3
+ size 166350
global_step183/zero_pp_rank_22_mp_rank_00_model_states.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:79d7e7618a8ff694f2f43801c09ae0f150a794d98f065b7789fa1023cbf46416
3
+ size 166350