sengi commited on
Commit
89b9e8b
·
verified ·
1 Parent(s): c16e1a8

End of training

Browse files
Files changed (3) hide show
  1. README.md +4 -30
  2. generation_config.json +1 -1
  3. tokenizer.json +16 -2
README.md CHANGED
@@ -14,8 +14,6 @@ should probably proofread and complete it, then remove this comment. -->
14
  # LLaDA-planner_balanced
15
 
16
  This model is a fine-tuned version of [maple-research-lab/LLaDOU-v0-Math](https://huggingface.co/maple-research-lab/LLaDOU-v0-Math) on an unknown dataset.
17
- It achieves the following results on the evaluation set:
18
- - Loss: 0.0
19
 
20
  ## Model description
21
 
@@ -45,35 +43,11 @@ The following hyperparameters were used during training:
45
 
46
  ### Training results
47
 
48
- | Training Loss | Epoch | Step | Validation Loss |
49
- |:-------------:|:------:|:-----:|:---------------:|
50
- | 0.0062 | 0.0020 | 1000 | 0.0 |
51
- | 0.0027 | 0.0039 | 2000 | 0.0 |
52
- | 0.0045 | 0.0059 | 3000 | 0.0 |
53
- | 0.0044 | 0.0078 | 4000 | 0.0 |
54
- | 0.0031 | 0.0098 | 5000 | 0.0 |
55
- | 0.004 | 0.0117 | 6000 | 0.0 |
56
- | 0.0032 | 0.0137 | 7000 | 0.0 |
57
- | 0.0043 | 0.0157 | 8000 | 0.0 |
58
- | 0.0042 | 0.0176 | 9000 | 0.0 |
59
- | 0.0035 | 0.0196 | 10000 | 0.0 |
60
- | 0.0043 | 0.0215 | 11000 | 0.0 |
61
- | 0.0032 | 0.0235 | 12000 | 0.0 |
62
- | 0.0037 | 0.0254 | 13000 | 0.0 |
63
- | 0.0034 | 0.0274 | 14000 | 0.0 |
64
- | 0.0033 | 0.0293 | 15000 | 0.0 |
65
- | 0.0044 | 0.0313 | 16000 | 0.0 |
66
- | 0.0011 | 0.0333 | 17000 | 0.0 |
67
- | 0.0006 | 0.0352 | 18000 | 0.0 |
68
- | 0.0015 | 0.0372 | 19000 | 0.0 |
69
- | 0.0018 | 0.0391 | 20000 | 0.0 |
70
- | 0.0105 | 0.0411 | 21000 | 0.0 |
71
- | 0.0082 | 0.0430 | 22000 | 0.0 |
72
 
73
 
74
  ### Framework versions
75
 
76
- - Transformers 4.57.1
77
- - Pytorch 2.9.0+cu128
78
- - Datasets 4.3.0
79
- - Tokenizers 0.22.1
 
14
  # LLaDA-planner_balanced
15
 
16
  This model is a fine-tuned version of [maple-research-lab/LLaDOU-v0-Math](https://huggingface.co/maple-research-lab/LLaDOU-v0-Math) on an unknown dataset.
 
 
17
 
18
  ## Model description
19
 
 
43
 
44
  ### Training results
45
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
46
 
47
 
48
  ### Framework versions
49
 
50
+ - Transformers 4.56.1
51
+ - Pytorch 2.8.0+cu128
52
+ - Datasets 4.0.0
53
+ - Tokenizers 0.22.0
generation_config.json CHANGED
@@ -2,5 +2,5 @@
2
  "_from_model_config": true,
3
  "bos_token_id": 126080,
4
  "eos_token_id": 126081,
5
- "transformers_version": "4.57.1"
6
  }
 
2
  "_from_model_config": true,
3
  "bos_token_id": 126080,
4
  "eos_token_id": 126081,
5
+ "transformers_version": "4.56.1"
6
  }
tokenizer.json CHANGED
@@ -1,7 +1,21 @@
1
  {
2
  "version": "1.0",
3
- "truncation": null,
4
- "padding": null,
 
 
 
 
 
 
 
 
 
 
 
 
 
 
5
  "added_tokens": [
6
  {
7
  "id": 126080,
 
1
  {
2
  "version": "1.0",
3
+ "truncation": {
4
+ "direction": "Right",
5
+ "max_length": 2048,
6
+ "strategy": "LongestFirst",
7
+ "stride": 0
8
+ },
9
+ "padding": {
10
+ "strategy": {
11
+ "Fixed": 2048
12
+ },
13
+ "direction": "Right",
14
+ "pad_to_multiple_of": null,
15
+ "pad_id": 126081,
16
+ "pad_type_id": 0,
17
+ "pad_token": "<|endoftext|>"
18
+ },
19
  "added_tokens": [
20
  {
21
  "id": 126080,