End of training

Files changed (3)

README.md +4 -30
generation_config.json +1 -1
tokenizer.json +16 -2

README.md CHANGED

@@ -14,8 +14,6 @@ should probably proofread and complete it, then remove this comment. -->
 # LLaDA-planner_balanced
 This model is a fine-tuned version of [maple-research-lab/LLaDOU-v0-Math](https://huggingface.co/maple-research-lab/LLaDOU-v0-Math) on an unknown dataset.
-It achieves the following results on the evaluation set:
-- Loss: 0.0
 ## Model description
@@ -45,35 +43,11 @@ The following hyperparameters were used during training:
 ### Training results
-| Training Loss | Epoch  | Step  | Validation Loss |
-|:-------------:|:------:|:-----:|:---------------:|
-| 0.0062        | 0.0020 | 1000  | 0.0             |
-| 0.0027        | 0.0039 | 2000  | 0.0             |
-| 0.0045        | 0.0059 | 3000  | 0.0             |
-| 0.0044        | 0.0078 | 4000  | 0.0             |
-| 0.0031        | 0.0098 | 5000  | 0.0             |
-| 0.004         | 0.0117 | 6000  | 0.0             |
-| 0.0032        | 0.0137 | 7000  | 0.0             |
-| 0.0043        | 0.0157 | 8000  | 0.0             |
-| 0.0042        | 0.0176 | 9000  | 0.0             |
-| 0.0035        | 0.0196 | 10000 | 0.0             |
-| 0.0043        | 0.0215 | 11000 | 0.0             |
-| 0.0032        | 0.0235 | 12000 | 0.0             |
-| 0.0037        | 0.0254 | 13000 | 0.0             |
-| 0.0034        | 0.0274 | 14000 | 0.0             |
-| 0.0033        | 0.0293 | 15000 | 0.0             |
-| 0.0044        | 0.0313 | 16000 | 0.0             |
-| 0.0011        | 0.0333 | 17000 | 0.0             |
-| 0.0006        | 0.0352 | 18000 | 0.0             |
-| 0.0015        | 0.0372 | 19000 | 0.0             |
-| 0.0018        | 0.0391 | 20000 | 0.0             |
-| 0.0105        | 0.0411 | 21000 | 0.0             |
-| 0.0082        | 0.0430 | 22000 | 0.0             |
 ### Framework versions
-- Transformers 4.57.1
-- Pytorch 2.9.0+cu128
-- Datasets 4.3.0
-- Tokenizers 0.22.1
+- Transformers 4.56.1
+- Pytorch 2.8.0+cu128
+- Datasets 4.0.0
+- Tokenizers 0.22.0

generation_config.json CHANGED

@@ -2,5 +2,5 @@
   "_from_model_config": true,
   "bos_token_id": 126080,
   "eos_token_id": 126081,
-  "transformers_version": "4.57.1"
+  "transformers_version": "4.56.1"
 }

tokenizer.json CHANGED

@@ -1,7 +1,21 @@
 {
   "version": "1.0",
-  "truncation": null,
-  "padding": null,
+  "truncation": {
+    "direction": "Right",
+    "max_length": 2048,
+    "strategy": "LongestFirst",
+    "stride": 0
+  },
+  "padding": {
+    "strategy": {
+      "Fixed": 2048
+    },
+    "direction": "Right",
+    "pad_to_multiple_of": null,
+    "pad_id": 126081,
+    "pad_type_id": 0,
+    "pad_token": "<|endoftext|>"
+  },
   "added_tokens": [
     {
       "id": 126080,
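
The `truncation` and `padding` blocks added to tokenizer.json mean every encoded sequence comes out exactly 2048 tokens long: longer inputs are cut from the right, shorter ones are padded on the right with id 126081 (`<|endoftext|>`), which is the same id as `eos_token_id` in generation_config.json. The following is a minimal pure-Python sketch of that behaviour for a single sequence. It is not the `tokenizers` implementation; `apply_length_policy` is a hypothetical helper and the input ids in the usage lines are arbitrary.

```python
import json

# Settings copied from the lines this commit adds to tokenizer.json.
TOKENIZER_SETTINGS = json.loads("""
{
  "truncation": {"direction": "Right", "max_length": 2048,
                 "strategy": "LongestFirst", "stride": 0},
  "padding": {"strategy": {"Fixed": 2048}, "direction": "Right",
              "pad_to_multiple_of": null, "pad_id": 126081,
              "pad_type_id": 0, "pad_token": "<|endoftext|>"}
}
""")


def apply_length_policy(token_ids, settings=TOKENIZER_SETTINGS):
    """Hypothetical helper: emulate right truncation followed by fixed-length
    right padding for one sequence of token ids."""
    max_length = settings["truncation"]["max_length"]
    fixed_length = settings["padding"]["strategy"]["Fixed"]
    pad_id = settings["padding"]["pad_id"]

    # Truncation direction "Right": keep the first max_length tokens.
    ids = list(token_ids)[:max_length]
    attention_mask = [1] * len(ids)

    # Padding direction "Right": append pad_id until the fixed length is reached.
    n_pad = max(0, fixed_length - len(ids))
    ids += [pad_id] * n_pad
    attention_mask += [0] * n_pad
    return ids, attention_mask


short_ids, short_mask = apply_length_policy([11, 22, 33])
long_ids, long_mask = apply_length_policy(range(3000))
print(len(short_ids), sum(short_mask))  # padded length, real-token count
print(len(long_ids), sum(long_mask))    # truncated length, real-token count
```

Because the pad id and the EOS id coincide, downstream code that needs to tell real end-of-sequence tokens from padding has to rely on the attention mask rather than on the token id alone.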