ustc-zyt committed on
Commit 1b55753 · verified · 1 Parent(s): 3074d9e

Update README.md

  1. README.md +45 -3
README.md CHANGED
@@ -1,3 +1,45 @@
- ---
- license: apache-2.0
- ---
+ ---
+ license: apache-2.0
+ datasets:
+ - ustc-zyt/time-r1-data
+ language:
+ - en
+ metrics:
+ - mse
+ - mae
+ ---
+ # 🧠 Time-R1 Reinforced Model Weights
+
+ These are the official **reinforcement learning (RL) fine-tuned model checkpoints** for the paper:
+ **"Time Series Forecasting as Reasoning: A Slow-Thinking Approach with Reinforced LLMs"**.
+
+ ---
+
+ ## 📦 Model Details
+
+ * **Base Model**: Qwen2.5-7B
+ * **Tuning Framework**: [Verl](https://github.com/volcengine/verl) + [LLaMA Factory](https://github.com/hiyouga/LLaMA-Factory)
+ * **Final Stage**: Trained using GRIP (Group-based Relative Importance Policy optimization)
+ * **Objective**: Multi-horizon time series forecasting with structured reasoning
+
+ ---
+
+ ## 📦 Files Included
+
+ This model follows the standard Hugging Face `transformers` format and uses the efficient `safetensors` backend.
+
+ ```
+ Time-R1/
+ ├── config.json
+ ├── generation_config.json
+ ├── model.safetensors.index.json
+ ├── model-00001-of-00004.safetensors
+ ├── model-00002-of-00004.safetensors
+ ├── model-00003-of-00004.safetensors
+ ├── model-00004-of-00004.safetensors
+ ├── tokenizer_config.json
+ ├── tokenizer.json
+ └── vocab.json
+ ```
+
+ ✅ Fully compatible with Hugging Face `transformers` and `AutoModelForCausalLM`.
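
The compatibility claim in the added README can be exercised with a short Python sketch. This is not part of the commit; it is a minimal illustration under stated assumptions. The repository ID `ustc-zyt/Time-R1` is a guess based on the author name and the `Time-R1/` directory in the file tree, and the helper names `shard_filenames`, `missing_files`, and `load_time_r1` are hypothetical. The `transformers` import is deferred so the file-check helpers run without it installed.

```python
from pathlib import Path

# Assumption: the README does not state the repository ID; replace with the real one.
MODEL_ID = "ustc-zyt/Time-R1"  # hypothetical


def shard_filenames(num_shards: int = 4) -> list[str]:
    """Weight shard names in the pattern shown in the README file tree."""
    return [
        f"model-{i:05d}-of-{num_shards:05d}.safetensors"
        for i in range(1, num_shards + 1)
    ]


def missing_files(model_dir: str, num_shards: int = 4) -> list[str]:
    """Return the files from the README listing that are absent in model_dir."""
    expected = [
        "config.json",
        "generation_config.json",
        "model.safetensors.index.json",
        *shard_filenames(num_shards),
        "tokenizer_config.json",
        "tokenizer.json",
        "vocab.json",
    ]
    root = Path(model_dir)
    return [name for name in expected if not (root / name).is_file()]


def load_time_r1(model_path: str = MODEL_ID):
    """Load tokenizer and model from a local directory or a Hub repository ID."""
    # Imported lazily so the helpers above work without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_path)
    # torch_dtype="auto" keeps the dtype recorded in the checkpoint config.
    model = AutoModelForCausalLM.from_pretrained(model_path, torch_dtype="auto")
    return tokenizer, model
```

For a local download, `missing_files("path/to/Time-R1")` should return an empty list before `load_time_r1("path/to/Time-R1")` is called. Loading a 7B-parameter checkpoint needs substantial memory, so the load call is left to the reader rather than run here.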