Zaynes committed on
Commit 1fa60ee (verified) · 1 Parent(s): 47cf229

Upload README.md with huggingface_hub

Files changed (1)
  1. README.md +28 -0
README.md ADDED
@@ -0,0 +1,28 @@
+ ---
+ language: en
+ license: mit
+ ---
+
+ # M-r1_distill_baseline-rl
+
+ ## Model Details
+
+ - **Training Method**: VeRL Reinforcement Learning (RL)
+ - **Stage Name**: rl
+ - **Experiment**: r1_distill_baseline
+ - **RL Framework**: VeRL (Versatile Reinforcement Learning)
+
+ ## Training Configuration
+
+ ## Experiment Tracking
+
+ 🔗 **View complete experiment details**: https://huggingface.co/datasets/TAUR-dev/D-ExpTracker__r1_distill_baseline__v1
+
+ ## Usage
+
+ ```python
+ from transformers import AutoTokenizer, AutoModelForCausalLM
+
+ tokenizer = AutoTokenizer.from_pretrained("TAUR-dev/M-r1_distill_baseline-rl")
+ model = AutoModelForCausalLM.from_pretrained("TAUR-dev/M-r1_distill_baseline-rl")
+ ```
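
The Usage snippet in the README loads the tokenizer and model but stops before generating any text. The following is a minimal sketch of one way to continue from there, not part of the commit. It assumes the checkpoint works with a plain-text prompt (the README does not say whether a chat template is expected). The helper names `generation_kwargs` and `generate_text`, the default of 256 new tokens, and the temperature of 0.7 are illustrative choices, not values taken from the experiment.

```python
MODEL_ID = "TAUR-dev/M-r1_distill_baseline-rl"  # repo id taken from the README


def generation_kwargs(max_new_tokens=256, sample=False, temperature=0.7):
    """Build keyword arguments for model.generate (defaults are illustrative)."""
    kwargs = {"max_new_tokens": max_new_tokens, "do_sample": sample}
    if sample:
        # Temperature only has an effect when sampling is enabled.
        kwargs["temperature"] = temperature
    return kwargs


def generate_text(prompt, model_id=MODEL_ID, **overrides):
    """Load the checkpoint and return the model's continuation of `prompt`."""
    # Imported lazily so the helpers above can be used without torch installed.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(model_id)
    model.eval()

    # Assumption: a plain-text prompt, with no chat template applied.
    inputs = tokenizer(prompt, return_tensors="pt")
    with torch.no_grad():
        output_ids = model.generate(**inputs, **generation_kwargs(**overrides))

    # Drop the prompt tokens so only the newly generated text is decoded.
    new_tokens = output_ids[0, inputs["input_ids"].shape[1]:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True)


# Example call (downloads the model weights on first use):
# print(generate_text("What is 12 * 7? Think step by step.", max_new_tokens=128))
```

Greedy decoding is the default here because it gives repeatable output for a quick smoke test. Passing `sample=True` switches to sampling.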