Zaynes commited on
Commit
0afbb1f
·
verified ·
1 Parent(s): 3b75a96

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +28 -0
README.md ADDED
@@ -0,0 +1,28 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ language: en
3
+ license: mit
4
+ ---
5
+
6
+ # M-1117_newmodels__qwen7b_R1Distill_ct3arg-rl
7
+
8
+ ## Model Details
9
+
10
+ - **Training Method**: VeRL Reinforcement Learning (RL)
11
+ - **Stage Name**: rl
12
+ - **Experiment**: 1117_newmodels__qwen7b_R1Distill_ct3arg
13
+ - **RL Framework**: VeRL (Versatile Reinforcement Learning)
14
+
15
+ ## Training Configuration
16
+
17
+ ## Experiment Tracking
18
+
19
+ 🔗 **View complete experiment details**: https://huggingface.co/datasets/TAUR-dev/D-ExpTracker__1117_newmodels__qwen7b_R1Distill_ct3arg__v1
20
+
21
+ ## Usage
22
+
23
+ ```python
24
+ from transformers import AutoTokenizer, AutoModelForCausalLM
25
+
26
+ tokenizer = AutoTokenizer.from_pretrained("TAUR-dev/M-1117_newmodels__qwen7b_R1Distill_ct3arg-rl")
27
+ model = AutoModelForCausalLM.from_pretrained("TAUR-dev/M-1117_newmodels__qwen7b_R1Distill_ct3arg-rl")
28
+ ```