Token Classification
Transformers
TensorBoard
Safetensors
qwen2
Generated from Trainer
trl
stepwise-reward-trainer
text-generation-inference
Instructions to use trl-lib/Qwen2-0.5B-Reward-Math-Sheperd with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use trl-lib/Qwen2-0.5B-Reward-Math-Sheperd with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("token-classification", model="trl-lib/Qwen2-0.5B-Reward-Math-Sheperd")# Load model directly from transformers import AutoTokenizer, AutoModelForTokenClassification tokenizer = AutoTokenizer.from_pretrained("trl-lib/Qwen2-0.5B-Reward-Math-Sheperd") model = AutoModelForTokenClassification.from_pretrained("trl-lib/Qwen2-0.5B-Reward-Math-Sheperd") - Notebooks
- Google Colab
- Kaggle
Training in progress, step 6601
Browse files
model.safetensors
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
size 1976170816
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:8a17e389f9f01efb0db5ebf1336ed493c6f877ebd5c9944225f5536bda264167
|
| 3 |
size 1976170816
|
runs/Dec09_19-51-10_ip-26-0-171-21/events.out.tfevents.1733773892.ip-26-0-171-21.2852281.0
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:49b68cfcf66ec6f8d863a0921ad7478f797fe633c819e2fda7f00b63de3fcc73
|
| 3 |
+
size 104069
|