Token Classification
Transformers
TensorBoard
Safetensors
qwen2
Generated from Trainer
trl
stepwise-reward-trainer
text-generation-inference
Instructions to use trl-lib/Qwen2-0.5B-Reward-Math-Sheperd with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use trl-lib/Qwen2-0.5B-Reward-Math-Sheperd with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("token-classification", model="trl-lib/Qwen2-0.5B-Reward-Math-Sheperd")# Load model directly from transformers import AutoTokenizer, AutoModelForTokenClassification tokenizer = AutoTokenizer.from_pretrained("trl-lib/Qwen2-0.5B-Reward-Math-Sheperd") model = AutoModelForTokenClassification.from_pretrained("trl-lib/Qwen2-0.5B-Reward-Math-Sheperd") - Notebooks
- Google Colab
- Kaggle
Training in progress, step 5500
Browse files
model.safetensors
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
size 1976170816
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:4471ac9e6e03ba4f251cc5796a5affa4783f5c76e24c3f90e933c3589adc1329
|
| 3 |
size 1976170816
|
runs/Dec09_19-51-10_ip-26-0-171-21/events.out.tfevents.1733773892.ip-26-0-171-21.2852281.0
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:3504fac4b3e3630d5b0776ec521fb873bbc701479d2281998473f03b1aa1d56c
|
| 3 |
+
size 87325
|