Token Classification
Transformers
TensorBoard
Safetensors
qwen2
Generated from Trainer
trl
stepwise-reward-trainer
text-generation-inference
Instructions to use trl-lib/Qwen2-0.5B-Reward-Math-Sheperd with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use trl-lib/Qwen2-0.5B-Reward-Math-Sheperd with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("token-classification", model="trl-lib/Qwen2-0.5B-Reward-Math-Sheperd")# Load model directly from transformers import AutoTokenizer, AutoModelForTokenClassification tokenizer = AutoTokenizer.from_pretrained("trl-lib/Qwen2-0.5B-Reward-Math-Sheperd") model = AutoModelForTokenClassification.from_pretrained("trl-lib/Qwen2-0.5B-Reward-Math-Sheperd") - Notebooks
- Google Colab
- Kaggle
Training in progress, step 6500
Browse files
model.safetensors
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
size 1976170816
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:5aacd40dfbf4569b08e9a91783cca06087d54188959684c10171e885c8e4b40d
|
| 3 |
size 1976170816
|
runs/Dec09_19-51-10_ip-26-0-171-21/events.out.tfevents.1733773892.ip-26-0-171-21.2852281.0
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:517c403bbc65dd08cb47aecbeeb2f6262d141008a8bbf826c28191f7c4aeb5cf
|
| 3 |
+
size 102225
|