selink/Qwen2.5-0.5B-Instruct-Reward
Tags: Transformers, TensorBoard, Safetensors, Generated from Trainer, reward-trainer, trl
Branch: main
Total size: 41.6 GB
Contributors: 1
History: 2 commits
Latest commit: selink, "Upload folder using huggingface_hub" (4faccd9, verified), about 2 months ago
Name               Size      Last commit message                  Updated
checkpoint-2000    -         Upload folder using huggingface_hub  about 2 months ago
checkpoint-20000   -         Upload folder using huggingface_hub  about 2 months ago
checkpoint-20500   -         Upload folder using huggingface_hub  about 2 months ago
checkpoint-21000   -         Upload folder using huggingface_hub  about 2 months ago
checkpoint-21500   -         Upload folder using huggingface_hub  about 2 months ago
checkpoint-21831   -         Upload folder using huggingface_hub  about 2 months ago
checkpoint-2500    -         Upload folder using huggingface_hub  about 2 months ago
runs               -         Upload folder using huggingface_hub  about 2 months ago
.gitattributes     1.99 kB   Upload folder using huggingface_hub  about 2 months ago
README.md          1.3 kB    Upload folder using huggingface_hub  about 2 months ago