SirajRLX/task2file
Tags: Transformers, Safetensors, Generated from Trainer, trl, dpo, arxiv:2305.18290
Branch: main
Path: task2file/dpo_run_14B/wandb (3.26 MB)
1 contributor
History: 1 commit
SirajRLX: Upload folder using huggingface_hub (8b2d0c7, verified), 4 months ago
Name | Scan | Size | Last commit | Updated
run-20251226_152332-r9hfat2g | | | Upload folder using huggingface_hub | 4 months ago
run-20251226_152936-r1nptay8 | | | Upload folder using huggingface_hub | 4 months ago
run-20251226_155650-wbzoafvt | | | Upload folder using huggingface_hub | 4 months ago
debug-internal.log | Safe | 1.14 kB | Upload folder using huggingface_hub | 4 months ago
debug.log | Safe | 12.8 kB | Upload folder using huggingface_hub | 4 months ago