Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
SirajRLX
/
task2file
like
0
Transformers
Safetensors
Generated from Trainer
trl
dpo
arxiv:
2305.18290
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
0e9d230
task2file
1.66 GB
Ctrl+K
Ctrl+K
1 contributor
History:
5 commits
SirajRLX
Add files using upload-large-folder tool
0e9d230
verified
4 months ago
DPO-14b
Upload folder using huggingface_hub
4 months ago
best_adapter
Add files using upload-large-folder tool
4 months ago
checkpoints
Add files using upload-large-folder tool
4 months ago
dpo_run_14B
Upload folder using huggingface_hub
4 months ago
logs
Add files using upload-large-folder tool
4 months ago
wandb
Add files using upload-large-folder tool
4 months ago
.gitattributes
2.32 kB
Add files using upload-large-folder tool
4 months ago
config_resolved.yaml
Safe
3.61 kB
Add files using upload-large-folder tool
4 months ago
eval_final.json
Safe
201 Bytes
Add files using upload-large-folder tool
4 months ago