Hugging Face
SirajRLX/task2file
Transformers
Safetensors
Generated from Trainer
trl
dpo
arxiv:2305.18290
b66e341
task2file
7.7 GB
1 contributor
History: 9 commits
Latest commit: SirajRLX, "Add files using upload-large-folder tool", b66e341 (verified), 4 months ago
Name                   Size        Last commit message                         Updated
DPO-14b                -           Upload folder using huggingface_hub         4 months ago
best_adapter           -           Add files using upload-large-folder tool    4 months ago
checkpoints            -           Add files using upload-large-folder tool    4 months ago
dpo_run_14B            -           Upload folder using huggingface_hub         4 months ago
logs                   -           Add files using upload-large-folder tool    4 months ago
wandb                  -           Add files using upload-large-folder tool    4 months ago
.gitattributes         2.72 kB     Add files using upload-large-folder tool    4 months ago
config_resolved.yaml   3.68 kB     Add files using upload-large-folder tool    4 months ago
eval_final.json [Safe] 199 Bytes   Add files using upload-large-folder tool    4 months ago