Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Artanic30
/
DA-DPO_llava_v1.5_7B
like
0
Reinforcement Learning
TensorBoard
English
llava_bpo
arxiv:
2601.00623
License:
apache-2.0
Model card
Files
Files and versions
xet
Metrics
Training metrics
Community
main
DA-DPO_llava_v1.5_7B
212 MB
Ctrl+K
Ctrl+K
1 contributor
History:
3 commits
Artanic30
Update README.md
552cde0
verified
3 months ago
runs
Upload folder using huggingface_hub
3 months ago
.gitattributes
Safe
1.52 kB
initial commit
3 months ago
README.md
471 Bytes
Update README.md
3 months ago
adapter_config.json
561 Bytes
Upload folder using huggingface_hub
3 months ago
adapter_model.bin
pickle
Detected Pickle imports (3)
"collections.OrderedDict"
,
"torch._utils._rebuild_tensor_v2"
,
"torch.BFloat16Storage"
What is a pickle import?
160 MB
xet
Upload folder using huggingface_hub
3 months ago
config.json
1.36 kB
Upload folder using huggingface_hub
3 months ago
non_lora_trainables.bin
pickle
Detected Pickle imports (3)
"torch._utils._rebuild_tensor_v2"
,
"collections.OrderedDict"
,
"torch.BFloat16Storage"
What is a pickle import?
42 MB
xet
Upload folder using huggingface_hub
3 months ago
output.log
3.27 MB
Upload folder using huggingface_hub
3 months ago
trainer_state.json
2.69 MB
Upload folder using huggingface_hub
3 months ago