Artanic30
/

DA-DPO_llava_v1.5_7B

Reinforcement Learning

Model card Files Files and versions

Metrics Training metrics Community

DA-DPO_llava_v1.5_7B

212 MB

Ctrl+K

Ctrl+K

1 contributor

History: 3 commits

Artanic30's picture

Update README.md

552cde0 verified 3 months ago

runs
Upload folder using huggingface_hub 3 months ago
.gitattributes

1.52 kB
initial commit 3 months ago
README.md

471 Bytes
Update README.md 3 months ago
adapter_config.json

561 Bytes
Upload folder using huggingface_hub 3 months ago
adapter_model.bin
Detected Pickle imports (3)
- "collections.OrderedDict",
- "torch._utils._rebuild_tensor_v2",
- "torch.BFloat16Storage"
What is a pickle import?
160 MB
xet

Upload folder using huggingface_hub 3 months ago
config.json

1.36 kB
Upload folder using huggingface_hub 3 months ago
non_lora_trainables.bin
Detected Pickle imports (3)
- "torch._utils._rebuild_tensor_v2",
- "collections.OrderedDict",
- "torch.BFloat16Storage"
What is a pickle import?
42 MB
xet

Upload folder using huggingface_hub 3 months ago
output.log

3.27 MB
Upload folder using huggingface_hub 3 months ago
trainer_state.json

2.69 MB
Upload folder using huggingface_hub 3 months ago