Spaces: miyuki2026/OpenMiniMind
Branch: main
Path: OpenMiniMind/examples/tutorials/dpo/ultrafeedback-dpo
Total size: 19.1 kB
Contributors: 1
History: 20 commits
Latest commit: miyuki2026, "update" (76d081c), 16 days ago
File                                          Scan   Size       Last commit   Updated
requirements.txt                              Safe   71 Bytes   update        18 days ago
step_1_prepare_data.py                        Safe   1.4 kB     update        17 days ago
step_2_train_dpo_model_ddp_qlora.py           Safe   8.73 kB    update        16 days ago
step_2_train_dpo_model_single_gpu_qlora.py    Safe   7.85 kB    update        17 days ago
step_6_push_to_modelscope.py                  Safe   1.09 kB    update        18 days ago