Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
miyuki2026
/
OpenMiniMind
like
0
Sleeping
App
Files
Files
Community
Fetching metadata from the HF Docker repository...
main
OpenMiniMind
/
examples
/
tutorials
/
ppo
/
gpt2_sst2
54.5 kB
1 contributor
History:
1 commit
miyuki2026
update
a1cb0be
16 days ago
step_1_prepare_data.py
Safe
1.39 kB
update
16 days ago
step_2_train_sft_model.py
Safe
4.9 kB
update
16 days ago
step_3_train_reward_model.py
Safe
9.45 kB
update
16 days ago
step_4_test_reward_model.py
Safe
5.1 kB
update
16 days ago
step_5_ppo_rlhf.py
Safe
8.95 kB
update
16 days ago
step_5_ppo_rlhf2.py
Safe
15.7 kB
update
16 days ago
step_5_pre_ppo_rlhf.py
Safe
9.03 kB
update
16 days ago