CodeGoat24 commited on
Commit
fe83bcb
·
verified ·
1 Parent(s): cb0bf77

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -7,7 +7,7 @@ base_model:
7
  ---
8
 
9
  # Model Summary
10
- This model is trained using [UnifiedReward-Flex](https://huggingface.co/collections/CodeGoat24/unifiedreward-flex) as reward on the training dataset of [UniGenBench](https://github.com/CodeGoat24/UniGenBench).
11
 
12
  🚀 The inference code is available at [Github](https://github.com/CodeGoat24/Pref-GRPO/blob/main/inference/flux_dist_infer.sh).
13
 
 
7
  ---
8
 
9
  # Model Summary
10
+ This model is GRPO trained using [UnifiedReward-Flex](https://huggingface.co/collections/CodeGoat24/unifiedreward-flex) as reward on the training dataset of [UniGenBench](https://github.com/CodeGoat24/UniGenBench).
11
 
12
  🚀 The inference code is available at [Github](https://github.com/CodeGoat24/Pref-GRPO/blob/main/inference/flux_dist_infer.sh).
13