Update README.md
Browse files
README.md
CHANGED
|
@@ -7,7 +7,7 @@ base_model:
|
|
| 7 |
---
|
| 8 |
|
| 9 |
# Model Summary
|
| 10 |
-
This model is trained using [UnifiedReward-Flex](https://huggingface.co/collections/CodeGoat24/unifiedreward-flex) as reward on the training dataset of [UniGenBench](https://github.com/CodeGoat24/UniGenBench).
|
| 11 |
|
| 12 |
🚀 The inference code is available at [Github](https://github.com/CodeGoat24/Pref-GRPO/blob/main/inference/flux_dist_infer.sh).
|
| 13 |
|
|
|
|
| 7 |
---
|
| 8 |
|
| 9 |
# Model Summary
|
| 10 |
+
This model is GRPO trained using [UnifiedReward-Flex](https://huggingface.co/collections/CodeGoat24/unifiedreward-flex) as reward on the training dataset of [UniGenBench](https://github.com/CodeGoat24/UniGenBench).
|
| 11 |
|
| 12 |
🚀 The inference code is available at [Github](https://github.com/CodeGoat24/Pref-GRPO/blob/main/inference/flux_dist_infer.sh).
|
| 13 |
|