Update README.md
README.md
CHANGED
@@ -10,6 +10,7 @@ licence: license
 ---
 
 # Model Card for Qwen3.0-1.7B-Reward
+Use https://huggingface.co/Realmbird/helpfulness-preference-model-qwen-0.6B-merged instead due to a tokenizer mismatch
 
 This model is a fine-tuned version of [Qwen/Qwen3-0.6B](https://huggingface.co/Qwen/Qwen3-0.6B).
 It has been trained using [TRL](https://github.com/huggingface/trl).