Update README.md
Browse files
README.md
CHANGED
|
@@ -11,6 +11,6 @@ This model improves the instruction-following capabilities of Qwen-2.5-7B-Instru
|
|
| 11 |
We report performance on instruction-following and general-chat benchmarks, using GPT-5-mini as the judge. Additional evaluation details and settings are provided in the paper.
|
| 12 |
**Alpacaeval/Arena-Hard**:
|
| 13 |
| Model | Alpacaeval (Vanilla) | Alpacaeval (Length-Controlled) | Arena-Hard (Vanilla) | Arena-Hard (Style-Controlled) |
|
| 14 |
-
|---|---|---|---|---|
|
| 15 |
| Qwen-2.5-7B-Instruct | 37.1 | 25.32 | 42.4 | 44.2 |
|
| 16 |
| + PROSPER | 55.4 | 37.61 | 49.2 | 46.1 |
|
|
|
|
| 11 |
We report performance on instruction-following and general-chat benchmarks, using GPT-5-mini as the judge. Additional evaluation details and settings are provided in the paper.
|
| 12 |
**Alpacaeval/Arena-Hard**:
|
| 13 |
| Model | Alpacaeval (Vanilla) | Alpacaeval (Length-Controlled) | Arena-Hard (Vanilla) | Arena-Hard (Style-Controlled) |
|
| 14 |
+
|---|---|---|---|---|
|
| 15 |
| Qwen-2.5-7B-Instruct | 37.1 | 25.32 | 42.4 | 44.2 |
|
| 16 |
| + PROSPER | 55.4 | 37.61 | 49.2 | 46.1 |
|