MisDrifter commited on
Commit
df6d8ac
·
verified ·
1 Parent(s): 69d1696

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -11,6 +11,6 @@ This model improves the instruction-following capabilities of Qwen-2.5-7B-Instru
11
  We report performance on instruction-following and general-chat benchmarks, using GPT-5-mini as the judge. Additional evaluation details and settings are provided in the paper.
12
  **Alpacaeval/Arena-Hard**:
13
  | Model | Alpacaeval (Vanilla) | Alpacaeval (Length-Controlled) | Arena-Hard (Vanilla) | Arena-Hard (Style-Controlled) |
14
- |---|---|---|---|---|---|
15
  | Qwen-2.5-7B-Instruct | 37.1 | 25.32 | 42.4 | 44.2 |
16
  | + PROSPER | 55.4 | 37.61 | 49.2 | 46.1 |
 
11
  We report performance on instruction-following and general-chat benchmarks, using GPT-5-mini as the judge. Additional evaluation details and settings are provided in the paper.
12
  **Alpacaeval/Arena-Hard**:
13
  | Model | Alpacaeval (Vanilla) | Alpacaeval (Length-Controlled) | Arena-Hard (Vanilla) | Arena-Hard (Style-Controlled) |
14
+ |---|---|---|---|---|
15
  | Qwen-2.5-7B-Instruct | 37.1 | 25.32 | 42.4 | 44.2 |
16
  | + PROSPER | 55.4 | 37.61 | 49.2 | 46.1 |