adgw commited on
Commit
fcaba40
·
verified ·
1 Parent(s): 49f89bc

Update benchmark leaderboard

Browse files
Files changed (1) hide show
  1. index.html +5 -2
index.html CHANGED
@@ -117,8 +117,11 @@
117
  </head>
118
  <body>
119
  <h1>Text Quality Rating Benchmark</h1>
120
- <p class="subtitle">LLM accuracy at rating text quality on a 1–6 scale across multiple languages</p>
121
- <p class="subtitle">Samples was labeled by Deepseek 3.2 and judged by Gemini 3 Flash</p>
 
 
 
122
 
123
  <div class="scoring-note">
124
  <span><span class="dot" style="background:#22c55e"></span>Exact match = 1.0 pt</span>
 
117
  </head>
118
  <body>
119
  <h1>Text Quality Rating Benchmark</h1>
120
+ <p class="meta-subtitle">
121
+ LLM accuracy at rating text quality on a 1–6 scale across multiple languages
122
+ <span class="sep">·</span> Labeled by DeepSeek V3.2 &amp; judged by Gemini 2.0 Flash
123
+ <span class="sep">·</span> Documents sourced from FineWeb dataset
124
+ </p>
125
 
126
  <div class="scoring-note">
127
  <span><span class="dot" style="background:#22c55e"></span>Exact match = 1.0 pt</span>