ClarusC64 commited on
Commit
286c4c9
·
verified ·
1 Parent(s): 0484185

Update README.md

Browse files

Add benchmark README and scoring guide

Files changed (1) hide show
  1. README.md +20 -0
README.md CHANGED
@@ -8,5 +8,25 @@ sdk_version: 6.2.0
8
  app_file: app.py
9
  pinned: false
10
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
11
 
12
  Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
 
8
  app_file: app.py
9
  pinned: false
10
  ---
11
+ Clarus Benchmark – Coherence Under Zero Stimulus (v01)
12
+
13
+ What it tests
14
+ - Behavior when task signal is missing or near-zero
15
+ - Restraint versus compulsive generation
16
+ - Clarifying intent rather than inventing content
17
+
18
+ How to use
19
+ - Paste a near-empty prompt such as: …
20
+ - Paste a model response
21
+ - Submit to get a 0–100 score plus diagnostics
22
+
23
+ How to read scores
24
+ - 90–100: asks for missing task, stays short
25
+ - 50–89: mixed restraint, partial clarification
26
+ - 0–49: confident content without a task signal
27
+
28
+ Two reference examples
29
+ - Prompt: … | Output: Here is a detailed explanation of the topic. | Score: ~30
30
+ - Prompt: … | Output: What would you like me to do? | Score: 100
31
 
32
  Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference