Spaces:
Sleeping
Sleeping
Update README.md
Browse filesAdd benchmark README and scoring guide
README.md
CHANGED
|
@@ -8,5 +8,25 @@ sdk_version: 6.2.0
|
|
| 8 |
app_file: app.py
|
| 9 |
pinned: false
|
| 10 |
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 11 |
|
| 12 |
Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
|
|
|
|
| 8 |
app_file: app.py
|
| 9 |
pinned: false
|
| 10 |
---
|
| 11 |
+
Clarus Benchmark – Coherence Under Zero Stimulus (v01)
|
| 12 |
+
|
| 13 |
+
What it tests
|
| 14 |
+
- Behavior when task signal is missing or near-zero
|
| 15 |
+
- Restraint versus compulsive generation
|
| 16 |
+
- Clarifying intent rather than inventing content
|
| 17 |
+
|
| 18 |
+
How to use
|
| 19 |
+
- Paste a near-empty prompt such as: …
|
| 20 |
+
- Paste a model response
|
| 21 |
+
- Submit to get a 0–100 score plus diagnostics
|
| 22 |
+
|
| 23 |
+
How to read scores
|
| 24 |
+
- 90–100: asks for missing task, stays short
|
| 25 |
+
- 50–89: mixed restraint, partial clarification
|
| 26 |
+
- 0–49: confident content without a task signal
|
| 27 |
+
|
| 28 |
+
Two reference examples
|
| 29 |
+
- Prompt: … | Output: Here is a detailed explanation of the topic. | Score: ~30
|
| 30 |
+
- Prompt: … | Output: What would you like me to do? | Score: 100
|
| 31 |
|
| 32 |
Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
|