Spaces:
Running
Running
File size: 1,082 Bytes
7b72b2c | 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 | ---
dimension: groundedness
scale: binary
reference_based: true
abstain_allowed: true
---
# Groundedness (binary)
Score whether every claim in the answer is supported by the gold source snippets.
## Score 0
Answer contains at least one claim not supported by the snippets.
### Example A — answer cites unsupported fact
Question: "What's the default port?"
Snippets: ["The default is 8080."]
Answer: "The default is 8080 and supports TLS."
Score=0 because the TLS claim has no support in the snippet. The
unsupported claim is sufficient to fail groundedness regardless of
how many other claims are correctly grounded — this is the binary
rubric's strict-conjunction definition.
## Score 1
Every claim in the answer is supported by at least one snippet.
### Example B — fully grounded one-sentence answer
Question: "What's the default port?"
Snippets: ["The default is 8080."]
Answer: "The default port is 8080."
Score=1 because the only claim ("default port is 8080") is directly
supported by the snippet. Paraphrase is allowed; what matters is
factual entailment.
|