Add evaluation results on the plain_text config and validation split of launch/gov_report
Browse filesBeep boop, I am a bot from Hugging Face's automatic model evaluator 👋!\
Your model has been evaluated on the plain_text config and validation split of the [launch/gov_report](https://huggingface.co/datasets/launch/gov_report) dataset by @nonchalant-nagavalli, using the predictions stored [here](https://huggingface.co/datasets/autoevaluate/autoeval-staging-eval-launch__gov_report-plain_text-7b7f8a-16126221).\
Accept this pull request to see the results displayed on the [Hub leaderboard](https://huggingface.co/spaces/autoevaluate/leaderboards?dataset=launch/gov_report).\
Evaluate your model on more datasets [here](https://huggingface.co/spaces/autoevaluate/model-evaluator?dataset=launch/gov_report).
README.md
CHANGED
|
@@ -78,6 +78,39 @@ model-index:
|
|
| 78 |
type: gen_len
|
| 79 |
value: 186.2003
|
| 80 |
verified: true
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 81 |
---
|
| 82 |
|
| 83 |
# BigBirdPegasus model (large)
|
|
|
|
| 78 |
type: gen_len
|
| 79 |
value: 186.2003
|
| 80 |
verified: true
|
| 81 |
+
- task:
|
| 82 |
+
type: summarization
|
| 83 |
+
name: Summarization
|
| 84 |
+
dataset:
|
| 85 |
+
name: launch/gov_report
|
| 86 |
+
type: launch/gov_report
|
| 87 |
+
config: plain_text
|
| 88 |
+
split: validation
|
| 89 |
+
metrics:
|
| 90 |
+
- name: ROUGE-1
|
| 91 |
+
type: rouge
|
| 92 |
+
value: 26.6836
|
| 93 |
+
verified: true
|
| 94 |
+
- name: ROUGE-2
|
| 95 |
+
type: rouge
|
| 96 |
+
value: 3.937
|
| 97 |
+
verified: true
|
| 98 |
+
- name: ROUGE-L
|
| 99 |
+
type: rouge
|
| 100 |
+
value: 15.2322
|
| 101 |
+
verified: true
|
| 102 |
+
- name: ROUGE-LSUM
|
| 103 |
+
type: rouge
|
| 104 |
+
value: 23.5209
|
| 105 |
+
verified: true
|
| 106 |
+
- name: loss
|
| 107 |
+
type: loss
|
| 108 |
+
value: 7.046688556671143
|
| 109 |
+
verified: true
|
| 110 |
+
- name: gen_len
|
| 111 |
+
type: gen_len
|
| 112 |
+
value: 183.2895
|
| 113 |
+
verified: true
|
| 114 |
---
|
| 115 |
|
| 116 |
# BigBirdPegasus model (large)
|