# Bonus 2: Multitask MoE (XSum)
## Metrics
- ROUGE-1: 0.0000
- ROUGE-2: 0.0000
- ROUGE-L: 0.0000
- ROUGE-Lsum: 0.0000
- SacreBLEU: 0.0000
- BERTScore (P/R/F1): 0.7473 / 0.8181 / 0.7809
- Compression ratio: 0.2472
- Extractiveness: 0.6044
- NLI factual consistency: 0.4948
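
The card does not say how compression ratio and extractiveness were computed. As a rough sketch only, the snippet below assumes two common definitions: compression ratio as summary length divided by source length in whitespace tokens, and extractiveness as the fraction of summary tokens that also occur in the source. The function names `tokenize`, `compression_ratio`, and `extractiveness` are hypothetical and are not taken from this repository's evaluation code.

```python
def tokenize(text: str) -> list[str]:
    """Lowercase whitespace tokenization (an assumption; the real tokenizer is unknown)."""
    return text.lower().split()


def compression_ratio(source: str, summary: str) -> float:
    """Summary length over source length, in tokens. Lower means shorter summaries."""
    source_tokens = tokenize(source)
    if not source_tokens:
        return 0.0
    return len(tokenize(summary)) / len(source_tokens)


def extractiveness(source: str, summary: str) -> float:
    """Fraction of summary tokens that also appear anywhere in the source."""
    summary_tokens = tokenize(summary)
    if not summary_tokens:
        return 0.0
    source_vocab = set(tokenize(source))
    copied = sum(1 for token in summary_tokens if token in source_vocab)
    return copied / len(summary_tokens)


if __name__ == "__main__":
    source = "the cat sat on the mat near the door"
    summary = "the cat slept"
    print(f"Compression ratio: {compression_ratio(source, summary):.4f}")
    print(f"Extractiveness: {extractiveness(source, summary):.4f}")
```

Under these assumed definitions, a compression ratio of 0.2472 would mean summaries are about a quarter of the source length, and an extractiveness of 0.6044 would mean roughly 60% of summary tokens are copied from the source. The actual evaluation may use a different tokenizer or an n-gram based definition.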