metadata
language: en
license: apache-2.0
tags:
  - summarization
datasets:
  - cnn_dailymail
  - xsum
thumbnail: https://huggingface.co/front/thumbnails/distilbart_medium.png
model-index:
  - name: sshleifer/distilbart-xsum-12-6
    results:
      - task:
          type: summarization
          name: Summarization
        dataset:
          name: samsum
          type: samsum
          config: samsum
          split: test
        metrics:
          - type: rouge
            value: 20.3249
            name: ROUGE-1
            verified: true
            verifyToken: >-
              eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiZTc2YzZmZmIwMGViZDk5NGY3Y2EwZTZkNWQ3N2FjNjM0MzdjYjI4N2UyMTYzYjg2NWNlZGFmZjg3ZjAzY2M4MSIsInZlcnNpb24iOjF9.FSB_urEdPUK-S8wjkAuefy4v3DGym2UHv4xDsnyHc0D2n3LXSc7SbcrMmGNhqIUbXk03hVCv_vszq5jqzHN2BQ
          - type: rouge
            value: 3.5106
            name: ROUGE-2
            verified: true
            verifyToken: >-
              eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiY2FhMDc5YjQ3NWQ4YjM2MDZlYTJkODMxNmY1MjRlMjBiYmU1ODE4MWM0YmVmNjg5YWNkOGQ2NzIwYTExZjExZiIsInZlcnNpb24iOjF9.xP1OJrqJuBLAr4okD9gpjQWwOddgJc72ve7wgaXWyZBAHUdY0eC63MWg-Iv8WsuCM_AxaXCjY4EFDYqtKGSYCw
          - type: rouge
            value: 15.3062
            name: ROUGE-L
            verified: true
            verifyToken: >-
              eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiZjcyM2NhZDJjNjIyNDEwOTdjNzczNzQwMDZiZThjNmY0NDcyNmIyYmE4ODgyOWY3YTdhM2YyYzI2MWY1NDg0YiIsInZlcnNpb24iOjF9.63qid3xnpX0SVEqsjRrWcc5eukbKmZ5nUGdppwzzPoIEsr6keuhBATzxIjLIQhDceXwrLoY408f5tcKa-s6ADQ
          - type: rouge
            value: 16.9328
            name: ROUGE-LSUM
            verified: true
            verifyToken: >-
              eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiNGFhMDg5ZGFmZTFiMTkwNWM5ZDY4ODFiNDM0ZjVlYzgwYTQ2NDU3OGI1YmRkNzVjODE4OGU0YTA0MDZkNjljYSIsInZlcnNpb24iOjF9.Rk8JUhDQq1284_eD_HQdR8RQvIJ76nOUjfjENQ5qbhfB8tnQDoqBaXnusVR9768Qi1cEdoKbX4zlVGJLUvVWDQ
          - type: loss
            value: 4.240785598754883
            name: loss
            verified: true
            verifyToken: >-
              eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiYjJmOTNmM2E2NzUzMjQ3YTU5YWI1OTg1NmMyZDQyNDAyMTc3Y2E3ZjY0M2RjMzIyNGNiMDE0ZWY0YzkxZjQ0NSIsInZlcnNpb24iOjF9.NqjYH3-SUEOO-jZX9giulkdpW5wue6nAtzMWhYfVpfQdQ7S_Xouf7fvakXkXCAO-APpGnW4FcdSjcafCgwy6CQ
          - type: gen_len
            value: 20.1758
            name: gen_len
            verified: true
            verifyToken: >-
              eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiMWJiNjAwZGVmMDQxODY0NGYwOGE4ZGQyNWQ4NzdkMjA4MWQ0MmM0OWY2NTRhMzk1MTNiYmQ5OGJhMzYwYzc1OCIsInZlcnNpb24iOjF9.XPL3qXhy9ud7P904zkFWWrvfDrNXmmXIYeCBEnQyTilpAoqyrI2v1FCoio9NhWAecJT7-2iC4MAxEOG_9yj9DQ

Usage

Load this checkpoint with BartForConditionalGeneration.from_pretrained. See the BART docs for more information.
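As a minimal sketch of the loading step described above, the snippet below wraps tokenization, generation, and decoding in one helper. The helper name `summarize`, the use of `BartTokenizer`, and the generation settings (`num_beams=4`, `max_length=62`) are illustrative assumptions and are not taken from this model card.

```python
MODEL_ID = "sshleifer/distilbart-xsum-12-6"


def summarize(text, model_id=MODEL_ID, num_beams=4, max_length=62):
    """Return a one-string summary of `text` (hypothetical helper)."""
    # Imported lazily so that defining this function does not require
    # transformers or torch to be installed.
    from transformers import BartForConditionalGeneration, BartTokenizer

    tokenizer = BartTokenizer.from_pretrained(model_id)
    model = BartForConditionalGeneration.from_pretrained(model_id)
    model.eval()

    # Truncate long inputs to the encoder's 1024-token limit.
    inputs = tokenizer(
        text, return_tensors="pt", truncation=True, max_length=1024
    )

    # Beam search settings here are assumptions chosen for illustration.
    summary_ids = model.generate(
        inputs["input_ids"],
        attention_mask=inputs["attention_mask"],
        num_beams=num_beams,
        max_length=max_length,
    )
    return tokenizer.decode(summary_ids[0], skip_special_tokens=True)
```

Calling `summarize(article_text)` downloads the checkpoint on first use and returns the decoded summary. Loading the model once and reusing it across calls would be the better design for batch workloads; the helper reloads it each time only to keep the sketch short.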

Metrics for DistilBART models

| Model Name                 | Params (M) | Inference Time (ms) | Speedup | ROUGE-2 | ROUGE-L |
|----------------------------|-----------:|--------------------:|--------:|--------:|--------:|
| distilbart-xsum-12-1       |        222 |                  90 |    2.54 |   18.31 |   33.37 |
| distilbart-xsum-6-6        |        230 |                 132 |    1.73 |   20.92 |   35.73 |
| distilbart-xsum-12-3       |        255 |                 106 |    2.16 |   21.37 |   36.39 |
| distilbart-xsum-9-6        |        268 |                 136 |    1.68 |   21.72 |   36.61 |
| bart-large-xsum (baseline) |        406 |                 229 |       1 |   21.85 |   36.50 |
| distilbart-xsum-12-6       |        306 |                 137 |    1.68 |   22.12 |   36.99 |
| bart-large-cnn (baseline)  |        406 |                 381 |       1 |   21.06 |   30.63 |
| distilbart-12-3-cnn        |        255 |                 214 |    1.78 |   20.57 |   30.00 |
| distilbart-12-6-cnn        |        306 |                 307 |    1.24 |   21.26 |   30.59 |
| distilbart-6-6-cnn         |        230 |                 182 |    2.09 |   20.17 |   29.70 |
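The card does not define the Speedup column. A plausible reading, stated here as an assumption, is the baseline's inference time divided by the model's inference time, with bart-large-xsum as the baseline for the xsum models and bart-large-cnn for the cnn models. The sketch below recomputes the column from the inference times in the table above; under this assumption the results match the listed values to within 0.01.

```python
# Inference times in milliseconds, copied from the table above.
BASELINE_MS = {"xsum": 229, "cnn": 381}

MODEL_MS = {
    "distilbart-xsum-12-1": ("xsum", 90),
    "distilbart-xsum-6-6": ("xsum", 132),
    "distilbart-xsum-12-3": ("xsum", 106),
    "distilbart-xsum-9-6": ("xsum", 136),
    "distilbart-xsum-12-6": ("xsum", 137),
    "distilbart-12-3-cnn": ("cnn", 214),
    "distilbart-12-6-cnn": ("cnn", 307),
    "distilbart-6-6-cnn": ("cnn", 182),
}


def speedup(baseline_ms, model_ms):
    """Assumed definition: baseline latency divided by model latency."""
    return baseline_ms / model_ms


# Print the recomputed speedup for each distilled model.
for name, (family, ms) in MODEL_MS.items():
    print(f"{name}: {speedup(BASELINE_MS[family], ms):.2f}")
```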