tasksource
/

deberta-base-long-nli

Zero-Shot Classification

text-classification

Model card Files Files and versions

sileod commited on Jul 29, 2024

Commit

2106a8a

·

verified ·

1 Parent(s): 944c071

Update README.md

Files changed (1) hide show

README.md +3 -1

README.md CHANGED Viewed

@@ -310,7 +310,7 @@ This checkpoint has strong zero-shot validation performance on many tasks (e.g.
 | anli/a2                     |            47.2 |
 | anli/a3                     |            49.4 |
 | nli_fever                   |            79.4 |
-| folio                       |            61.8 |
 | ConTRoL-nli                 |            63.3 |
 | cladder                     |            71.1 |
 | zero-shot-label-nli         |            74.4 |
@@ -318,6 +318,8 @@ This checkpoint has strong zero-shot validation performance on many tasks (e.g.
 | oasst2_pairwise_rlhf_reward |            73.9 |
 | doc-nli                     |            90.0 |
 # [ZS] Zero-shot classification pipeline
 ```python
 from transformers import pipeline

 | anli/a2                     |            47.2 |
 | anli/a3                     |            49.4 |
 | nli_fever                   |            79.4 |
+| FOLIO                       |            61.8 |
 | ConTRoL-nli                 |            63.3 |
 | cladder                     |            71.1 |
 | zero-shot-label-nli         |            74.4 |
 | oasst2_pairwise_rlhf_reward |            73.9 |
 | doc-nli                     |            90.0 |
+Zero-shot GPT-4 scores 61% on FOLIO (logical reasoning), 62% on cladder (probabilistic reasoning) and 56.4% on ConTRoL (long context NLI).
 # [ZS] Zero-shot classification pipeline
 ```python
 from transformers import pipeline