Update README.md
Browse files
README.md
CHANGED
|
@@ -22,9 +22,9 @@ The model is fine-tuned and tested on the natural language inference (NLI) datas
|
|
| 22 |
|
| 23 |
Below is a confusion matrix calculated on zero-shot inferences for the 10 most popular categories in the Test split of [reddgr/nli-chatbot-prompt-categorization](https://huggingface.co/datasets/reddgr/nli-chatbot-prompt-categorization) at the time of the first model upload. The classification with the base model on the same small test dataset is shown for comparison:
|
| 24 |
|
| 25 |
-

|
| 30 |
|
|
|
|
| 22 |
|
| 23 |
Below is a confusion matrix calculated on zero-shot inferences for the 10 most popular categories in the Test split of [reddgr/nli-chatbot-prompt-categorization](https://huggingface.co/datasets/reddgr/nli-chatbot-prompt-categorization) at the time of the first model upload. The classification with the base model on the same small test dataset is shown for comparison:
|
| 24 |
|
| 25 |
+

|
| 26 |
|
| 27 |
+
The current version of the fine-tuned model outperforms the base model [facebook/bart-large-mnli](https://huggingface.co/facebook/bart-large-mnli) by 23 percentage points (57% accuracy vs 34% accuracy) in a test set with 10 candidate zero-shot classes (the most frequent categories in the test split of [reddgr/nli-chatbot-prompt-categorization](https://huggingface.co/datasets/reddgr/nli-chatbot-prompt-categorization)).
|
| 28 |
|
| 29 |
The dataset and the model are continously updated as they assist with content publishing on my website [Talking to Chatbots](https://talkingtochatbots)
|
| 30 |
|