mnavas
/

roberta-finetuned-WebClassification

Text Classification

Generated from Trainer

text-embeddings-inference

Model card Files Files and versions

Metrics Training metrics Community

mnavas commited on Mar 23, 2023

Commit

429da02

·

1 Parent(s): 093afa1

Update README.md

Files changed (1) hide show

README.md +22 -5

README.md CHANGED Viewed

@@ -10,6 +10,7 @@ metrics:
 model-index:
 - name: roberta-finetuned-WebClassification
   results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -17,7 +18,7 @@ should probably proofread and complete it, then remove this comment. -->
 # roberta-finetuned-WebClassification
-This model is a fine-tuned version of [xlm-roberta-base](https://huggingface.co/xlm-roberta-base) on the None dataset.
 It achieves the following results on the evaluation set:
 - Loss: 0.3473
 - Accuracy: 0.9504
@@ -27,15 +28,31 @@ It achieves the following results on the evaluation set:
 ## Model description
-More information needed
 ## Intended uses & limitations
-More information needed
 ## Training and evaluation data
-More information needed
 ## Training procedure
@@ -71,4 +88,4 @@ The following hyperparameters were used during training:
 - Transformers 4.16.2
 - Pytorch 1.9.1
 - Datasets 1.18.4
-- Tokenizers 0.11.6

 model-index:
 - name: roberta-finetuned-WebClassification
   results: []
+pipeline_tag: text-classification
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 # roberta-finetuned-WebClassification
+This model is a fine-tuned version of [xlm-roberta-base](https://huggingface.co/xlm-roberta-base) on the [Web Classification Dataset](https://www.kaggle.com/datasets/hetulmehta/website-classification).
 It achieves the following results on the evaluation set:
 - Loss: 0.3473
 - Accuracy: 0.9504
 ## Model description
+The model classifies websites into the following categories:
+- "0": "Adult",
+- "1": "Business/Corporate",
+- "2": "Computers and Technology",
+- "3": "E-Commerce",
+- "4": "Education",
+- "5": "Food",
+- "6": "Forums",
+- "7": "Games",
+- "8": "Health and Fitness",
+- "9": "Law and Government",
+- "10": "News",
+- "11": "Photography",
+- "12": "Social Networking and Messaging",
+- "13": "Sports",
+- "14": "Streaming Services",
+- "15": "Travel"
 ## Intended uses & limitations
+Web classification in English (for now).
 ## Training and evaluation data
+Trained and tested on a 80/20 split of the [Web Classification Dataset](https://www.kaggle.com/datasets/hetulmehta/website-classification).
 ## Training procedure
 - Transformers 4.16.2
 - Pytorch 1.9.1
 - Datasets 1.18.4
+- Tokenizers 0.11.6