rjac/bert-20news-classification

Text Classification

generated_from_keras_callback

text-embeddings-inference

rjac committed on May 22, 2023

Commit d0df317 · 1 Parent(s): f74d522

Update README.md

Files changed (1)

README.md +6 -5

@@ -22,15 +22,16 @@ It achieves the following results on the evaluation set:
 ## Model description
-More information needed
-## Intended uses & limitations
-More information needed
-## Training and evaluation data
-More information needed
 ## Training procedure

 ## Model description
+This model is a fine-tuned version of DistilBERT for sequence classification. It was trained with Hugging Face's transformers library and TensorFlow. Input sequences must be tokenized with the DistilBERT tokenizer.
+The model was trained to classify text into the 20 categories of the 20 Newsgroups dataset: 'alt.atheism', 'comp.graphics', 'comp.os.ms-windows.misc', 'comp.sys.ibm.pc.hardware', 'comp.sys.mac.hardware', 'comp.windows.x', 'misc.forsale', 'rec.autos', 'rec.motorcycles', 'rec.sport.baseball', 'rec.sport.hockey', 'sci.crypt', 'sci.electronics', 'sci.med', 'sci.space', 'soc.religion.christian', 'talk.politics.guns', 'talk.politics.mideast', 'talk.politics.misc', and 'talk.religion.misc'.
+## Intended uses & limitations
+This model is intended for classifying text into the 20 categories listed above. It can also be used to categorize text from similar domains or topics.
+## Training and evaluation data
+The model was trained on 90% of the 20 Newsgroups dataset, with the remaining 10% used for validation.
 ## Training procedure
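
The 90%/10% train/validation split mentioned under "Training and evaluation data" could be reproduced along the following lines. This is only a minimal sketch: the commit does not say how the split was made, so the shuffling, the fixed seed, and the helper name `split_train_validation` are all assumptions for illustration, not the author's procedure.

```python
import random


def split_train_validation(examples, validation_fraction=0.1, seed=42):
    """Shuffle examples and split them into (train, validation) lists.

    Hypothetical helper: the shuffle and the seed are assumptions,
    since the model card only states the 90/10 proportions.
    """
    if not 0.0 < validation_fraction < 1.0:
        raise ValueError("validation_fraction must be strictly between 0 and 1")
    shuffled = list(examples)  # copy so the caller's data is left untouched
    random.Random(seed).shuffle(shuffled)  # seeded for reproducibility
    n_validation = round(len(shuffled) * validation_fraction)
    # First n_validation items form the validation set, the rest is training data.
    return shuffled[n_validation:], shuffled[:n_validation]


# Placeholder documents standing in for 20 Newsgroups posts.
documents = [f"post {i}" for i in range(1000)]
train_docs, validation_docs = split_train_validation(documents)
print(len(train_docs), len(validation_docs))
```

A plain random split like this does not guarantee that each of the 20 categories keeps the same proportion in both subsets; a stratified split would be needed for that.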