sarkerlab
/

SocBERT-final

Model card Files Files and versions

yguo262 commited on Mar 20, 2023

Commit

7a1539c

·

1 Parent(s): 904b8cb

Create README.md

Files changed (1) hide show

README.md +15 -0

README.md ADDED Viewed

	@@ -0,0 +1,15 @@

+# SocBERT model
+Pretrained model on 20GB English tweets and 72GB Reddit comments using a masked language modeling (MLM) objective.
+The model was trained from scratch following the model architecture of RoBERTa-base.
+We benchmarked SocBERT, on 40 text classification tasks with social media data.
+The experiment results can be found in our paper:
+```
+@inproceedings{socbert:2023,
+title     = {{SocBERT: A Pretrained Model for Social Media Text}},
+author    = {Yuting Guo and Abeed Sarker},
+booktitle = {Proceedings of the Fourth Workshop on Insights from Negative Results in NLP},
+year      = {2023}
+}
+```
+A base version of the model can be found at [SocBERT-base](https://huggingface.co/sarkerlab/SocBERT-base).