yguo262 commited on
Commit
7a1539c
·
1 Parent(s): 904b8cb

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +15 -0
README.md ADDED
@@ -0,0 +1,15 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # SocBERT model
2
+ Pretrained model on 20GB English tweets and 72GB Reddit comments using a masked language modeling (MLM) objective.
3
+ The model was trained from scratch following the model architecture of RoBERTa-base.
4
+ We benchmarked SocBERT, on 40 text classification tasks with social media data.
5
+ The experiment results can be found in our paper:
6
+ ```
7
+ @inproceedings{socbert:2023,
8
+ title = {{SocBERT: A Pretrained Model for Social Media Text}},
9
+ author = {Yuting Guo and Abeed Sarker},
10
+ booktitle = {Proceedings of the Fourth Workshop on Insights from Negative Results in NLP},
11
+ year = {2023}
12
+ }
13
+ ```
14
+
15
+ A base version of the model can be found at [SocBERT-base](https://huggingface.co/sarkerlab/SocBERT-base).