christofid commited on
Commit
eb29b5a
·
1 Parent(s): 6d0cfa5

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +7 -0
README.md ADDED
@@ -0,0 +1,7 @@
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: mit
3
+ ---
4
+ ### dapSciBERT
5
+
6
+ DapSciBERT is a BERT-like model trained based on the domain adaptive pretraining method ([Gururangan et al.](https://aclanthology.org/2020.acl-main.740/)) for the patent domain. Allenai/scibert_scivocab_uncased is used as base for the training. The training dataset used consists of a corpus of 10,000,000
7
+ patent abstracts that have been filed between 1998-2020 in US and European patent offices as well as the World Intellectual Property Organization.