HYdsl committed on
Commit
68ce11a
·
1 Parent(s): 4521bcd

Update README.md

Files changed (1)
  1. README.md +7 -0
README.md CHANGED
@@ -28,5 +28,12 @@ This is our foundational model, trained on the entire range of corpora as outlin
  
  This model is specialized for handling SEC filings. We expanded the training set by adding 3.1 billion tokens from the SEC filings corpus dataset. The dataset is sourced from EDGAR-CORPUS: Billions of Tokens Make The World Go Round (Loukas et al., ECONLP 2021) and can be downloaded from Zenodo. 📑
  
+ To load the tokenizer and the model, use 'roberta-base' for the tokenizer and 'HYdsl/FiLM' for the model:
+ ```python
+ from transformers import AutoModel, AutoTokenizer
+ tokenizer = AutoTokenizer.from_pretrained('roberta-base')
+ model = AutoModel.from_pretrained('HYdsl/FiLM')
+ ```
+
  **Types of Training Corpora 📚**
  ![image.png](https://cdn-uploads.huggingface.co/production/uploads/65254614785092cd47b1110b/-cT_wOabHugsct1mogOpa.png)
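
The loading snippet added in this commit stops once the tokenizer and model are created. As a rough sketch of what a caller might do next, the example below wraps the two `from_pretrained` calls in a function and mean-pools the encoder output into one vector per text. The helper names `load_film` and `embed` are hypothetical and not part of the README. The sketch also assumes that `HYdsl/FiLM` loads as a RoBERTa-style encoder whose output exposes `last_hidden_state`, and that `torch` and `transformers` are installed.

```python
from typing import List

TOKENIZER_ID = "roberta-base"  # tokenizer checkpoint named in the README
MODEL_ID = "HYdsl/FiLM"        # model checkpoint named in the README


def load_film():
    """Load the tokenizer and encoder (hypothetical helper, not from the README)."""
    # Imported lazily so that defining these helpers does not require transformers.
    from transformers import AutoModel, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(TOKENIZER_ID)
    model = AutoModel.from_pretrained(MODEL_ID)
    model.eval()  # switch off dropout for inference
    return tokenizer, model


def embed(texts: List[str], tokenizer, model):
    """Return one mean-pooled vector per input text (hypothetical helper)."""
    import torch

    batch = tokenizer(texts, padding=True, truncation=True, return_tensors="pt")
    with torch.no_grad():
        # Assumption: the encoder output exposes last_hidden_state
        # with shape (batch, seq_len, hidden).
        hidden = model(**batch).last_hidden_state
    mask = batch["attention_mask"].unsqueeze(-1).to(hidden.dtype)
    # Average over real tokens only, ignoring padding positions.
    return (hidden * mask).sum(dim=1) / mask.sum(dim=1)


# Example usage (downloads both checkpoints on the first run):
# tokenizer, model = load_film()
# vectors = embed(["The company filed its annual report."], tokenizer, model)
```

Mean pooling is only one way to turn token states into a single vector; it is used here for illustration and is not something the README prescribes.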