Commit e02aba1 (parent: ab93dff): Updated model card

README.md CHANGED
@@ -19,8 +19,6 @@ It is trained on the following three financial communication corpus. The total c
 - Corporate Reports 10-K & 10-Q: 2.5B tokens
 - Earnings Call Transcripts: 1.3B tokens
 - Analyst Reports: 1.1B tokens
-- Demo.org Proprietary Reports
-- Additional purchased data from Factset
 
 The entire training is done using an **NVIDIA DGX-1** machine. The server has 4 Tesla P100 GPUs, providing a total of 128 GB of GPU memory. This machine enables us to train the BERT models using a batch size of 128. We utilize the Horovod framework for multi-GPU training. Overall, the total time taken to perform pretraining for one model is approximately **2 days**.
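The card's note on Horovod multi-GPU training can be made concrete with a small sketch. Under Horovod-style data parallelism each worker (GPU) processes an equal shard of every batch; assuming, as the card does not specify, that the reported batch size of 128 is the global batch, the per-GPU share works out as follows:

```python
# Hedged sketch of data-parallel batch sharding, Horovod-style.
# Assumption (not stated in the card): the batch size of 128 is the
# global batch, split evenly across the DGX-1's 4 Tesla P100 GPUs.
num_gpus = 4          # GPUs reported in the card
global_batch = 128    # batch size reported in the card

per_gpu_batch, remainder = divmod(global_batch, num_gpus)
assert remainder == 0, "global batch must divide evenly across GPUs"
print(per_gpu_batch)  # -> 32 sequences per GPU per step
```

If instead 128 were the per-GPU batch, the effective global batch would be 128 × 4 = 512; the card does not say which reading is intended.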