Update README.md
## About

Here we share a pretrained BERT model that is aware of math tokens. The math tokens are treated specially and tokenized using [pya0](https://github.com/approach0/pya0), which adds a very limited number of new tokens for LaTeX markup (the total vocabulary is just 31,061).

This model was trained on 4 x 2 Tesla V100 GPUs with a total batch size of 64, using Math StackExchange data with 2.7 million sentence pairs for 7 epochs.

### Usage

Download the model and try it out:

```
python test.py --test_file test.txt
```
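Before running `test.py`, you need a `test.txt` in the expected layout. A minimal sketch of writing one is below; the exact column layout is an assumption based on the format description in this README (first column holds extra mask positions, `0` meaning none, followed by the sentence pair):

```python
# Write a couple of sample test lines for test.py.
# Assumed layout (not confirmed by the repo): mask positions,
# then the left sentence, then the right sentence, tab-separated.
rows = [
    "0\tWhat is the derivative of $x^2$?\tThe derivative of $x^2$ is $2x$.",
    "0\tState Euler's identity.\tEuler's identity is $e^{i\\pi} + 1 = 0$.",
]
with open("test.txt", "w") as f:
    f.write("\n".join(rows) + "\n")
```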
### Test file format

Modify the test examples in `test.txt` to play with it.

The test file is tab-separated. The first column lists additional positions you want to mask in the right-side sentence (useful for masking tokens inside math markup); a zero means no additional mask positions.
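The column convention above can be sketched as a small parser. How multiple mask positions are delimited is not stated in this README, so the comma-separated encoding below is an assumption:

```python
def parse_test_line(line):
    """Split one tab-separated test line into mask positions and sentences.

    First column: extra mask positions for the right-side sentence;
    "0" means no additional mask positions. Multiple positions are
    assumed (not confirmed) to be comma-separated token indices.
    """
    fields = line.rstrip("\n").split("\t")
    mask_spec, sentences = fields[0], fields[1:]
    if mask_spec == "0":
        positions = []
    else:
        positions = [int(p) for p in mask_spec.split(",")]
    return positions, sentences
```

For example, `parse_test_line("0\tleft sentence\tright sentence")` yields no extra mask positions and the two sentences.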
### Example output

*(screenshot of example output)*