Fill-Mask
Transformers
Safetensors
mt5
text2text-generation
htdung167 commited on
Commit
5525f4a
·
verified ·
1 Parent(s): 7a8372d

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -6,7 +6,7 @@ tags: []
6
  # 5CD-AI/visocial-T5-base
7
  ## Overview
8
  <!-- Provide a quick summary of what the model is/does. -->
9
- We continually pretrain `google/mt5-base`[1] on a merged 20GB dataset, the training dataset includes:
10
  - Internal data (100M comments and 15M posts on Facebook)
11
  - UIT data[2], which is used to pretrain `uitnlp/visobert`[2]
12
  - MC4 ecommerce
 
6
  # 5CD-AI/visocial-T5-base
7
  ## Overview
8
  <!-- Provide a quick summary of what the model is/does. -->
9
+ We trimmed vocabulary size to 50,589 and continually pretrained `google/mt5-base`[1] on a merged 20GB dataset, the training dataset includes:
10
  - Internal data (100M comments and 15M posts on Facebook)
11
  - UIT data[2], which is used to pretrain `uitnlp/visobert`[2]
12
  - MC4 ecommerce