Update README.md
Browse files
README.md
CHANGED
|
@@ -6,7 +6,7 @@ tags: []
|
|
| 6 |
# 5CD-AI/visocial-T5-base
|
| 7 |
## Overview
|
| 8 |
<!-- Provide a quick summary of what the model is/does. -->
|
| 9 |
-
We continually
|
| 10 |
- Internal data (100M comments and 15M posts on Facebook)
|
| 11 |
- UIT data[2], which is used to pretrain `uitnlp/visobert`[2]
|
| 12 |
- MC4 ecommerce
|
|
|
|
| 6 |
# 5CD-AI/visocial-T5-base
|
| 7 |
## Overview
|
| 8 |
<!-- Provide a quick summary of what the model is/does. -->
|
| 9 |
+
We trimmed vocabulary size to 50,589 and continually pretrained `google/mt5-base`[1] on a merged 20GB dataset, the training dataset includes:
|
| 10 |
- Internal data (100M comments and 15M posts on Facebook)
|
| 11 |
- UIT data[2], which is used to pretrain `uitnlp/visobert`[2]
|
| 12 |
- MC4 ecommerce
|