Update README.md
Browse files
README.md
CHANGED
|
@@ -73,8 +73,8 @@ print("predicted rating:", pred)
|
|
| 73 |
|---|---|
|
| 74 |
| Base model | hfl/chinese-roberta-wwm-ext |
|
| 75 |
| Training framework | 🤗 transformers `Trainer` |
|
| 76 |
-
| Training set |
|
| 77 |
-
| Validation set |
|
| 78 |
| Test set | full original test set |
|
| 79 |
| Max sequence length | 256 |
|
| 80 |
| Training epochs | 3 |
|
|
@@ -85,7 +85,7 @@ print("predicted rating:", pred)
|
|
| 85 |
| Scheduler | linear warmup (warmup_ratio=0.1) |
|
| 86 |
| Precision | FP16 |
|
| 87 |
| Best-model criterion | **QWK (↑)** |
|
| 88 |
-
| Training time | ≈
|
| 89 |
| Logging interval | every 10 steps |
|
| 90 |
|
| 91 |
---
|
|
|
|
| 73 |
|---|---|
|
| 74 |
| Base model | hfl/chinese-roberta-wwm-ext |
|
| 75 |
| Training framework | 🤗 transformers `Trainer` |
|
| 76 |
+
| Training set | 150 000 samples (randomly drawn from 2000 K) |
|
| 77 |
+
| Validation set | 15 000 samples (same random draw) |
|
| 78 |
| Test set | full original test set |
|
| 79 |
| Max sequence length | 256 |
|
| 80 |
| Training epochs | 3 |
|
|
|
|
| 85 |
| Scheduler | linear warmup (warmup_ratio=0.1) |
|
| 86 |
| Precision | FP16 |
|
| 87 |
| Best-model criterion | **QWK (↑)** |
|
| 88 |
+
| Training time | ≈ 120 min on single P100 (FP16) |
|
| 89 |
| Logging interval | every 10 steps |
|
| 90 |
|
| 91 |
---
|