Update README.md
Browse files
README.md
CHANGED
|
@@ -107,7 +107,7 @@ Use the code below to get started with the model.
|
|
| 107 |
- This section provides information about throughput, start/end time, checkpoint size if relevant, etc.
|
| 108 |
|
| 109 |
[More Information Needed]
|
| 110 |
-
|
| 111 |
## Evaluation
|
| 112 |
|
| 113 |
This section describes the evaluation protocols and provides the results.
|
|
@@ -116,26 +116,24 @@ Use the code below to get started with the model.
|
|
| 116 |
|
| 117 |
#### Testing Data
|
| 118 |
|
| 119 |
-
This
|
| 120 |
-
|
| 121 |
-
[More Information Needed]
|
| 122 |
|
|
|
|
| 123 |
#### Factors
|
| 124 |
|
| 125 |
These are the things the evaluation is disaggregating by, e.g., subpopulations or domains.
|
| 126 |
|
| 127 |
[More Information Needed]
|
| 128 |
-
|
| 129 |
#### Metrics
|
| 130 |
|
| 131 |
-
|
| 132 |
-
|
| 133 |
-
[More Information Needed]
|
| 134 |
|
| 135 |
### Results
|
| 136 |
|
| 137 |
-
|
| 138 |
|
|
|
|
| 139 |
#### Summary
|
| 140 |
|
| 141 |
|
|
|
|
| 107 |
- This section provides information about throughput, start/end time, checkpoint size if relevant, etc.
|
| 108 |
|
| 109 |
[More Information Needed]
|
| 110 |
+
-->
|
| 111 |
## Evaluation
|
| 112 |
|
| 113 |
This section describes the evaluation protocols and provides the results.
|
|
|
|
| 116 |
|
| 117 |
#### Testing Data
|
| 118 |
|
| 119 |
+
This model was tested on `jigsaw_toxic_pred` testset.
|
|
|
|
|
|
|
| 120 |
|
| 121 |
+
<!--
|
| 122 |
#### Factors
|
| 123 |
|
| 124 |
These are the things the evaluation is disaggregating by, e.g., subpopulations or domains.
|
| 125 |
|
| 126 |
[More Information Needed]
|
| 127 |
+
-->
|
| 128 |
#### Metrics
|
| 129 |
|
| 130 |
+
Model was evaluated using `perplexity` (on the MLM task).
|
|
|
|
|
|
|
| 131 |
|
| 132 |
### Results
|
| 133 |
|
| 134 |
+
Perplexity: _1.03_
|
| 135 |
|
| 136 |
+
<!--
|
| 137 |
#### Summary
|
| 138 |
|
| 139 |
|