tteofili
/

gminus

@@ -107,7 +107,7 @@ Use the code below to get started with the model.
 - This section provides information about throughput, start/end time, checkpoint size if relevant, etc.
 [More Information Needed]
 ## Evaluation
  This section describes the evaluation protocols and provides the results.
@@ -116,26 +116,24 @@ Use the code below to get started with the model.
 #### Testing Data
- This should link to a Data Card if possible.
-[More Information Needed]
 #### Factors
  These are the things the evaluation is disaggregating by, e.g., subpopulations or domains.
 [More Information Needed]
 #### Metrics
- These are the evaluation metrics being used, ideally with a description of why.
-[More Information Needed]
 ### Results
-[More Information Needed]
 #### Summary

 - This section provides information about throughput, start/end time, checkpoint size if relevant, etc.
 [More Information Needed]
+-->
 ## Evaluation
  This section describes the evaluation protocols and provides the results.
 #### Testing Data
+ This model was tested on `jigsaw_toxic_pred` testset.
+<!--
 #### Factors
  These are the things the evaluation is disaggregating by, e.g., subpopulations or domains.
 [More Information Needed]
+-->
 #### Metrics
+ Model was evaluated using `perplexity` (on the MLM task).
 ### Results
+Perplexity: _1.03_
+<!--
 #### Summary