Update README.md
README.md
@@ -64,5 +64,8 @@ The output then looks like:
[Score: 0.896] When will the recession happen? <sep> never <sep> upcoming January-March
```
+Ethical Considerations
+This release is for research purposes only in support of an academic paper. Our models, datasets, and code are not specifically designed or evaluated for all downstream purposes. We strongly recommend users evaluate and address potential concerns related to accuracy, safety, and fairness before deploying this model. We encourage users to consider the common limitations of AI, comply with applicable laws, and leverage best practices when selecting use cases, particularly for high-risk scenarios where errors or misuse could significantly impact people’s lives, rights, or safety. For further guidance on use cases, refer to our AUP and AI AUP.
+
In the paper, we find that a threshold of `T=2.75` achieves the highest F1 score on the validation portions of the two datasets. In the above example, only the first three pairs would be classified as equivalent answers, and all pairs below would be labeled as non-equivalent answers.
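
As a rough illustration of the thresholding rule described above, the following sketch labels a list of scores against `T=2.75`. The function name `classify_pairs`, the use of `>=` for the comparison, and the first three sample scores are assumptions made for this example. Only the threshold value and the `0.896` score come from the text.

```python
# Threshold reported in the text; the comparison direction (>=) is an assumption.
THRESHOLD = 2.75


def classify_pairs(scores, threshold=THRESHOLD):
    """Return True for each score treated as an equivalent answer pair."""
    return [score >= threshold for score in scores]


# Hypothetical scores for illustration; only 0.896 appears in the example output.
example_scores = [4.91, 3.40, 2.80, 0.896]
labels = classify_pairs(example_scores)

for score, is_equivalent in zip(example_scores, labels):
    verdict = "equivalent" if is_equivalent else "non-equivalent"
    print(f"[Score: {score}] {verdict}")
```

Under these assumptions, the pair scored `0.896` falls below the threshold and would be labeled as non-equivalent.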