Update README.md
Browse files
README.md
CHANGED
|
@@ -279,8 +279,8 @@ We also selected three authoritative datasets containing benign samples:
|
|
| 279 |
|
| 280 |
+ **fka/awesome-chatgpt-prompts**[<sup>[awesome]</sup>](#awesome): 203 English samples
|
| 281 |
+ **StrongReject-Benign**[<sup>[Chi2024]</sup>](#Chi2024): 3,800 English samples, the benign portion of StrongReject
|
| 282 |
-
+ **COIG-
|
| 283 |
-
> COIG-
|
| 284 |
|
| 285 |
By merging the six datasets listed above, we constructed a comprehensive test set to evaluate the effectiveness of the proposed method on third-party data. This test set contains:
|
| 286 |
|
|
|
|
| 279 |
|
| 280 |
+ **fka/awesome-chatgpt-prompts**[<sup>[awesome]</sup>](#awesome): 203 English samples
|
| 281 |
+ **StrongReject-Benign**[<sup>[Chi2024]</sup>](#Chi2024): 3,800 English samples, the benign portion of StrongReject
|
| 282 |
+
+ **COIG-CQIA**[<sup>[CQIA]</sup>](#CQIA): 44,694 Chinese samples
|
| 283 |
+
> COIG-CQIA (Chinese Open Instruction Generalist - *Quality is All You Need*) is an open-source, high-quality instruction tuning dataset designed to support human-aligned interactions in the Chinese NLP community.
|
| 284 |
|
| 285 |
By merging the six datasets listed above, we constructed a comprehensive test set to evaluate the effectiveness of the proposed method on third-party data. This test set contains:
|
| 286 |
|