AniFileBERT / data /dmhy /mixed_train.manifest.json
Finish organizing the entire dataset
f4f4e0e
215 Bytes
{
  "synthetic": "data/synthetic.jsonl",
  "dmhy": "data/dmhy/dmhy_weak.jsonl",
  "output": "data/dmhy/mixed_train.jsonl",
  "synthetic_count": 100000,
  "dmhy_count": 632002,
  "total_count": 732002,
  "seed": 42
}
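
The manifest above can be sanity-checked with a short script. The sketch below is not part of AniFileBERT: `check_manifest` is a hypothetical helper, and it assumes that `total_count` is meant to equal `synthetic_count + dmhy_count`, which holds for the values shown (100000 + 632002 = 732002). The manifest is embedded as a string so the example runs without the repository files.

```python
import json

# Manifest content copied from the file above, embedded so the sketch is self-contained.
MANIFEST_TEXT = """
{
  "synthetic": "data/synthetic.jsonl",
  "dmhy": "data/dmhy/dmhy_weak.jsonl",
  "output": "data/dmhy/mixed_train.jsonl",
  "synthetic_count": 100000,
  "dmhy_count": 632002,
  "total_count": 732002,
  "seed": 42
}
"""

REQUIRED_KEYS = (
    "synthetic", "dmhy", "output",
    "synthetic_count", "dmhy_count", "total_count", "seed",
)


def check_manifest(manifest: dict) -> list[str]:
    """Hypothetical helper: return a list of problems; empty means consistent."""
    problems = [f"missing key: {key}" for key in REQUIRED_KEYS if key not in manifest]
    if problems:
        return problems

    # Assumption: the mixed set is the plain concatenation of both sources.
    expected = manifest["synthetic_count"] + manifest["dmhy_count"]
    if manifest["total_count"] != expected:
        problems.append(
            f"total_count is {manifest['total_count']}, expected {expected}"
        )
    return problems


manifest = json.loads(MANIFEST_TEXT)
problems = check_manifest(manifest)
print(problems or "manifest counts are consistent")
```

To check the real file instead, the embedded string could be replaced with `json.load` on `data/dmhy/mixed_train.manifest.json`, assuming the script runs from the repository root.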