soughtlin commited on
Commit
9b43e02
·
verified ·
1 Parent(s): 52e442c

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -2
README.md CHANGED
@@ -4,8 +4,8 @@
4
 
5
  The dataset includes Small Training (100k), Large Training (10k), Validation (500), and Test (200) sets in `.jsonl` format.
6
 
7
- * **Download Link:** [Baidu Netdisk](https://pan.baidu.com/s/1TuaGjNvTESt9ZdEQy1BogA?pwd=u9i2) or you can directily use data/precessed...
8
- * **Note:** If resources are limited, you may use 10k samples from the Small Training Set, though using the Large Training Set is encouraged.
9
 
10
  Download checkpoints and respective config.yaml, and put them under the directory "runs/train"
11
 
 
4
 
5
  The dataset includes Small Training (100k), Large Training (10k), Validation (500), and Test (200) sets in `.jsonl` format.
6
 
7
+ * **Download Link:** [Baidu Netdisk](https://pan.baidu.com/s/1TuaGjNvTESt9ZdEQy1BogA?pwd=u9i2)
8
+ * **Note:** You can preprocess the data by preprocess.py or directly use "data/processed_nltk_100k"
9
 
10
  Download checkpoints and respective config.yaml, and put them under the directory "runs/train"
11