diff --git "a/ncr_cclue/without_knowledge_random/train.log" "b/ncr_cclue/without_knowledge_random/train.log"
new file mode 100644
--- /dev/null
+++ "b/ncr_cclue/without_knowledge_random/train.log"
@@ -0,0 +1,259 @@
+model training desc: no knowledge selection; trained on the NCR+CCLUE datasets with randomly selected key sentences
+2023-12-10 12:22:55.340 | INFO | __main__:init_components:108 - Initializing components...
+You are using the default legacy behaviour of the . If you see this, DO NOT PANIC! This is expected, and simply means that the `legacy` (previous) behavior will be used so nothing changes for you. If you want to use the new behaviour, set `legacy=False`. This should only be set if you understand what it means, and thouroughly read the reason why this was added as explained in https://github.com/huggingface/transformers/pull/24565
+2023-12-10 12:24:20.449 | INFO | __main__:init_components:143 -
+
+2023-12-10 12:24:20.449 | INFO | __main__:init_components:144 - ********************
+2023-12-10 12:24:20.449 | INFO | __main__:init_components:145 - using TechGPT-7B
+2023-12-10 12:24:20.449 | INFO | __main__:init_components:146 - ********************
+2023-12-10 12:24:20.449 | INFO | __main__:init_components:147 -
+
+memory footprint of model: 5.472740173339844 GB
+trainable params: 319,815,680 || all params: 7,447,007,232 || trainable%: 4.294553100818044
+2023-12-10 12:24:23.862 | INFO | component.dataset:__init__:14 - Loading data: /data0/maqi/KGLQA-data/datasets/merge/random_select/without_knowledge_random_instruction/train.jsonl
+2023-12-10 12:24:24.188 | INFO | component.dataset:__init__:19 - there are 18892 data in dataset
+2023-12-10 12:24:24.205 | INFO | __main__:main:231 - *** starting training ***
+ 0%| | 0/4722 [00:00