model training desc: 做知识选择,使用QuALITY数据集,随机选择的知识和关键句训练 2023-12-16 23:07:37.734 | INFO | __main__:init_components:108 - Initializing components... Loading checkpoint shards: 0%| | 0/2 [00:00. If you see this, DO NOT PANIC! This is expected, and simply means that the `legacy` (previous) behavior will be used so nothing changes for you. If you want to use the new behaviour, set `legacy=False`. This should only be set if you understand what it means, and thouroughly read the reason why this was added as explained in https://github.com/huggingface/transformers/pull/24565 2023-12-16 23:08:41.148 | INFO | __main__:init_components:155 - 2023-12-16 23:08:41.148 | INFO | __main__:init_components:156 - ******************** 2023-12-16 23:08:41.148 | INFO | __main__:init_components:157 - using llama2 model 2023-12-16 23:08:41.148 | INFO | __main__:init_components:158 - ******************** 2023-12-16 23:08:41.148 | INFO | __main__:init_components:159 - memory footprint of model: 4.024436950683594 GB trainable params: 319,815,680 || all params: 7,058,231,296 || trainable%: 4.531102291607305 2023-12-16 23:08:44.549 | INFO | component.dataset:__init__:14 - Loading data: /data0/maqi/KGLQA-data/datasets/QuALITY/random_select/with_knowledge_without_select_instruction/train.jsonl 2023-12-16 23:08:44.647 | INFO | component.dataset:__init__:19 - there are 2523 data in dataset 2023-12-16 23:08:44.773 | INFO | __main__:main:231 - *** starting training *** 0%| | 0/630 [00:00