WARNING:torch.distributed.run: ***************************************** Setting OMP_NUM_THREADS environment variable for each process to be 1 in default, to avoid your system being overloaded, please further tune the variable for optimal performance in your application as needed. ***************************************** model training desc: 使用随机选择的关键句训练 2023-12-07 13:36:09.354 | INFO | __main__:init_components:108 - Initializing components... model training desc: 使用随机选择的关键句训练 2023-12-07 13:36:09.360 | INFO | __main__:init_components:108 - Initializing components... You are using the default legacy behaviour of the . If you see this, DO NOT PANIC! This is expected, and simply means that the `legacy` (previous) behavior will be used so nothing changes for you. If you want to use the new behaviour, set `legacy=False`. This should only be set if you understand what it means, and thouroughly read the reason why this was added as explained in https://github.com/huggingface/transformers/pull/24565 2023-12-07 13:36:37.046 | INFO | __main__:init_components:143 - 2023-12-07 13:36:37.046 | INFO | __main__:init_components:144 - ******************** 2023-12-07 13:36:37.046 | INFO | __main__:init_components:145 - using TechGPT-7B 2023-12-07 13:36:37.046 | INFO | __main__:init_components:146 - ******************** 2023-12-07 13:36:37.046 | INFO | __main__:init_components:147 - memory footprint of model: 5.472740173339844 GB You are using the default legacy behaviour of the . If you see this, DO NOT PANIC! This is expected, and simply means that the `legacy` (previous) behavior will be used so nothing changes for you. If you want to use the new behaviour, set `legacy=False`. This should only be set if you understand what it means, and thouroughly read the reason why this was added as explained in https://github.com/huggingface/transformers/pull/24565 2023-12-07 13:36:37.818 | INFO | __main__:init_components:143 - 2023-12-07 13:36:37.819 | INFO | __main__:init_components:144 - ******************** 2023-12-07 13:36:37.819 | INFO | __main__:init_components:145 - using TechGPT-7B 2023-12-07 13:36:37.819 | INFO | __main__:init_components:146 - ******************** 2023-12-07 13:36:37.819 | INFO | __main__:init_components:147 - memory footprint of model: 5.472740173339844 GB trainable params: 319,815,680 || all params: 7,447,007,232 || trainable%: 4.294553100818044 2023-12-07 13:36:39.748 | INFO | component.dataset:__init__:14 - Loading data: /data0/maqi/KGLQA-data/datasets/NCR/random_select/ncr_random_1400_instruct/train.jsonl 2023-12-07 13:36:39.846 | INFO | component.dataset:__init__:19 - there are 15319 data in dataset 2023-12-07 13:36:39.938 | INFO | __main__:main:231 - *** starting training *** trainable params: 319,815,680 || all params: 7,447,007,232 || trainable%: 4.294553100818044 2023-12-07 13:36:40.517 | INFO | component.dataset:__init__:14 - Loading data: /data0/maqi/KGLQA-data/datasets/NCR/random_select/ncr_random_1400_instruct/train.jsonl 2023-12-07 13:36:40.618 | INFO | component.dataset:__init__:19 - there are 15319 data in dataset 2023-12-07 13:36:40.712 | INFO | __main__:main:231 - *** starting training *** 0%| | 0/1149 [00:00