Mistral-7B-PT-TEST / running_log.txt
[WARNING|2025-03-09 01:23:22] logging.py:162 >> We recommend enable `upcast_layernorm` in quantized training.
[INFO|2025-03-09 01:23:22] parser.py:355 >> Process rank: 0, device: cuda:0, n_gpu: 1, distributed training: False, compute dtype: torch.bfloat16
[INFO|2025-03-09 01:23:22] configuration_utils.py:679 >> loading configuration file config.json from cache at /home/zeus/.cache/huggingface/hub/models--mistralai--Mistral-7B-Instruct-v0.3/snapshots/e0bc86c23ce5aae1db576c8cca6f06f1f73af2db/config.json
[INFO|2025-03-09 01:23:22] configuration_utils.py:746 >> Model config MistralConfig {
"_name_or_path": "mistralai/Mistral-7B-Instruct-v0.3",
"architectures": [
"MistralForCausalLM"
],
"attention_dropout": 0.0,
"bos_token_id": 1,
"eos_token_id": 2,
"head_dim": 128,
"hidden_act": "silu",
"hidden_size": 4096,
"initializer_range": 0.02,
"intermediate_size": 14336,
"max_position_embeddings": 32768,
"model_type": "mistral",
"num_attention_heads": 32,
"num_hidden_layers": 32,
"num_key_value_heads": 8,
"rms_norm_eps": 1e-05,
"rope_theta": 1000000.0,
"sliding_window": null,
"tie_word_embeddings": false,
"torch_dtype": "bfloat16",
"transformers_version": "4.46.1",
"use_cache": true,
"vocab_size": 32768
}
[INFO|2025-03-09 01:23:23] tokenization_utils_base.py:2211 >> loading file tokenizer.model from cache at /home/zeus/.cache/huggingface/hub/models--mistralai--Mistral-7B-Instruct-v0.3/snapshots/e0bc86c23ce5aae1db576c8cca6f06f1f73af2db/tokenizer.model
[INFO|2025-03-09 01:23:23] tokenization_utils_base.py:2211 >> loading file tokenizer.json from cache at /home/zeus/.cache/huggingface/hub/models--mistralai--Mistral-7B-Instruct-v0.3/snapshots/e0bc86c23ce5aae1db576c8cca6f06f1f73af2db/tokenizer.json
[INFO|2025-03-09 01:23:23] tokenization_utils_base.py:2211 >> loading file added_tokens.json from cache at None
[INFO|2025-03-09 01:23:23] tokenization_utils_base.py:2211 >> loading file special_tokens_map.json from cache at /home/zeus/.cache/huggingface/hub/models--mistralai--Mistral-7B-Instruct-v0.3/snapshots/e0bc86c23ce5aae1db576c8cca6f06f1f73af2db/special_tokens_map.json
[INFO|2025-03-09 01:23:23] tokenization_utils_base.py:2211 >> loading file tokenizer_config.json from cache at /home/zeus/.cache/huggingface/hub/models--mistralai--Mistral-7B-Instruct-v0.3/snapshots/e0bc86c23ce5aae1db576c8cca6f06f1f73af2db/tokenizer_config.json
[INFO|2025-03-09 01:23:23] configuration_utils.py:679 >> loading configuration file config.json from cache at /home/zeus/.cache/huggingface/hub/models--mistralai--Mistral-7B-Instruct-v0.3/snapshots/e0bc86c23ce5aae1db576c8cca6f06f1f73af2db/config.json
[INFO|2025-03-09 01:23:23] configuration_utils.py:746 >> Model config MistralConfig {
"_name_or_path": "mistralai/Mistral-7B-Instruct-v0.3",
"architectures": [
"MistralForCausalLM"
],
"attention_dropout": 0.0,
"bos_token_id": 1,
"eos_token_id": 2,
"head_dim": 128,
"hidden_act": "silu",
"hidden_size": 4096,
"initializer_range": 0.02,
"intermediate_size": 14336,
"max_position_embeddings": 32768,
"model_type": "mistral",
"num_attention_heads": 32,
"num_hidden_layers": 32,
"num_key_value_heads": 8,
"rms_norm_eps": 1e-05,
"rope_theta": 1000000.0,
"sliding_window": null,
"tie_word_embeddings": false,
"torch_dtype": "bfloat16",
"transformers_version": "4.46.1",
"use_cache": true,
"vocab_size": 32768
}
[INFO|2025-03-09 01:23:23] tokenization_utils_base.py:2211 >> loading file tokenizer.model from cache at /home/zeus/.cache/huggingface/hub/models--mistralai--Mistral-7B-Instruct-v0.3/snapshots/e0bc86c23ce5aae1db576c8cca6f06f1f73af2db/tokenizer.model
[INFO|2025-03-09 01:23:23] tokenization_utils_base.py:2211 >> loading file tokenizer.json from cache at /home/zeus/.cache/huggingface/hub/models--mistralai--Mistral-7B-Instruct-v0.3/snapshots/e0bc86c23ce5aae1db576c8cca6f06f1f73af2db/tokenizer.json
[INFO|2025-03-09 01:23:23] tokenization_utils_base.py:2211 >> loading file added_tokens.json from cache at None
[INFO|2025-03-09 01:23:23] tokenization_utils_base.py:2211 >> loading file special_tokens_map.json from cache at /home/zeus/.cache/huggingface/hub/models--mistralai--Mistral-7B-Instruct-v0.3/snapshots/e0bc86c23ce5aae1db576c8cca6f06f1f73af2db/special_tokens_map.json
[INFO|2025-03-09 01:23:23] tokenization_utils_base.py:2211 >> loading file tokenizer_config.json from cache at /home/zeus/.cache/huggingface/hub/models--mistralai--Mistral-7B-Instruct-v0.3/snapshots/e0bc86c23ce5aae1db576c8cca6f06f1f73af2db/tokenizer_config.json
[INFO|2025-03-09 01:23:24] logging.py:157 >> Add pad token: </s>
[INFO|2025-03-09 01:23:24] logging.py:157 >> Loading dataset treino_novo_pt.json...
[INFO|2025-03-09 01:23:25] configuration_utils.py:679 >> loading configuration file config.json from cache at /home/zeus/.cache/huggingface/hub/models--mistralai--Mistral-7B-Instruct-v0.3/snapshots/e0bc86c23ce5aae1db576c8cca6f06f1f73af2db/config.json
[INFO|2025-03-09 01:23:25] configuration_utils.py:746 >> Model config MistralConfig {
"_name_or_path": "mistralai/Mistral-7B-Instruct-v0.3",
"architectures": [
"MistralForCausalLM"
],
"attention_dropout": 0.0,
"bos_token_id": 1,
"eos_token_id": 2,
"head_dim": 128,
"hidden_act": "silu",
"hidden_size": 4096,
"initializer_range": 0.02,
"intermediate_size": 14336,
"max_position_embeddings": 32768,
"model_type": "mistral",
"num_attention_heads": 32,
"num_hidden_layers": 32,
"num_key_value_heads": 8,
"rms_norm_eps": 1e-05,
"rope_theta": 1000000.0,
"sliding_window": null,
"tie_word_embeddings": false,
"torch_dtype": "bfloat16",
"transformers_version": "4.46.1",
"use_cache": true,
"vocab_size": 32768
}
[INFO|2025-03-09 01:23:25] logging.py:157 >> Quantizing model to 4 bit with bitsandbytes.
[INFO|2025-03-09 01:23:29] configuration_utils.py:679 >> loading configuration file config.json from cache at /home/zeus/.cache/huggingface/hub/models--unsloth--mistral-7b-instruct-v0.3-bnb-4bit/snapshots/d5f623888f1415cf89b5c208d09cb620694618ee/config.json
[INFO|2025-03-09 01:23:29] configuration_utils.py:746 >> Model config MistralConfig {
"_name_or_path": "unsloth/mistral-7b-instruct-v0.3-bnb-4bit",
"architectures": [
"MistralForCausalLM"
],
"attention_dropout": 0.0,
"bos_token_id": 1,
"eos_token_id": 2,
"head_dim": 128,
"hidden_act": "silu",
"hidden_size": 4096,
"initializer_range": 0.02,
"intermediate_size": 14336,
"max_position_embeddings": 32768,
"model_type": "mistral",
"num_attention_heads": 32,
"num_hidden_layers": 32,
"num_key_value_heads": 8,
"pad_token_id": 770,
"quantization_config": {
"_load_in_4bit": true,
"_load_in_8bit": false,
"bnb_4bit_compute_dtype": "bfloat16",
"bnb_4bit_quant_storage": "uint8",
"bnb_4bit_quant_type": "nf4",
"bnb_4bit_use_double_quant": true,
"llm_int8_enable_fp32_cpu_offload": false,
"llm_int8_has_fp16_weight": false,
"llm_int8_skip_modules": null,
"llm_int8_threshold": 6.0,
"load_in_4bit": true,
"load_in_8bit": false,
"quant_method": "bitsandbytes"
},
"rms_norm_eps": 1e-05,
"rope_theta": 1000000.0,
"sliding_window": null,
"tie_word_embeddings": false,
"torch_dtype": "bfloat16",
"transformers_version": "4.46.1",
"unsloth_version": "2024.9",
"use_cache": true,
"vocab_size": 32768
}
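Note: the quantization_config above (NF4, double quantization, uint8 storage) implies a rough size for the quantized linear weights. The sketch below estimates it from the dimensions printed in the config. The block sizes (64 weights per absmax constant, 256 constants per second-level block) are assumptions taken from common bitsandbytes defaults; they do not appear in this log. Embeddings, lm_head and norm weights are left out because they are normally not quantized.

```python
# Rough size of the 4-bit linear weights described by the quantization_config.
# Assumed (not in the log): block size 64 for NF4, block size 256 for the
# second-level (double) quantization of the absmax constants.

HIDDEN = 4096          # hidden_size
INTERMEDIATE = 14336   # intermediate_size
LAYERS = 32            # num_hidden_layers
KV_DIM = 8 * 128       # num_key_value_heads * head_dim

# Weights in the linear layers of one decoder block.
attention = 2 * HIDDEN * HIDDEN + 2 * HIDDEN * KV_DIM  # q_proj, o_proj + k_proj, v_proj
mlp = 3 * HIDDEN * INTERMEDIATE                         # gate_proj, up_proj, down_proj
quantized_weights = LAYERS * (attention + mlp)

# 4 bits per weight, plus an 8-bit constant per 64 weights,
# plus a 32-bit constant per 64 * 256 weights.
bits_per_weight = 4 + 8 / 64 + 32 / (64 * 256)
approx_gib = quantized_weights * bits_per_weight / 8 / 2**30

print(f"quantized weights: {quantized_weights:,}")
print(f"bits per weight:   {bits_per_weight:.3f}")
print(f"approx. size:      {approx_gib:.2f} GiB")
```

This is only an order-of-magnitude check on the checkpoint size, not a measurement of GPU memory use during training.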
[INFO|2025-03-09 01:23:29] configuration_utils.py:679 >> loading configuration file config.json from cache at /home/zeus/.cache/huggingface/hub/models--unslothai--aws/snapshots/66e4c14a24a0b445779c922eef992a4af0694a88/config.json
[INFO|2025-03-09 01:23:29] configuration_utils.py:679 >> loading configuration file config.json from cache at /home/zeus/.cache/huggingface/hub/models--unslothai--repeat/snapshots/7c48478c02f84ed89f149b0815cc0216ee831fb0/config.json
[INFO|2025-03-09 01:23:30] configuration_utils.py:679 >> loading configuration file config.json from cache at /home/zeus/.cache/huggingface/hub/models--unslothai--vram-48/snapshots/3aea312d98ea327daeb5dbf7374b1d7cf8c65bc0/config.json
[INFO|2025-03-09 01:23:30] configuration_utils.py:679 >> loading configuration file config.json from cache at /home/zeus/.cache/huggingface/hub/models--unslothai--1/snapshots/7ec782b7604cd9ea0781c23a4270f031650f5617/config.json
[INFO|2025-03-09 01:23:30] configuration_utils.py:679 >> loading configuration file config.json from cache at /home/zeus/.cache/huggingface/hub/models--unsloth--mistral-7b-instruct-v0.3-bnb-4bit/snapshots/d5f623888f1415cf89b5c208d09cb620694618ee/config.json
[INFO|2025-03-09 01:23:30] configuration_utils.py:746 >> Model config MistralConfig {
"_name_or_path": "unsloth/mistral-7b-instruct-v0.3-bnb-4bit",
"architectures": [
"MistralForCausalLM"
],
"attention_dropout": 0.0,
"bos_token_id": 1,
"eos_token_id": 2,
"head_dim": 128,
"hidden_act": "silu",
"hidden_size": 4096,
"initializer_range": 0.02,
"intermediate_size": 14336,
"max_position_embeddings": 32768,
"model_type": "mistral",
"num_attention_heads": 32,
"num_hidden_layers": 32,
"num_key_value_heads": 8,
"pad_token_id": 770,
"quantization_config": {
"_load_in_4bit": true,
"_load_in_8bit": false,
"bnb_4bit_compute_dtype": "bfloat16",
"bnb_4bit_quant_storage": "uint8",
"bnb_4bit_quant_type": "nf4",
"bnb_4bit_use_double_quant": true,
"llm_int8_enable_fp32_cpu_offload": false,
"llm_int8_has_fp16_weight": false,
"llm_int8_skip_modules": null,
"llm_int8_threshold": 6.0,
"load_in_4bit": true,
"load_in_8bit": false,
"quant_method": "bitsandbytes"
},
"rms_norm_eps": 1e-05,
"rope_theta": 1000000.0,
"sliding_window": null,
"tie_word_embeddings": false,
"torch_dtype": "bfloat16",
"transformers_version": "4.46.1",
"unsloth_version": "2024.9",
"use_cache": true,
"vocab_size": 32768
}
[INFO|2025-03-09 01:23:30] configuration_utils.py:679 >> loading configuration file config.json from cache at /home/zeus/.cache/huggingface/hub/models--unsloth--mistral-7b-instruct-v0.3-bnb-4bit/snapshots/d5f623888f1415cf89b5c208d09cb620694618ee/config.json
[INFO|2025-03-09 01:23:30] configuration_utils.py:746 >> Model config MistralConfig {
"_name_or_path": "unsloth/mistral-7b-instruct-v0.3-bnb-4bit",
"architectures": [
"MistralForCausalLM"
],
"attention_dropout": 0.0,
"bos_token_id": 1,
"eos_token_id": 2,
"head_dim": 128,
"hidden_act": "silu",
"hidden_size": 4096,
"initializer_range": 0.02,
"intermediate_size": 14336,
"max_position_embeddings": 32768,
"model_type": "mistral",
"num_attention_heads": 32,
"num_hidden_layers": 32,
"num_key_value_heads": 8,
"pad_token_id": 770,
"quantization_config": {
"_load_in_4bit": true,
"_load_in_8bit": false,
"bnb_4bit_compute_dtype": "bfloat16",
"bnb_4bit_quant_storage": "uint8",
"bnb_4bit_quant_type": "nf4",
"bnb_4bit_use_double_quant": true,
"llm_int8_enable_fp32_cpu_offload": false,
"llm_int8_has_fp16_weight": false,
"llm_int8_skip_modules": null,
"llm_int8_threshold": 6.0,
"load_in_4bit": true,
"load_in_8bit": false,
"quant_method": "bitsandbytes"
},
"rms_norm_eps": 1e-05,
"rope_theta": 1000000.0,
"sliding_window": null,
"tie_word_embeddings": false,
"torch_dtype": "bfloat16",
"transformers_version": "4.46.1",
"unsloth_version": "2024.9",
"use_cache": true,
"vocab_size": 32768
}
[INFO|2025-03-09 01:23:51] modeling_utils.py:3937 >> loading weights file model.safetensors from cache at /home/zeus/.cache/huggingface/hub/models--unsloth--mistral-7b-instruct-v0.3-bnb-4bit/snapshots/d5f623888f1415cf89b5c208d09cb620694618ee/model.safetensors
[INFO|2025-03-09 01:23:51] modeling_utils.py:1670 >> Instantiating MistralForCausalLM model under default dtype torch.bfloat16.
[INFO|2025-03-09 01:23:51] configuration_utils.py:1096 >> Generate config GenerationConfig {
"bos_token_id": 1,
"eos_token_id": 2,
"pad_token_id": 770
}
[INFO|2025-03-09 01:23:53] modeling_utils.py:4800 >> All model checkpoint weights were used when initializing MistralForCausalLM.
[INFO|2025-03-09 01:23:53] modeling_utils.py:4808 >> All the weights of MistralForCausalLM were initialized from the model checkpoint at unsloth/mistral-7b-instruct-v0.3-bnb-4bit.
If your task is similar to the task the model of the checkpoint was trained on, you can already use MistralForCausalLM for predictions without further training.
[INFO|2025-03-09 01:23:53] configuration_utils.py:1051 >> loading configuration file generation_config.json from cache at /home/zeus/.cache/huggingface/hub/models--unsloth--mistral-7b-instruct-v0.3-bnb-4bit/snapshots/d5f623888f1415cf89b5c208d09cb620694618ee/generation_config.json
[INFO|2025-03-09 01:23:53] configuration_utils.py:1096 >> Generate config GenerationConfig {
"bos_token_id": 1,
"eos_token_id": 2,
"max_length": 32768,
"pad_token_id": 770
}
[INFO|2025-03-09 01:23:55] logging.py:157 >> Gradient checkpointing enabled.
[INFO|2025-03-09 01:23:55] logging.py:157 >> Upcasting trainable params to float32.
[INFO|2025-03-09 01:23:55] logging.py:157 >> Fine-tuning method: LoRA
[INFO|2025-03-09 01:23:55] logging.py:157 >> Found linear modules: up_proj,gate_proj,v_proj,down_proj,q_proj,o_proj,k_proj
[WARNING|2025-03-09 01:23:57] logging.py:168 >> Unsloth 2025.3.9 patched 32 layers with 32 QKV layers, 32 O layers and 32 MLP layers.
[INFO|2025-03-09 01:23:59] logging.py:157 >> trainable params: 20,971,520 || all params: 7,268,995,072 || trainable%: 0.2885
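Note: the parameter counts in the line above can be reproduced from the model config and the list of LoRA target modules. The sketch below is a consistency check, not code from the training framework. The LoRA rank of 8 is an assumption: it is not printed in this log, but it is the value that yields exactly 20,971,520 trainable parameters. The "packed" total assumes that two 4-bit weights are stored per uint8 element, which would account for the smaller 3,779,334,144 total that appears elsewhere in this log.

```python
# Reproduce the logged parameter counts from the printed MistralConfig.
HIDDEN, INTERMEDIATE, LAYERS, VOCAB = 4096, 14336, 32, 32768
KV_DIM = 8 * 128   # num_key_value_heads * head_dim
RANK = 8           # assumption: LoRA rank, inferred from the trainable count

# (in_features, out_features) of each LoRA target module in one decoder layer.
shapes = {
    "q_proj": (HIDDEN, HIDDEN),
    "k_proj": (HIDDEN, KV_DIM),
    "v_proj": (HIDDEN, KV_DIM),
    "o_proj": (HIDDEN, HIDDEN),
    "gate_proj": (HIDDEN, INTERMEDIATE),
    "up_proj": (HIDDEN, INTERMEDIATE),
    "down_proj": (INTERMEDIATE, HIDDEN),
}

# Each adapter adds A (rank x in) and B (out x rank).
lora_params = LAYERS * sum(RANK * (i + o) for i, o in shapes.values())

# Base model: linear weights, embeddings + lm_head, RMSNorm weights.
linear_params = LAYERS * sum(i * o for i, o in shapes.values())
other_params = 2 * VOCAB * HIDDEN + (2 * LAYERS + 1) * HIDDEN

all_params = linear_params + other_params + lora_params
# Assumption: 4-bit weights packed two per uint8 element.
packed_params = linear_params // 2 + other_params + lora_params

print(f"trainable params: {lora_params:,}")
print(f"all params:       {all_params:,} ({100 * lora_params / all_params:.4f}%)")
print(f"packed total:     {packed_params:,} ({100 * lora_params / packed_params:.2f}%)")
```

Under these assumptions the two percentages in the log (0.2885% and 0.55%) describe the same adapter; only the way the quantized base weights are counted differs.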
[INFO|2025-03-09 01:23:59] trainer.py:698 >> Using auto half precision backend
[WARNING|2025-03-09 01:23:59] <string>:195 >> ==((====))==  Unsloth - 2x faster free finetuning | Num GPUs used = 1
   \\   /|    Num examples = 3,716 | Num Epochs = 3 | Total steps = 1,392
O^O/ \_/ \    Batch size per device = 4 | Gradient accumulation steps = 2
\        /    Data Parallel GPUs = 1 | Total batch size (4 x 2 x 1) = 8
 "-____-"     Trainable parameters = 20,971,520/3,779,334,144 (0.55% trained)
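Note: the step count in the banner above follows from the dataset size, batch size and gradient accumulation. The sketch below shows one way to arrive at 1,392. It assumes the last partial batch is kept and that update steps per epoch are rounded down, which is how the Hugging Face Trainer of this era is understood to count them; the helper name `epoch_at` is hypothetical.

```python
import math

num_examples = 3716      # "Num examples"
per_device_batch = 4     # "Batch size per device"
grad_accum = 2           # "Gradient accumulation steps"
num_gpus = 1             # "Data Parallel GPUs"
epochs = 3               # "Num Epochs"

total_batch = per_device_batch * grad_accum * num_gpus

# Assumption: the final partial batch is kept (no drop_last).
batches_per_epoch = math.ceil(num_examples / (per_device_batch * num_gpus))
# Assumption: leftover micro-batches do not form an extra update step.
updates_per_epoch = max(batches_per_epoch // grad_accum, 1)
total_steps = math.ceil(epochs * updates_per_epoch)


def epoch_at(step: int) -> float:
    """Fractional epoch reached after `step` optimizer updates (hypothetical helper)."""
    return round(step / updates_per_epoch, 2)


print(f"total batch size:  {total_batch}")
print(f"updates per epoch: {updates_per_epoch}")
print(f"total steps:       {total_steps}")
print(f"epoch at step 10:  {epoch_at(10)}")
```

A naive estimate of ceil(3716 / 8) * 3 gives 1,395, so the logged 1,392 suggests the rounded-down count is the one in use.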
[WARNING|2025-03-09 01:24:00] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:24:10] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:24:11] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:24:13] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:24:17] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:24:18] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:24:19] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:24:20] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:24:21] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:24:23] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:24:24] logging.py:157 >> {'loss': 0.2640, 'learning_rate': 2.9996e-05, 'epoch': 0.02}
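Note: the learning rate in the metrics line above is consistent with a cosine decay over the 1,392 total steps. The sketch below is a reconstruction under assumptions, not the scheduler code used by the run: the peak learning rate of 3e-5, the absence of warmup, and a metrics line every 10 optimizer steps are all inferred from the logged values rather than stated in the log.

```python
import math

PEAK_LR = 3e-5       # assumption: inferred from the logged values
TOTAL_STEPS = 1392   # "Total steps" from the Unsloth banner
WARMUP_STEPS = 0     # assumption: no warmup
LOG_EVERY = 10       # assumption: one metrics line per 10 optimizer steps


def cosine_lr(step: int) -> float:
    """Learning rate after `step` optimizer updates under plain cosine decay."""
    progress = (step - WARMUP_STEPS) / (TOTAL_STEPS - WARMUP_STEPS)
    return PEAK_LR * 0.5 * (1.0 + math.cos(math.pi * progress))


# Print the schedule at the first few logging points for comparison with the log.
for step in range(LOG_EVERY, 6 * LOG_EVERY, LOG_EVERY):
    print(f"step {step:>3}: learning_rate = {cosine_lr(step):.4e}")
```

At step 10 this gives 2.9996e-05, the value logged above; comparing later logging points against the log is a quick way to confirm or reject the assumed schedule.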
[WARNING|2025-03-09 01:24:24] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:24:25] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:24:26] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:24:27] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:24:28] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:24:30] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:24:31] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:24:32] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:24:33] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:24:34] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:24:35] logging.py:157 >> {'loss': 0.1028, 'learning_rate': 2.9985e-05, 'epoch': 0.04}
[WARNING|2025-03-09 01:24:35] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:24:36] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:24:38] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:24:39] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:24:40] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:24:41] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:24:42] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:24:43] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:24:45] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:24:46] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:24:47] logging.py:157 >> {'loss': 0.1140, 'learning_rate': 2.9966e-05, 'epoch': 0.06}
[WARNING|2025-03-09 01:24:47] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:24:49] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:24:50] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:24:51] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:24:52] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:24:53] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:24:55] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:24:56] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:24:57] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:24:58] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:24:59] logging.py:157 >> {'loss': 0.0948, 'learning_rate': 2.9939e-05, 'epoch': 0.09}
[WARNING|2025-03-09 01:24:59] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:25:01] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:25:02] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:25:03] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:25:04] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:25:05] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:25:07] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:25:08] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:25:09] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:25:10] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:25:11] logging.py:157 >> {'loss': 0.1096, 'learning_rate': 2.9905e-05, 'epoch': 0.11}
[WARNING|2025-03-09 01:25:11] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:25:12] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:25:14] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:25:15] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:25:16] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:25:17] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:25:18] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:25:20] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:25:21] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:25:22] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:25:23] logging.py:157 >> {'loss': 0.0956, 'learning_rate': 2.9863e-05, 'epoch': 0.13}
[WARNING|2025-03-09 01:25:23] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:25:25] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:25:26] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:25:27] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:25:28] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:25:29] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:25:30] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:25:32] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:25:33] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:25:34] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:25:35] logging.py:157 >> {'loss': 0.1017, 'learning_rate': 2.9813e-05, 'epoch': 0.15}
[WARNING|2025-03-09 01:25:35] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:25:36] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:25:38] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:25:39] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:25:40] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:25:41] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:25:42] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:25:44] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:25:45] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:25:46] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:25:47] logging.py:157 >> {'loss': 0.0694, 'learning_rate': 2.9756e-05, 'epoch': 0.17}
[WARNING|2025-03-09 01:25:47] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:25:48] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:25:49] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:25:51] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:25:52] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:25:53] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:25:54] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:25:55] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:25:57] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:25:58] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:25:59] logging.py:157 >> {'loss': 0.0690, 'learning_rate': 2.9692e-05, 'epoch': 0.19}
[WARNING|2025-03-09 01:25:59] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:26:00] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:26:01] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:26:03] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:26:04] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:26:05] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:26:06] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:26:07] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:26:09] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:26:10] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:26:11] logging.py:157 >> {'loss': 0.0861, 'learning_rate': 2.9620e-05, 'epoch': 0.22}
[WARNING|2025-03-09 01:26:11] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:26:12] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:26:13] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:26:15] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:26:16] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:26:17] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:26:18] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:26:19] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:26:20] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:26:22] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:26:23] logging.py:157 >> {'loss': 0.0829, 'learning_rate': 2.9540e-05, 'epoch': 0.24}
[WARNING|2025-03-09 01:26:23] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:26:24] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:26:25] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:26:27] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:26:28] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:26:29] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:26:30] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:26:31] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:26:33] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:26:34] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:26:35] logging.py:157 >> {'loss': 0.0811, 'learning_rate': 2.9453e-05, 'epoch': 0.26}
[WARNING|2025-03-09 01:26:35] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:26:36] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:26:38] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:26:39] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:26:40] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:26:41] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:26:43] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:26:44] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:26:45] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:26:46] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:26:47] logging.py:157 >> {'loss': 0.0762, 'learning_rate': 2.9359e-05, 'epoch': 0.28}
[WARNING|2025-03-09 01:26:47] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:26:48] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:26:50] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:26:51] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:26:52] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:26:53] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:26:54] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:26:56] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:26:57] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:26:58] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:26:59] logging.py:157 >> {'loss': 0.0835, 'learning_rate': 2.9257e-05, 'epoch': 0.30}
[WARNING|2025-03-09 01:26:59] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:27:00] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:27:02] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:27:03] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:27:04] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:27:05] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:27:06] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:27:08] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:27:09] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:27:10] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:27:11] logging.py:157 >> {'loss': 0.0740, 'learning_rate': 2.9149e-05, 'epoch': 0.32}
[WARNING|2025-03-09 01:27:11] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:27:13] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:27:14] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:27:15] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:27:16] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:27:17] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:27:19] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:27:20] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:27:21] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:27:22] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:27:23] logging.py:157 >> {'loss': 0.0674, 'learning_rate': 2.9033e-05, 'epoch': 0.34}
[WARNING|2025-03-09 01:27:23] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:27:25] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:27:26] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:27:27] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:27:28] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:27:29] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:27:30] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:27:32] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:27:33] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:27:34] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:27:35] logging.py:157 >> {'loss': 0.0934, 'learning_rate': 2.8909e-05, 'epoch': 0.37}
[WARNING|2025-03-09 01:27:35] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:27:36] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:27:38] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:27:39] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:27:40] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:27:41] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:27:42] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:27:44] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:27:45] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:27:46] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:27:47] logging.py:157 >> {'loss': 0.0759, 'learning_rate': 2.8779e-05, 'epoch': 0.39}
[WARNING|2025-03-09 01:27:47] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:27:48] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:27:49] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:27:51] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:27:52] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:27:53] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:27:54] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:27:56] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:27:57] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:27:58] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:27:59] logging.py:157 >> {'loss': 0.0725, 'learning_rate': 2.8642e-05, 'epoch': 0.41}
[WARNING|2025-03-09 01:27:59] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:28:00] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:28:01] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:28:03] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:28:04] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:28:05] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:28:06] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:28:08] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:28:09] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:28:10] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:28:11] logging.py:157 >> {'loss': 0.0542, 'learning_rate': 2.8498e-05, 'epoch': 0.43}
[WARNING|2025-03-09 01:28:11] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:28:12] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:28:14] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:28:15] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:28:16] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:28:17] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:28:18] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:28:20] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:28:21] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:28:22] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:28:23] logging.py:157 >> {'loss': 0.0837, 'learning_rate': 2.8347e-05, 'epoch': 0.45}
[WARNING|2025-03-09 01:28:23] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:28:24] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:28:26] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:28:27] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:28:28] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:28:29] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:28:31] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:28:32] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:28:33] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:28:34] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:28:35] logging.py:157 >> {'loss': 0.0717, 'learning_rate': 2.8189e-05, 'epoch': 0.47}
[WARNING|2025-03-09 01:28:35] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:28:37] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:28:38] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:28:39] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:28:40] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:28:41] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:28:43] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:28:44] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:28:45] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:28:46] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:28:48] logging.py:157 >> {'loss': 0.0577, 'learning_rate': 2.8024e-05, 'epoch': 0.50}
[WARNING|2025-03-09 01:28:48] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:28:49] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:28:50] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:28:51] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:28:52] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:28:54] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:28:55] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:28:56] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:28:58] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:28:59] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:29:00] logging.py:157 >> {'loss': 0.0705, 'learning_rate': 2.7853e-05, 'epoch': 0.52}
[WARNING|2025-03-09 01:29:00] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:29:01] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:29:02] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:29:04] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:29:05] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:29:06] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:29:07] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:29:09] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:29:10] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:29:11] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:29:12] logging.py:157 >> {'loss': 0.0716, 'learning_rate': 2.7675e-05, 'epoch': 0.54}
[WARNING|2025-03-09 01:29:12] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:29:14] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:29:15] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:29:16] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:29:17] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:29:19] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:29:20] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:29:21] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:29:23] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:29:24] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:29:25] logging.py:157 >> {'loss': 0.0800, 'learning_rate': 2.7491e-05, 'epoch': 0.56}
[WARNING|2025-03-09 01:29:25] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:29:26] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:29:28] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:29:29] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:29:30] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:29:31] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:29:32] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:29:34] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:29:35] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:29:36] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:29:37] logging.py:157 >> {'loss': 0.0741, 'learning_rate': 2.7300e-05, 'epoch': 0.58}
[WARNING|2025-03-09 01:29:37] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:29:38] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:29:40] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:29:41] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:29:42] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:29:43] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:29:45] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:29:46] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:29:47] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:29:48] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:29:50] logging.py:157 >> {'loss': 0.0631, 'learning_rate': 2.7103e-05, 'epoch': 0.60}
[WARNING|2025-03-09 01:29:50] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:29:51] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:29:52] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:29:53] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:29:54] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:29:56] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:29:57] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:29:58] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:29:59] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:30:01] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:30:02] logging.py:157 >> {'loss': 0.0802, 'learning_rate': 2.6900e-05, 'epoch': 0.62}
[WARNING|2025-03-09 01:30:02] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:30:03] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:30:04] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:30:05] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:30:07] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:30:08] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:30:09] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:30:10] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:30:12] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:30:13] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:30:14] logging.py:157 >> {'loss': 0.0647, 'learning_rate': 2.6691e-05, 'epoch': 0.65}
[WARNING|2025-03-09 01:30:14] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:30:15] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:30:16] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:30:18] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:30:19] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:30:20] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:30:21] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:30:22] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:30:24] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:30:25] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:30:26] logging.py:157 >> {'loss': 0.0626, 'learning_rate': 2.6476e-05, 'epoch': 0.67}
[WARNING|2025-03-09 01:30:26] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:30:27] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:30:29] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:30:30] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:30:31] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:30:32] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:30:34] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:30:35] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:30:36] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:30:37] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:30:39] logging.py:157 >> {'loss': 0.0717, 'learning_rate': 2.6255e-05, 'epoch': 0.69}
[WARNING|2025-03-09 01:30:39] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:30:40] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:30:41] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:30:43] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:30:44] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:30:45] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:30:46] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:30:48] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:30:49] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:30:50] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:30:51] logging.py:157 >> {'loss': 0.0617, 'learning_rate': 2.6029e-05, 'epoch': 0.71}
[WARNING|2025-03-09 01:30:51] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:30:52] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:30:54] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:30:55] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:30:56] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:30:57] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:30:58] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:31:00] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:31:01] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:31:02] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:31:03] logging.py:157 >> {'loss': 0.0672, 'learning_rate': 2.5796e-05, 'epoch': 0.73}
[WARNING|2025-03-09 01:31:03] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:31:05] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:31:06] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:31:07] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:31:08] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:31:09] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:31:11] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:31:12] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:31:13] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:31:15] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:31:16] logging.py:157 >> {'loss': 0.0702, 'learning_rate': 2.5559e-05, 'epoch': 0.75}
[WARNING|2025-03-09 01:31:16] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:31:17] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:31:18] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:31:19] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:31:20] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:31:22] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:31:23] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:31:24] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:31:25] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:31:26] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:31:28] logging.py:157 >> {'loss': 0.0697, 'learning_rate': 2.5315e-05, 'epoch': 0.78}
[WARNING|2025-03-09 01:31:28] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:31:29] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:31:30] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:31:32] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:31:33] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:31:34] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:31:35] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:31:36] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:31:38] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:31:39] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:31:40] logging.py:157 >> {'loss': 0.0740, 'learning_rate': 2.5067e-05, 'epoch': 0.80}
[WARNING|2025-03-09 01:31:40] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:31:41] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:31:42] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:31:44] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:31:45] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:31:46] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:31:47] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:31:48] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:31:50] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:31:51] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:31:52] logging.py:157 >> {'loss': 0.0566, 'learning_rate': 2.4814e-05, 'epoch': 0.82}
[WARNING|2025-03-09 01:31:52] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:31:53] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:31:54] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:31:56] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:31:57] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:31:58] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:31:59] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:32:00] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:32:02] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:32:03] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:32:04] logging.py:157 >> {'loss': 0.0792, 'learning_rate': 2.4555e-05, 'epoch': 0.84}
[WARNING|2025-03-09 01:32:04] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:32:05] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:32:06] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:32:07] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:32:09] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:32:10] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:32:11] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:32:12] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:32:13] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:32:14] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:32:16] logging.py:157 >> {'loss': 0.0445, 'learning_rate': 2.4292e-05, 'epoch': 0.86}
[WARNING|2025-03-09 01:32:16] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:32:17] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:32:18] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:32:19] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:32:20] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:32:22] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:32:23] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:32:24] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:32:25] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:32:26] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:32:28] logging.py:157 >> {'loss': 0.0553, 'learning_rate': 2.4024e-05, 'epoch': 0.88}
[WARNING|2025-03-09 01:32:28] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:32:29] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:32:30] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:32:31] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:32:32] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:32:34] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:32:35] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:32:36] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:32:37] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:32:38] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:32:40] logging.py:157 >> {'loss': 0.0555, 'learning_rate': 2.3751e-05, 'epoch': 0.90}
[WARNING|2025-03-09 01:32:40] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:32:41] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:32:42] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:32:43] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:32:44] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:32:46] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:32:47] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:32:48] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:32:49] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:32:51] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:32:52] logging.py:157 >> {'loss': 0.0611, 'learning_rate': 2.3474e-05, 'epoch': 0.93}
[WARNING|2025-03-09 01:32:52] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:32:53] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:32:54] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:32:55] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:32:56] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:32:57] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:32:59] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:33:00] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:33:01] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:33:02] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:33:03] logging.py:157 >> {'loss': 0.0571, 'learning_rate': 2.3192e-05, 'epoch': 0.95}
[WARNING|2025-03-09 01:33:03] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:33:04] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:33:06] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:33:07] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:33:08] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:33:09] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:33:11] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:33:12] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:33:13] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:33:14] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:33:15] logging.py:157 >> {'loss': 0.0635, 'learning_rate': 2.2907e-05, 'epoch': 0.97}
[WARNING|2025-03-09 01:33:15] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:33:17] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:33:18] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:33:19] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:33:20] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:33:21] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:33:23] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:33:24] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:33:25] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:33:26] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:33:28] logging.py:157 >> {'loss': 0.0584, 'learning_rate': 2.2617e-05, 'epoch': 0.99}
[WARNING|2025-03-09 01:33:28] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:33:29] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:33:30] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:33:31] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:33:33] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:33:33] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:33:34] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:33:36] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:33:37] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:33:38] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:33:40] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:33:40] logging.py:157 >> {'loss': 0.0569, 'learning_rate': 2.2323e-05, 'epoch': 1.01}
[WARNING|2025-03-09 01:33:41] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:33:42] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:33:43] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:33:44] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:33:46] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:33:47] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:33:48] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:33:49] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:33:51] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:33:52] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:33:53] logging.py:157 >> {'loss': 0.0481, 'learning_rate': 2.2026e-05, 'epoch': 1.03}
[WARNING|2025-03-09 01:33:53] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:33:54] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:33:56] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:33:57] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:33:58] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:33:59] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:34:00] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:34:02] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:34:03] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:34:04] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:34:05] logging.py:157 >> {'loss': 0.0295, 'learning_rate': 2.1725e-05, 'epoch': 1.05}
[WARNING|2025-03-09 01:34:05] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:34:07] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:34:08] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:34:09] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:34:10] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:34:12] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:34:13] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:34:14] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:34:15] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:34:16] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:34:17] logging.py:157 >> {'loss': 0.0385, 'learning_rate': 2.1421e-05, 'epoch': 1.08}
[WARNING|2025-03-09 01:34:18] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:34:19] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:34:20] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:34:21] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:34:22] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:34:24] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:34:25] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:34:26] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:34:27] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:34:28] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:34:29] logging.py:157 >> {'loss': 0.0630, 'learning_rate': 2.1113e-05, 'epoch': 1.10}
[WARNING|2025-03-09 01:34:30] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:34:31] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:34:32] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:34:33] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:34:34] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:34:36] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:34:37] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:34:38] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:34:39] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:34:40] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:34:41] logging.py:157 >> {'loss': 0.0528, 'learning_rate': 2.0803e-05, 'epoch': 1.12}
[WARNING|2025-03-09 01:34:42] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:34:43] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:34:44] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:34:45] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:34:46] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:34:47] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:34:49] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:34:50] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:34:51] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:34:52] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:34:53] logging.py:157 >> {'loss': 0.0347, 'learning_rate': 2.0489e-05, 'epoch': 1.14}
[WARNING|2025-03-09 01:34:53] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:34:55] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:34:56] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:34:57] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:34:58] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:34:59] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:35:01] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:35:02] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:35:03] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:35:04] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:35:05] logging.py:157 >> {'loss': 0.0490, 'learning_rate': 2.0173e-05, 'epoch': 1.16}
[WARNING|2025-03-09 01:35:05] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:35:07] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:35:08] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:35:09] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:35:10] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:35:12] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:35:13] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:35:14] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:35:15] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:35:16] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:35:17] logging.py:157 >> {'loss': 0.0471, 'learning_rate': 1.9854e-05, 'epoch': 1.18}
[WARNING|2025-03-09 01:35:18] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:35:19] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:35:20] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:35:21] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:35:22] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:35:24] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:35:25] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:35:26] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:35:27] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:35:28] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:35:29] logging.py:157 >> {'loss': 0.0600, 'learning_rate': 1.9532e-05, 'epoch': 1.21}
[WARNING|2025-03-09 01:35:30] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:35:31] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:35:32] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:35:33] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:35:34] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:35:36] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:35:37] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:35:38] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:35:39] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:35:40] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:35:41] logging.py:157 >> {'loss': 0.0430, 'learning_rate': 1.9208e-05, 'epoch': 1.23}
[WARNING|2025-03-09 01:35:42] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:35:43] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:35:44] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:35:45] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:35:47] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:35:48] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:35:49] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:35:50] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:35:51] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:35:53] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:35:53] logging.py:157 >> {'loss': 0.0510, 'learning_rate': 1.8882e-05, 'epoch': 1.25}
[WARNING|2025-03-09 01:35:54] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:35:55] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:35:56] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:35:57] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:35:59] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:36:00] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:36:01] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:36:02] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:36:03] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:36:05] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:36:05] logging.py:157 >> {'loss': 0.0359, 'learning_rate': 1.8554e-05, 'epoch': 1.27}
[WARNING|2025-03-09 01:36:06] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:36:07] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:36:08] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:36:10] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:36:11] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:36:12] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:36:13] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:36:14] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:36:15] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:36:17] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:36:17] logging.py:157 >> {'loss': 0.0484, 'learning_rate': 1.8225e-05, 'epoch': 1.29}
[WARNING|2025-03-09 01:36:18] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:36:19] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:36:20] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:36:22] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:36:23] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:36:24] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:36:25] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:36:27] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:36:28] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:36:29] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:36:30] logging.py:157 >> {'loss': 0.0450, 'learning_rate': 1.7893e-05, 'epoch': 1.31}
[WARNING|2025-03-09 01:36:31] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:36:32] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:36:33] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:36:34] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:36:35] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:36:37] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:36:38] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:36:39] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:36:40] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:36:41] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:36:42] logging.py:157 >> {'loss': 0.0528, 'learning_rate': 1.7560e-05, 'epoch': 1.33}
[WARNING|2025-03-09 01:36:43] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:36:44] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:36:45] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:36:46] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:36:48] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:36:49] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:36:50] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:36:51] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:36:53] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:36:54] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:36:54] logging.py:157 >> {'loss': 0.0527, 'learning_rate': 1.7226e-05, 'epoch': 1.36}
[WARNING|2025-03-09 01:36:55] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:36:56] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:36:57] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:36:58] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:37:00] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:37:01] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:37:02] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:37:03] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:37:05] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:37:06] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:37:06] logging.py:157 >> {'loss': 0.0389, 'learning_rate': 1.6891e-05, 'epoch': 1.38}
[WARNING|2025-03-09 01:37:07] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:37:08] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:37:09] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:37:11] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:37:12] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:37:13] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:37:14] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:37:15] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:37:17] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:37:18] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:37:18] logging.py:157 >> {'loss': 0.0506, 'learning_rate': 1.6554e-05, 'epoch': 1.40}
[WARNING|2025-03-09 01:37:19] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:37:20] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:37:21] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:37:22] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:37:24] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:37:25] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:37:26] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:37:27] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:37:28] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:37:30] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:37:30] logging.py:157 >> {'loss': 0.0455, 'learning_rate': 1.6217e-05, 'epoch': 1.42}
[WARNING|2025-03-09 01:37:31] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:37:32] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:37:33] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:37:34] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:37:36] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:37:37] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:37:38] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:37:39] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:37:40] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:37:42] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:37:42] logging.py:157 >> {'loss': 0.0390, 'learning_rate': 1.5880e-05, 'epoch': 1.44}
[WARNING|2025-03-09 01:37:43] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:37:44] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:37:45] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:37:46] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:37:48] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:37:49] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:37:50] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:37:51] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:37:53] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:37:54] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:37:54] logging.py:157 >> {'loss': 0.0512, 'learning_rate': 1.5542e-05, 'epoch': 1.46}
[WARNING|2025-03-09 01:37:55] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:37:56] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:37:57] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:37:59] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:38:00] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:38:01] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:38:02] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:38:03] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:38:05] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:38:06] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:38:06] logging.py:157 >> {'loss': 0.0263, 'learning_rate': 1.5203e-05, 'epoch': 1.49}
[WARNING|2025-03-09 01:38:07] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:38:08] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:38:09] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:38:11] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:38:12] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:38:13] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:38:14] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:38:15] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:38:16] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:38:18] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:38:18] logging.py:157 >> {'loss': 0.0481, 'learning_rate': 1.4865e-05, 'epoch': 1.51}
[WARNING|2025-03-09 01:38:19] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:38:20] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:38:21] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:38:22] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:38:24] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:38:25] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:38:26] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:38:27] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:38:28] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:38:30] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:38:30] logging.py:157 >> {'loss': 0.0417, 'learning_rate': 1.4526e-05, 'epoch': 1.53}
[WARNING|2025-03-09 01:38:31] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:38:32] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:38:33] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:38:34] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:38:36] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:38:37] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:38:38] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:38:39] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:38:40] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:38:42] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:38:42] logging.py:157 >> {'loss': 0.0486, 'learning_rate': 1.4188e-05, 'epoch': 1.55}
[WARNING|2025-03-09 01:38:43] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:38:44] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:38:45] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:38:46] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:38:48] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:38:49] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:38:50] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:38:51] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:38:52] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:38:53] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:38:54] logging.py:157 >> {'loss': 0.0249, 'learning_rate': 1.3850e-05, 'epoch': 1.57}
[WARNING|2025-03-09 01:38:55] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:38:56] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:38:57] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:38:58] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:38:59] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:39:01] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:39:02] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:39:03] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:39:04] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:39:05] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:39:06] logging.py:157 >> {'loss': 0.0420, 'learning_rate': 1.3513e-05, 'epoch': 1.59}
[WARNING|2025-03-09 01:39:07] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:39:08] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:39:09] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:39:10] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:39:12] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:39:13] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:39:14] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:39:15] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:39:17] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:39:18] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:39:18] logging.py:157 >> {'loss': 0.0592, 'learning_rate': 1.3176e-05, 'epoch': 1.61}
[WARNING|2025-03-09 01:39:19] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:39:20] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:39:21] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:39:23] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:39:24] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:39:25] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:39:26] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:39:28] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:39:29] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:39:30] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:39:31] logging.py:157 >> {'loss': 0.0405, 'learning_rate': 1.2841e-05, 'epoch': 1.64}
[WARNING|2025-03-09 01:39:31] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:39:33] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:39:34] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:39:35] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:39:36] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:39:38] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:39:39] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:39:40] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:39:41] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:39:43] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:39:43] logging.py:157 >> {'loss': 0.0508, 'learning_rate': 1.2506e-05, 'epoch': 1.66}
[WARNING|2025-03-09 01:39:44] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:39:45] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:39:46] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:39:48] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:39:49] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:39:50] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:39:51] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:39:52] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:39:54] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:39:55] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:39:56] logging.py:157 >> {'loss': 0.0342, 'learning_rate': 1.2173e-05, 'epoch': 1.68}
[WARNING|2025-03-09 01:39:56] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:39:57] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:39:58] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:40:00] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:40:01] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:40:02] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:40:03] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:40:04] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:40:05] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:40:06] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:40:07] logging.py:157 >> {'loss': 0.0460, 'learning_rate': 1.1842e-05, 'epoch': 1.70}
[WARNING|2025-03-09 01:40:08] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:40:09] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:40:10] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:40:11] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:40:12] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:40:14] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:40:15] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:40:16] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:40:17] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:40:18] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:40:19] logging.py:157 >> {'loss': 0.0348, 'learning_rate': 1.1511e-05, 'epoch': 1.72}
[WARNING|2025-03-09 01:40:20] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:40:21] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:40:22] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:40:23] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:40:24] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:40:26] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:40:27] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:40:28] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:40:29] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:40:30] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:40:31] logging.py:157 >> {'loss': 0.0455, 'learning_rate': 1.1183e-05, 'epoch': 1.74}
[WARNING|2025-03-09 01:40:32] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:40:33] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:40:34] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:40:35] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:40:37] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:40:38] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:40:39] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:40:40] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:40:41] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:40:42] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:40:43] logging.py:157 >> {'loss': 0.0542, 'learning_rate': 1.0857e-05, 'epoch': 1.77}
[WARNING|2025-03-09 01:40:44] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:40:45] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:40:46] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:40:47] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:40:48] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:40:50] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:40:51] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:40:52] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:40:53] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:40:54] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:40:55] logging.py:157 >> {'loss': 0.0323, 'learning_rate': 1.0532e-05, 'epoch': 1.79}
[WARNING|2025-03-09 01:40:56] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:40:57] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:40:58] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:40:59] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:41:00] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:41:02] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:41:03] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:41:04] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:41:05] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:41:07] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:41:07] logging.py:157 >> {'loss': 0.0456, 'learning_rate': 1.0210e-05, 'epoch': 1.81}
[WARNING|2025-03-09 01:41:08] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:41:09] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:41:10] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:41:11] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:41:13] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:41:14] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:41:15] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:41:16] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:41:17] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:41:18] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:41:19] logging.py:157 >> {'loss': 0.0370, 'learning_rate': 9.8909e-06, 'epoch': 1.83}
[WARNING|2025-03-09 01:41:20] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:41:21] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:41:22] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:41:23] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:41:24] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:41:26] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:41:27] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:41:28] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:41:29] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:41:31] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:41:31] logging.py:157 >> {'loss': 0.0452, 'learning_rate': 9.5739e-06, 'epoch': 1.85}
[WARNING|2025-03-09 01:41:32] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:41:33] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:41:34] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:41:35] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:41:37] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:41:38] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:41:39] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:41:40] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:41:41] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:41:43] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:41:43] logging.py:157 >> {'loss': 0.0536, 'learning_rate': 9.2597e-06, 'epoch': 1.87}
[WARNING|2025-03-09 01:41:44] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:41:45] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:41:46] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:41:48] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:41:49] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:41:50] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:41:51] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:41:52] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:41:54] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:41:55] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:41:55] logging.py:157 >> {'loss': 0.0553, 'learning_rate': 8.9485e-06, 'epoch': 1.89}
[WARNING|2025-03-09 01:41:56] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:41:57] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:41:58] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:42:00] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:42:01] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:42:02] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:42:03] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:42:04] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:42:05] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:42:07] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:42:07] logging.py:157 >> {'loss': 0.0479, 'learning_rate': 8.6403e-06, 'epoch': 1.92}
[WARNING|2025-03-09 01:42:08] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:42:09] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:42:10] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:42:12] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:42:13] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:42:14] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:42:15] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:42:16] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:42:18] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:42:19] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:42:19] logging.py:157 >> {'loss': 0.0438, 'learning_rate': 8.3353e-06, 'epoch': 1.94}
[WARNING|2025-03-09 01:42:20] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:42:21] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:42:23] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:42:24] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:42:25] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:42:26] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:42:27] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:42:29] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:42:30] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:42:31] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:42:32] logging.py:157 >> {'loss': 0.0280, 'learning_rate': 8.0338e-06, 'epoch': 1.96}
[WARNING|2025-03-09 01:42:32] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:42:34] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:42:35] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:42:36] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:42:37] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:42:39] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:42:40] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:42:41] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:42:42] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:42:44] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:42:44] logging.py:157 >> {'loss': 0.0574, 'learning_rate': 7.7358e-06, 'epoch': 1.98}
[WARNING|2025-03-09 01:42:45] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:42:46] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:42:47] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:42:49] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:42:50] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:42:51] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:42:52] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:42:53] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:42:55] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:42:55] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:42:56] logging.py:157 >> {'loss': 0.0255, 'learning_rate': 7.4414e-06, 'epoch': 2.00}
[WARNING|2025-03-09 01:42:56] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:42:58] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:42:59] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:43:00] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:43:01] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:43:02] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:43:04] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:43:05] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:43:06] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:43:07] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:43:08] logging.py:157 >> {'loss': 0.0280, 'learning_rate': 7.1510e-06, 'epoch': 2.02}
[WARNING|2025-03-09 01:43:08] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:43:09] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:43:11] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:43:12] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:43:13] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:43:14] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:43:16] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:43:17] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:43:18] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:43:19] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:43:20] logging.py:157 >> {'loss': 0.0178, 'learning_rate': 6.8645e-06, 'epoch': 2.05}
[WARNING|2025-03-09 01:43:20] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:43:21] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:43:23] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:43:24] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:43:25] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:43:26] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:43:27] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:43:29] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:43:30] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:43:31] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:43:32] logging.py:157 >> {'loss': 0.0197, 'learning_rate': 6.5822e-06, 'epoch': 2.07}
[WARNING|2025-03-09 01:43:32] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:43:33] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:43:34] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:43:36] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:43:37] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:43:38] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:43:39] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:43:40] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:43:42] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:43:43] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:43:44] logging.py:157 >> {'loss': 0.0300, 'learning_rate': 6.3042e-06, 'epoch': 2.09}
[WARNING|2025-03-09 01:43:44] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:43:45] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:43:46] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:43:48] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:43:49] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:43:50] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:43:51] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:43:52] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:43:54] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:43:55] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:43:56] logging.py:157 >> {'loss': 0.0309, 'learning_rate': 6.0306e-06, 'epoch': 2.11}
[WARNING|2025-03-09 01:43:56] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:43:57] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:43:58] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:44:00] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:44:01] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:44:02] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:44:03] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:44:04] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:44:06] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:44:07] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:44:08] logging.py:157 >> {'loss': 0.0262, 'learning_rate': 5.7615e-06, 'epoch': 2.13}
[WARNING|2025-03-09 01:44:08] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:44:09] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:44:10] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:44:12] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:44:13] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:44:14] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:44:15] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:44:16] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:44:18] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:44:19] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:44:20] logging.py:157 >> {'loss': 0.0337, 'learning_rate': 5.4972e-06, 'epoch': 2.15}
[INFO|2025-03-09 01:44:20] trainer.py:3801 >> Saving model checkpoint to saves/Mistral-7B-Instruct-v0.3/lora/treino_novo_mistral/checkpoint-1000
[INFO|2025-03-09 01:44:20] configuration_utils.py:679 >> loading configuration file config.json from cache at /home/zeus/.cache/huggingface/hub/models--unsloth--mistral-7b-instruct-v0.3-bnb-4bit/snapshots/d5f623888f1415cf89b5c208d09cb620694618ee/config.json
[INFO|2025-03-09 01:44:20] configuration_utils.py:746 >> Model config MistralConfig {
"_name_or_path": "unsloth/Mistral-7B-Instruct-v0.3",
"architectures": [
"MistralForCausalLM"
],
"attention_dropout": 0.0,
"bos_token_id": 1,
"eos_token_id": 2,
"head_dim": 128,
"hidden_act": "silu",
"hidden_size": 4096,
"initializer_range": 0.02,
"intermediate_size": 14336,
"max_position_embeddings": 32768,
"model_type": "mistral",
"num_attention_heads": 32,
"num_hidden_layers": 32,
"num_key_value_heads": 8,
"pad_token_id": 770,
"quantization_config": {
"_load_in_4bit": true,
"_load_in_8bit": false,
"bnb_4bit_compute_dtype": "bfloat16",
"bnb_4bit_quant_storage": "uint8",
"bnb_4bit_quant_type": "nf4",
"bnb_4bit_use_double_quant": true,
"llm_int8_enable_fp32_cpu_offload": false,
"llm_int8_has_fp16_weight": false,
"llm_int8_skip_modules": null,
"llm_int8_threshold": 6.0,
"load_in_4bit": true,
"load_in_8bit": false,
"quant_method": "bitsandbytes"
},
"rms_norm_eps": 1e-05,
"rope_theta": 1000000.0,
"sliding_window": null,
"tie_word_embeddings": false,
"torch_dtype": "bfloat16",
"transformers_version": "4.46.1",
"unsloth_version": "2024.9",
"use_cache": true,
"vocab_size": 32768
}
[WARNING|2025-03-09 01:44:21] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:44:22] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:44:23] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:44:24] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:44:25] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:44:27] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:44:28] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:44:29] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:44:30] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:44:32] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:44:33] logging.py:157 >> {'loss': 0.0365, 'learning_rate': 5.2377e-06, 'epoch': 2.17}
[WARNING|2025-03-09 01:44:33] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:44:34] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:44:35] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:44:36] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:44:38] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:44:39] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:44:40] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:44:41] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:44:42] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:44:44] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:44:45] logging.py:157 >> {'loss': 0.0228, 'learning_rate': 4.9832e-06, 'epoch': 2.20}
[WARNING|2025-03-09 01:44:45] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:44:46] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:44:47] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:44:48] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:44:50] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:44:51] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:44:52] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:44:53] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:44:55] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:44:56] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:44:57] logging.py:157 >> {'loss': 0.0278, 'learning_rate': 4.7338e-06, 'epoch': 2.22}
[WARNING|2025-03-09 01:44:57] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:44:58] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:45:00] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:45:01] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:45:02] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:45:03] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:45:04] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:45:06] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:45:07] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:45:08] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:45:09] logging.py:157 >> {'loss': 0.0302, 'learning_rate': 4.4896e-06, 'epoch': 2.24}
[WARNING|2025-03-09 01:45:09] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:45:11] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:45:12] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:45:13] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:45:14] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:45:15] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:45:17] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:45:18] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:45:19] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:45:20] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:45:22] logging.py:157 >> {'loss': 0.0138, 'learning_rate': 4.2507e-06, 'epoch': 2.26}
[WARNING|2025-03-09 01:45:22] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:45:23] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:45:24] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:45:25] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:45:27] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:45:28] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:45:29] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:45:30] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:45:32] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:45:33] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:45:34] logging.py:157 >> {'loss': 0.0218, 'learning_rate': 4.0174e-06, 'epoch': 2.28}
[WARNING|2025-03-09 01:45:34] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:45:35] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:45:36] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:45:38] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:45:39] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:45:40] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:45:41] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:45:43] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:45:44] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:45:45] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:45:46] logging.py:157 >> {'loss': 0.0284, 'learning_rate': 3.7896e-06, 'epoch': 2.30}
[WARNING|2025-03-09 01:45:46] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:45:48] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:45:49] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:45:50] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:45:51] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:45:52] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:45:53] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:45:55] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:45:56] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:45:57] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:45:58] logging.py:157 >> {'loss': 0.0169, 'learning_rate': 3.5676e-06, 'epoch': 2.33}
[WARNING|2025-03-09 01:45:58] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:46:00] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:46:01] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:46:02] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:46:03] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:46:05] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:46:06] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:46:07] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:46:08] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:46:10] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:46:11] logging.py:157 >> {'loss': 0.0145, 'learning_rate': 3.3513e-06, 'epoch': 2.35}
[WARNING|2025-03-09 01:46:11] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:46:12] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:46:13] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:46:15] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:46:16] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:46:17] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:46:18] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:46:19] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:46:21] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:46:22] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:46:23] logging.py:157 >> {'loss': 0.0249, 'learning_rate': 3.1410e-06, 'epoch': 2.37}
[WARNING|2025-03-09 01:46:23] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:46:24] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:46:25] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:46:27] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:46:28] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:46:29] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:46:30] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:46:32] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:46:33] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:46:34] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:46:35] logging.py:157 >> {'loss': 0.0296, 'learning_rate': 2.9368e-06, 'epoch': 2.39}
[WARNING|2025-03-09 01:46:35] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:46:37] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:46:38] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:46:39] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:46:40] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:46:41] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:46:43] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:46:44] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:46:45] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:46:47] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:46:48] logging.py:157 >> {'loss': 0.0310, 'learning_rate': 2.7387e-06, 'epoch': 2.41}
[WARNING|2025-03-09 01:46:48] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:46:49] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:46:50] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:46:52] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:46:53] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:46:54] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:46:55] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:46:57] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:46:58] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:46:59] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:47:00] logging.py:157 >> {'loss': 0.0294, 'learning_rate': 2.5468e-06, 'epoch': 2.43}
[WARNING|2025-03-09 01:47:00] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:47:02] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:47:03] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:47:04] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:47:05] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:47:07] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:47:08] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:47:09] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:47:10] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:47:12] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:47:13] logging.py:157 >> {'loss': 0.0352, 'learning_rate': 2.3613e-06, 'epoch': 2.45}
[WARNING|2025-03-09 01:47:13] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:47:14] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:47:15] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:47:16] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:47:18] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:47:19] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:47:20] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:47:21] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:47:23] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:47:24] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:47:25] logging.py:157 >> {'loss': 0.0177, 'learning_rate': 2.1822e-06, 'epoch': 2.48}
[WARNING|2025-03-09 01:47:25] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:47:26] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:47:27] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:47:29] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:47:30] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:47:31] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:47:32] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:47:34] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:47:35] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:47:36] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:47:37] logging.py:157 >> {'loss': 0.0320, 'learning_rate': 2.0096e-06, 'epoch': 2.50}
[WARNING|2025-03-09 01:47:37] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:47:39] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:47:40] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:47:41] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:47:42] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:47:43] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:47:45] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:47:46] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:47:47] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:47:48] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:47:49] logging.py:157 >> {'loss': 0.0244, 'learning_rate': 1.8437e-06, 'epoch': 2.52}
[WARNING|2025-03-09 01:47:49] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:47:51] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:47:52] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:47:53] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:47:54] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:47:56] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:47:57] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:47:58] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:47:59] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:48:00] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:48:02] logging.py:157 >> {'loss': 0.0126, 'learning_rate': 1.6844e-06, 'epoch': 2.54}
[WARNING|2025-03-09 01:48:02] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:48:03] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:48:04] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:48:05] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:48:07] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:48:08] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:48:09] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:48:10] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:48:12] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:48:13] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:48:14] logging.py:157 >> {'loss': 0.0220, 'learning_rate': 1.5320e-06, 'epoch': 2.56}
[WARNING|2025-03-09 01:48:14] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:48:15] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:48:17] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:48:18] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:48:19] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:48:20] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:48:22] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:48:23] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:48:24] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:48:25] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:48:27] logging.py:157 >> {'loss': 0.0181, 'learning_rate': 1.3864e-06, 'epoch': 2.58}
[WARNING|2025-03-09 01:48:27] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:48:28] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:48:29] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:48:30] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:48:31] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:48:33] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:48:34] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:48:35] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:48:37] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:48:38] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:48:39] logging.py:157 >> {'loss': 0.0241, 'learning_rate': 1.2477e-06, 'epoch': 2.60}
[WARNING|2025-03-09 01:48:39] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:48:40] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:48:41] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:48:43] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:48:44] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:48:45] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:48:46] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:48:47] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:48:49] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:48:50] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:48:51] logging.py:157 >> {'loss': 0.0292, 'learning_rate': 1.1160e-06, 'epoch': 2.63}
[WARNING|2025-03-09 01:48:51] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:48:52] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:48:53] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:48:55] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:48:56] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:48:57] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:48:58] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:49:00] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:49:01] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:49:02] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:49:03] logging.py:157 >> {'loss': 0.0201, 'learning_rate': 9.9145e-07, 'epoch': 2.65}
[WARNING|2025-03-09 01:49:03] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:49:04] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:49:06] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:49:07] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:49:08] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:49:09] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:49:11] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:49:12] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:49:13] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:49:15] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:49:16] logging.py:157 >> {'loss': 0.0313, 'learning_rate': 8.7399e-07, 'epoch': 2.67}
[WARNING|2025-03-09 01:49:16] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:49:17] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:49:18] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:49:19] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:49:21] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:49:22] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:49:23] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:49:24] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:49:25] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:49:27] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:49:28] logging.py:157 >> {'loss': 0.0311, 'learning_rate': 7.6373e-07, 'epoch': 2.69}
[WARNING|2025-03-09 01:49:28] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:49:29] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:49:30] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:49:32] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:49:33] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:49:34] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:49:35] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:49:36] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:49:38] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:49:39] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:49:40] logging.py:157 >> {'loss': 0.0134, 'learning_rate': 6.6072e-07, 'epoch': 2.71}
[WARNING|2025-03-09 01:49:40] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:49:41] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:49:42] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:49:44] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:49:45] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:49:46] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:49:48] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:49:49] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:49:50] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:49:51] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:49:53] logging.py:157 >> {'loss': 0.0348, 'learning_rate': 5.6501e-07, 'epoch': 2.73}
[WARNING|2025-03-09 01:49:53] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:49:54] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:49:55] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:49:56] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:49:57] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:49:59] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:50:00] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:50:01] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:50:02] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:50:04] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:50:05] logging.py:157 >> {'loss': 0.0238, 'learning_rate': 4.7666e-07, 'epoch': 2.76}
[WARNING|2025-03-09 01:50:05] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:50:06] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:50:07] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:50:09] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:50:10] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:50:11] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:50:12] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:50:14] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:50:15] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:50:16] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:50:17] logging.py:157 >> {'loss': 0.0285, 'learning_rate': 3.9570e-07, 'epoch': 2.78}
[WARNING|2025-03-09 01:50:17] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:50:19] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:50:20] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:50:21] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:50:22] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:50:23] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:50:25] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:50:26] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:50:27] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:50:29] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:50:30] logging.py:157 >> {'loss': 0.0284, 'learning_rate': 3.2218e-07, 'epoch': 2.80}
[WARNING|2025-03-09 01:50:30] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:50:31] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:50:32] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:50:34] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:50:35] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:50:36] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:50:38] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:50:39] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:50:40] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:50:41] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:50:43] logging.py:157 >> {'loss': 0.0312, 'learning_rate': 2.5614e-07, 'epoch': 2.82}
[WARNING|2025-03-09 01:50:43] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:50:44] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:50:45] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:50:46] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:50:47] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:50:49] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:50:50] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:50:51] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:50:52] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:50:54] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:50:55] logging.py:157 >> {'loss': 0.0259, 'learning_rate': 1.9760e-07, 'epoch': 2.84}
[WARNING|2025-03-09 01:50:55] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:50:56] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:50:57] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:50:58] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:51:00] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:51:01] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:51:02] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:51:03] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:51:05] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:51:06] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:51:07] logging.py:157 >> {'loss': 0.0263, 'learning_rate': 1.4661e-07, 'epoch': 2.86}
[WARNING|2025-03-09 01:51:07] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:51:08] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:51:10] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:51:11] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:51:12] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:51:13] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:51:14] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:51:16] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:51:17] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:51:18] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:51:19] logging.py:157 >> {'loss': 0.0175, 'learning_rate': 1.0318e-07, 'epoch': 2.88}
[WARNING|2025-03-09 01:51:19] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:51:20] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:51:22] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:51:23] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:51:24] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:51:25] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:51:26] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:51:28] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:51:29] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:51:30] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:51:31] logging.py:157 >> {'loss': 0.0379, 'learning_rate': 6.7337e-08, 'epoch': 2.91}
[WARNING|2025-03-09 01:51:31] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:51:32] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:51:34] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:51:35] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:51:36] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:51:37] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:51:39] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:51:40] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:51:41] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:51:42] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:51:43] logging.py:157 >> {'loss': 0.0401, 'learning_rate': 3.9102e-08, 'epoch': 2.93}
[WARNING|2025-03-09 01:51:43] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:51:45] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:51:46] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:51:47] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:51:48] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:51:50] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:51:51] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:51:52] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:51:53] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:51:54] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:51:56] logging.py:157 >> {'loss': 0.0174, 'learning_rate': 1.8486e-08, 'epoch': 2.95}
[WARNING|2025-03-09 01:51:56] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:51:57] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:51:58] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:51:59] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:52:01] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:52:02] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:52:03] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:52:04] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:52:06] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:52:07] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:52:08] logging.py:157 >> {'loss': 0.0218, 'learning_rate': 5.5007e-09, 'epoch': 2.97}
[WARNING|2025-03-09 01:52:08] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:52:09] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:52:11] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:52:12] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:52:13] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:52:14] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:52:16] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:52:17] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:52:18] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:52:19] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:52:21] logging.py:157 >> {'loss': 0.0288, 'learning_rate': 1.5281e-10, 'epoch': 2.99}
[WARNING|2025-03-09 01:52:21] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[WARNING|2025-03-09 01:52:22] logging.py:168 >> 'Seq2SeqTrainingArguments' object has no attribute 'average_tokens_across_devices'
[INFO|2025-03-09 01:52:23] trainer.py:3801 >> Saving model checkpoint to saves/Mistral-7B-Instruct-v0.3/lora/treino_novo_mistral/checkpoint-1392
[INFO|2025-03-09 01:52:23] configuration_utils.py:679 >> loading configuration file config.json from cache at /home/zeus/.cache/huggingface/hub/models--unsloth--mistral-7b-instruct-v0.3-bnb-4bit/snapshots/d5f623888f1415cf89b5c208d09cb620694618ee/config.json
[INFO|2025-03-09 01:52:23] configuration_utils.py:746 >> Model config MistralConfig {
"_name_or_path": "unsloth/Mistral-7B-Instruct-v0.3",
"architectures": [
"MistralForCausalLM"
],
"attention_dropout": 0.0,
"bos_token_id": 1,
"eos_token_id": 2,
"head_dim": 128,
"hidden_act": "silu",
"hidden_size": 4096,
"initializer_range": 0.02,
"intermediate_size": 14336,
"max_position_embeddings": 32768,
"model_type": "mistral",
"num_attention_heads": 32,
"num_hidden_layers": 32,
"num_key_value_heads": 8,
"pad_token_id": 770,
"quantization_config": {
"_load_in_4bit": true,
"_load_in_8bit": false,
"bnb_4bit_compute_dtype": "bfloat16",
"bnb_4bit_quant_storage": "uint8",
"bnb_4bit_quant_type": "nf4",
"bnb_4bit_use_double_quant": true,
"llm_int8_enable_fp32_cpu_offload": false,
"llm_int8_has_fp16_weight": false,
"llm_int8_skip_modules": null,
"llm_int8_threshold": 6.0,
"load_in_4bit": true,
"load_in_8bit": false,
"quant_method": "bitsandbytes"
},
"rms_norm_eps": 1e-05,
"rope_theta": 1000000.0,
"sliding_window": null,
"tie_word_embeddings": false,
"torch_dtype": "bfloat16",
"transformers_version": "4.46.1",
"unsloth_version": "2024.9",
"use_cache": true,
"vocab_size": 32768
}
[INFO|2025-03-09 01:52:24] <string>:461 >> Training completed. Do not forget to share your model on huggingface.co/models =)
[INFO|2025-03-09 01:52:24] trainer.py:3801 >> Saving model checkpoint to saves/Mistral-7B-Instruct-v0.3/lora/treino_novo_mistral
[INFO|2025-03-09 01:52:24] configuration_utils.py:679 >> loading configuration file config.json from cache at /home/zeus/.cache/huggingface/hub/models--unsloth--mistral-7b-instruct-v0.3-bnb-4bit/snapshots/d5f623888f1415cf89b5c208d09cb620694618ee/config.json
[INFO|2025-03-09 01:52:24] configuration_utils.py:746 >> Model config MistralConfig {
"_name_or_path": "unsloth/Mistral-7B-Instruct-v0.3",
"architectures": [
"MistralForCausalLM"
],
"attention_dropout": 0.0,
"bos_token_id": 1,
"eos_token_id": 2,
"head_dim": 128,
"hidden_act": "silu",
"hidden_size": 4096,
"initializer_range": 0.02,
"intermediate_size": 14336,
"max_position_embeddings": 32768,
"model_type": "mistral",
"num_attention_heads": 32,
"num_hidden_layers": 32,
"num_key_value_heads": 8,
"pad_token_id": 770,
"quantization_config": {
"_load_in_4bit": true,
"_load_in_8bit": false,
"bnb_4bit_compute_dtype": "bfloat16",
"bnb_4bit_quant_storage": "uint8",
"bnb_4bit_quant_type": "nf4",
"bnb_4bit_use_double_quant": true,
"llm_int8_enable_fp32_cpu_offload": false,
"llm_int8_has_fp16_weight": false,
"llm_int8_skip_modules": null,
"llm_int8_threshold": 6.0,
"load_in_4bit": true,
"load_in_8bit": false,
"quant_method": "bitsandbytes"
},
"rms_norm_eps": 1e-05,
"rope_theta": 1000000.0,
"sliding_window": null,
"tie_word_embeddings": false,
"torch_dtype": "bfloat16",
"transformers_version": "4.46.1",
"unsloth_version": "2024.9",
"use_cache": true,
"vocab_size": 32768
}
[WARNING|2025-03-09 01:52:24] logging.py:162 >> No metric eval_loss to plot.
[WARNING|2025-03-09 01:52:24] logging.py:162 >> No metric eval_accuracy to plot.
[INFO|2025-03-09 01:52:24] modelcard.py:449 >> Dropping the following result as it does not have all the necessary fields: {'task': {'name': 'Causal Language Modeling', 'type': 'text-generation'}}