Improve language tag
130d8ca verified
- 1.57 kB Initial upload of qwen2.5-32b-instruct_deepseek-reasoner_2004_03-10-21_lr1e-5_wd1e-4_epo5_len32768_tbs1
- 4.86 kB Improve language tag
- 605 Bytes Initial upload of qwen2.5-32b-instruct_deepseek-reasoner_2004_03-10-21_lr1e-5_wd1e-4_epo5_len32768_tbs1
- 218 Bytes Initial upload of qwen2.5-32b-instruct_deepseek-reasoner_2004_03-10-21_lr1e-5_wd1e-4_epo5_len32768_tbs1
- 736 Bytes Initial upload of qwen2.5-32b-instruct_deepseek-reasoner_2004_03-10-21_lr1e-5_wd1e-4_epo5_len32768_tbs1
- 243 Bytes Initial upload of qwen2.5-32b-instruct_deepseek-reasoner_2004_03-10-21_lr1e-5_wd1e-4_epo5_len32768_tbs1
- 1.67 MB Initial upload of qwen2.5-32b-instruct_deepseek-reasoner_2004_03-10-21_lr1e-5_wd1e-4_epo5_len32768_tbs1
- 4.89 GB Initial upload of qwen2.5-32b-instruct_deepseek-reasoner_2004_03-10-21_lr1e-5_wd1e-4_epo5_len32768_tbs1
- 4.88 GB Initial upload of qwen2.5-32b-instruct_deepseek-reasoner_2004_03-10-21_lr1e-5_wd1e-4_epo5_len32768_tbs1
- 4.88 GB Initial upload of qwen2.5-32b-instruct_deepseek-reasoner_2004_03-10-21_lr1e-5_wd1e-4_epo5_len32768_tbs1
- 4.88 GB Initial upload of qwen2.5-32b-instruct_deepseek-reasoner_2004_03-10-21_lr1e-5_wd1e-4_epo5_len32768_tbs1
- 4.88 GB Initial upload of qwen2.5-32b-instruct_deepseek-reasoner_2004_03-10-21_lr1e-5_wd1e-4_epo5_len32768_tbs1
- 4.88 GB Initial upload of qwen2.5-32b-instruct_deepseek-reasoner_2004_03-10-21_lr1e-5_wd1e-4_epo5_len32768_tbs1
- 4.88 GB Initial upload of qwen2.5-32b-instruct_deepseek-reasoner_2004_03-10-21_lr1e-5_wd1e-4_epo5_len32768_tbs1
- 4.88 GB Initial upload of qwen2.5-32b-instruct_deepseek-reasoner_2004_03-10-21_lr1e-5_wd1e-4_epo5_len32768_tbs1
- 4.88 GB Initial upload of qwen2.5-32b-instruct_deepseek-reasoner_2004_03-10-21_lr1e-5_wd1e-4_epo5_len32768_tbs1
- 4.88 GB Initial upload of qwen2.5-32b-instruct_deepseek-reasoner_2004_03-10-21_lr1e-5_wd1e-4_epo5_len32768_tbs1
- 4.88 GB Initial upload of qwen2.5-32b-instruct_deepseek-reasoner_2004_03-10-21_lr1e-5_wd1e-4_epo5_len32768_tbs1
- 4.88 GB Initial upload of qwen2.5-32b-instruct_deepseek-reasoner_2004_03-10-21_lr1e-5_wd1e-4_epo5_len32768_tbs1
- 4.88 GB Initial upload of qwen2.5-32b-instruct_deepseek-reasoner_2004_03-10-21_lr1e-5_wd1e-4_epo5_len32768_tbs1
- 2.12 GB Initial upload of qwen2.5-32b-instruct_deepseek-reasoner_2004_03-10-21_lr1e-5_wd1e-4_epo5_len32768_tbs1
- 63.2 kB Initial upload of qwen2.5-32b-instruct_deepseek-reasoner_2004_03-10-21_lr1e-5_wd1e-4_epo5_len32768_tbs1
- 613 Bytes Initial upload of qwen2.5-32b-instruct_deepseek-reasoner_2004_03-10-21_lr1e-5_wd1e-4_epo5_len32768_tbs1
- 11.4 MB Initial upload of qwen2.5-32b-instruct_deepseek-reasoner_2004_03-10-21_lr1e-5_wd1e-4_epo5_len32768_tbs1
- 7.36 kB Initial upload of qwen2.5-32b-instruct_deepseek-reasoner_2004_03-10-21_lr1e-5_wd1e-4_epo5_len32768_tbs1
- 218 Bytes Initial upload of qwen2.5-32b-instruct_deepseek-reasoner_2004_03-10-21_lr1e-5_wd1e-4_epo5_len32768_tbs1
- 6.12 kB Initial upload of qwen2.5-32b-instruct_deepseek-reasoner_2004_03-10-21_lr1e-5_wd1e-4_epo5_len32768_tbs1
- 6.33 kB Initial upload of qwen2.5-32b-instruct_deepseek-reasoner_2004_03-10-21_lr1e-5_wd1e-4_epo5_len32768_tbs1
training_args.bin Detected Pickle imports (14)
- "transformers.trainer_utils.IntervalStrategy",
- "transformers.trainer_utils.SchedulerType",
- "transformers.trainer_utils.HubStrategy",
- "accelerate.state.PartialState",
- "accelerate.utils.dataclasses.DeepSpeedPlugin",
- "transformers.integrations.deepspeed.HfDeepSpeedConfig",
- "llamafactory.hparams.training_args.TrainingArguments",
- "torch.device",
- "transformers.trainer_pt_utils.AcceleratorConfig",
- "accelerate.utils.dataclasses.DistributedType",
- "torch.bfloat16",
- "transformers.training_args.OptimizerNames",
- "transformers.integrations.deepspeed.HfTrainerDeepSpeedConfig",
- "transformers.trainer_utils.SaveStrategy"
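
A list like the one above can be reproduced locally without unpickling the file, by walking the pickle opcodes and collecting the referenced globals. The following is a minimal sketch under stated assumptions, not the scanner used to produce this listing: the helper name `list_pickle_imports` is hypothetical, and the `STACK_GLOBAL` handling only covers the common case where the module and attribute names are the two most recent string constants. It assumes you already have the raw pickle bytes; files written by `torch.save` are usually archives that wrap the pickle stream, so it would need to be extracted first.

```python
import pickle
import pickletools
from collections import OrderedDict


def list_pickle_imports(data):
    """Return sorted 'module.name' strings referenced by a pickle stream.

    The stream is only parsed opcode by opcode; nothing is unpickled,
    so no code embedded in the pickle is executed.
    """
    found = []
    strings = []  # recent string constants, used to resolve STACK_GLOBAL
    for opcode, arg, _pos in pickletools.genops(data):
        if opcode.name == "GLOBAL":
            # Protocols 0-3: argument is "module name" separated by a space.
            module, name = arg.split(" ", 1)
            found.append(f"{module}.{name}")
        elif opcode.name == "STACK_GLOBAL" and len(strings) >= 2:
            # Protocol 4+: module and name were pushed as strings just before.
            # Simplification: memoized names fetched via BINGET are not tracked.
            found.append(f"{strings[-2]}.{strings[-1]}")
        elif isinstance(arg, str):
            strings.append(arg)
    return sorted(set(found))


if __name__ == "__main__":
    # Demo on an in-memory pickle instead of a real checkpoint file.
    payload = pickle.dumps(OrderedDict(a=1), protocol=4)
    for entry in list_pickle_imports(payload):
        print(entry)
```

Because the stream is never loaded, this kind of inspection is safe to run on untrusted files, whereas `pickle.load` or `torch.load` without restrictions would import and call the listed objects.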
- 8.06 kB Initial upload of qwen2.5-32b-instruct_deepseek-reasoner_2004_03-10-21_lr1e-5_wd1e-4_epo5_len32768_tbs1
- 39 kB Initial upload of qwen2.5-32b-instruct_deepseek-reasoner_2004_03-10-21_lr1e-5_wd1e-4_epo5_len32768_tbs1
- 2.78 MB Initial upload of qwen2.5-32b-instruct_deepseek-reasoner_2004_03-10-21_lr1e-5_wd1e-4_epo5_len32768_tbs1