nmcco/15_flashattn_gemma
f8e1b74 verified - 1.57 kB Training in progress, step 40
- 1.7 kB nmcco/15_flashattn_gemma
- 839 Bytes Training in progress, step 40
- 1.35 GB Training in progress, step 260
- 636 Bytes Training in progress, step 40
- 34.4 MB Training in progress, step 40
- 47 kB Training in progress, step 40
training_args.bin Detected Pickle imports (14)
- "transformers.integrations.deepspeed.HfDeepSpeedConfig",
- "transformers.trainer_pt_utils.AcceleratorConfig",
- "accelerate.utils.dataclasses.DistributedType",
- "transformers.trainer_utils.SaveStrategy",
- "transformers.trainer_utils.HubStrategy",
- "transformers.integrations.deepspeed.HfTrainerDeepSpeedConfig",
- "transformers.trainer_utils.SchedulerType",
- "transformers.training_args.OptimizerNames",
- "accelerate.state.PartialState",
- "accelerate.utils.dataclasses.DeepSpeedPlugin",
- "torch.device",
- "torch.bfloat16",
- "trl.trainer.sft_config.SFTConfig",
- "transformers.trainer_utils.IntervalStrategy"
How to fix it?
6.84 kB Training in progress, step 40