36.7 GB

Ctrl+K

1 contributor

History: 14 commits

HectorHe

Training in progress, step 900

1f5b430 verified about 1 year ago

.gitattributes

1.52 kB
initial commit over 1 year ago
config.json

1.74 kB
Training in progress, step 20 about 1 year ago
expert_selection.log

8.28 kB
Training in progress, step 100 about 1 year ago
model-00001-of-00002.safetensors

4.9 GB
xet

Training in progress, step 900 about 1 year ago
model-00001-of-00007.safetensors

4.99 GB
xet

Training in progress, step 200 about 1 year ago
model-00002-of-00002.safetensors

419 MB
xet

Training in progress, step 900 about 1 year ago
model-00002-of-00007.safetensors

5 GB
xet

Training in progress, step 200 about 1 year ago
model-00003-of-00007.safetensors

5 GB
xet

Training in progress, step 200 about 1 year ago
model-00004-of-00007.safetensors

5 GB
xet

Training in progress, step 200 about 1 year ago
model-00005-of-00007.safetensors

5 GB
xet

Training in progress, step 200 about 1 year ago
model-00006-of-00007.safetensors

5 GB
xet

Training in progress, step 200 about 1 year ago
model-00007-of-00007.safetensors

1.44 GB
xet

Training in progress, step 200 about 1 year ago
model.safetensors.index.json

72.1 kB
Training in progress, step 20 about 1 year ago
special_tokens_map.json

369 Bytes
Training in progress, step 200 about 1 year ago
tokenizer.json

7.5 MB
Training in progress, step 100 about 1 year ago
tokenizer_config.json

4.42 kB
Training in progress, step 20 about 1 year ago
top_6_experts.json

1.24 kB
Training in progress, step 20 about 1 year ago
top_6_experts_lmms-lab_Math10K.json

1.24 kB
Training in progress, step 100 about 1 year ago
top_k_experts.json

942 Bytes
Training in progress, step 20 about 1 year ago
training.log

111 kB
Training in progress, step 300 about 1 year ago
training_4pus.log

75.1 kB
Training in progress, step 200 about 1 year ago
training_args.bin
Detected Pickle imports (14)
- "transformers.trainer_utils.IntervalStrategy",
- "transformers.training_args.OptimizerNames",
- "accelerate.utils.dataclasses.DeepSpeedPlugin",
- "torch.bfloat16",
- "transformers.trainer_utils.SchedulerType",
- "trainers.efficient_distillation_config.EfficientDistillationConfig",
- "torch.device",
- "transformers.trainer_utils.HubStrategy",
- "accelerate.utils.dataclasses.DistributedType",
- "accelerate.state.PartialState",
- "transformers.trainer_utils.SaveStrategy",
- "transformers.integrations.deepspeed.HfDeepSpeedConfig",
- "transformers.integrations.deepspeed.HfTrainerDeepSpeedConfig",
- "transformers.trainer_pt_utils.AcceleratorConfig"
How to fix it?
7.99 kB
xet

Training in progress, step 300 about 1 year ago
training_backup.log

17.5 kB
Training in progress, step 200 about 1 year ago
training_distill.log

519 kB
Training in progress, step 20 about 1 year ago

Detected Pickle imports (14)