Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
HuggingFaceH4
/
Qwen2.5-1.5B-Instruct-gkd
like
2
Follow
Hugging Face H4
1.45k
TensorBoard
Safetensors
qwen2
Model card
Files
Files and versions
xet
Metrics
Training metrics
Community
main
Qwen2.5-1.5B-Instruct-gkd
40.6 GB
Ctrl+K
Ctrl+K
1 contributor
History:
3 commits
kashif
HF Staff
Upload GKD trained model and tokenizer
38fdb38
verified
10 months ago
checkpoint-172
Upload GKD trained model and tokenizer
10 months ago
checkpoint-500
Upload GKD trained model and tokenizer
10 months ago
checkpoint-514
Upload GKD trained model and tokenizer
10 months ago
runs
Upload GKD trained model and tokenizer
10 months ago
.gitattributes
1.77 kB
Upload GKD trained model and tokenizer
10 months ago
added_tokens.json
Safe
605 Bytes
Upload GKD trained model and tokenizer
10 months ago
chat_template.jinja
Safe
2.51 kB
Upload GKD trained model and tokenizer
10 months ago
config.json
1.33 kB
Upload GKD trained model and tokenizer
10 months ago
generation_config.json
Safe
247 Bytes
Upload GKD trained model and tokenizer
10 months ago
merges.txt
Safe
1.67 MB
Upload GKD trained model and tokenizer
10 months ago
model.safetensors
3.55 GB
xet
Upload GKD trained model and tokenizer
10 months ago
special_tokens_map.json
Safe
613 Bytes
Upload GKD trained model and tokenizer
10 months ago
tokenizer.json
11.4 MB
xet
Upload GKD trained model and tokenizer
10 months ago
tokenizer_config.json
Safe
4.71 kB
Upload GKD trained model and tokenizer
10 months ago
training_args.bin
pickle
Detected Pickle imports (14)
"transformers.integrations.deepspeed.HfTrainerDeepSpeedConfig"
,
"transformers.trainer_utils.SaveStrategy"
,
"transformers.training_args.OptimizerNames"
,
"accelerate.utils.dataclasses.DistributedType"
,
"accelerate.utils.dataclasses.DeepSpeedPlugin"
,
"transformers.trainer_utils.SchedulerType"
,
"transformers.trainer_utils.HubStrategy"
,
"torch.bfloat16"
,
"torch.device"
,
"accelerate.state.PartialState"
,
"transformers.trainer_pt_utils.AcceleratorConfig"
,
"transformers.trainer_utils.IntervalStrategy"
,
"trl.trainer.gkd_config.GKDConfig"
,
"transformers.integrations.deepspeed.HfDeepSpeedConfig"
How to fix it?
8.21 kB
xet
Upload GKD trained model and tokenizer
10 months ago
vocab.json
Safe
2.78 MB
Upload GKD trained model and tokenizer
10 months ago