Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
HuggingFaceH4
/
Qwen2.5-1.5B-Instruct-gkd
like
2
Follow
Hugging Face H4
1.47k
TensorBoard
Safetensors
qwen2
Model card
Files
Files and versions
xet
Metrics
Training metrics
Community
main
Qwen2.5-1.5B-Instruct-gkd
/
checkpoint-500
5.94 GB
Ctrl+K
Ctrl+K
1 contributor
History:
1 commit
kashif
HF Staff
Upload GKD trained model and tokenizer
6b0d76d
verified
11 months ago
added_tokens.json
Safe
80 Bytes
Upload GKD trained model and tokenizer
11 months ago
chat_template.jinja
Safe
328 Bytes
Upload GKD trained model and tokenizer
11 months ago
config.json
1.24 kB
Upload GKD trained model and tokenizer
11 months ago
generation_config.json
Safe
247 Bytes
Upload GKD trained model and tokenizer
11 months ago
merges.txt
Safe
1.67 MB
Upload GKD trained model and tokenizer
11 months ago
model.safetensors
1.98 GB
xet
Upload GKD trained model and tokenizer
11 months ago
optimizer.pt
pickle
Detected Pickle imports (3)
"torch._utils._rebuild_tensor_v2"
,
"torch.FloatStorage"
,
"collections.OrderedDict"
What is a pickle import?
3.95 GB
xet
Upload GKD trained model and tokenizer
11 months ago
rng_state_0.pth
pickle
Detected Pickle imports (7)
"torch._utils._rebuild_tensor_v2"
,
"collections.OrderedDict"
,
"_codecs.encode"
,
"numpy.core.multiarray._reconstruct"
,
"numpy.dtype"
,
"numpy.ndarray"
,
"torch.ByteStorage"
How to fix it?
14.9 kB
xet
Upload GKD trained model and tokenizer
11 months ago
rng_state_1.pth
pickle
Detected Pickle imports (7)
"torch._utils._rebuild_tensor_v2"
,
"collections.OrderedDict"
,
"_codecs.encode"
,
"numpy.core.multiarray._reconstruct"
,
"numpy.dtype"
,
"numpy.ndarray"
,
"torch.ByteStorage"
How to fix it?
14.9 kB
xet
Upload GKD trained model and tokenizer
11 months ago
scheduler.pt
pickle
Pickle imports
No problematic imports detected
What is a pickle import?
1.47 kB
xet
Upload GKD trained model and tokenizer
11 months ago
special_tokens_map.json
Safe
367 Bytes
Upload GKD trained model and tokenizer
11 months ago
tokenizer.json
11.4 MB
xet
Upload GKD trained model and tokenizer
11 months ago
tokenizer_config.json
Safe
999 Bytes
Upload GKD trained model and tokenizer
11 months ago
trainer_state.json
9.44 kB
Upload GKD trained model and tokenizer
11 months ago
training_args.bin
pickle
Detected Pickle imports (10)
"transformers.trainer_utils.SaveStrategy"
,
"transformers.training_args.OptimizerNames"
,
"transformers.trainer_utils.IntervalStrategy"
,
"accelerate.utils.dataclasses.DistributedType"
,
"torch.device"
,
"accelerate.state.PartialState"
,
"transformers.trainer_utils.HubStrategy"
,
"transformers.trainer_utils.SchedulerType"
,
"transformers.trainer_pt_utils.AcceleratorConfig"
,
"trl.trainer.gkd_config.GKDConfig"
How to fix it?
6.8 kB
xet
Upload GKD trained model and tokenizer
11 months ago
vocab.json
Safe
2.78 MB
Upload GKD trained model and tokenizer
11 months ago