Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
manu
/
llama-wikitext
like
0
Text Generation
Transformers
PyTorch
wikitext
llama
Generated from Trainer
text-generation-inference
Model card
Files
Files and versions
xet
Community
1
Deploy
Use this model
refs/pr/1
llama-wikitext
1.76 GB
1 contributor
History:
12 commits
SFconvertbot
Adding `safetensors` variant of this model
b68e6ef
verified
about 1 year ago
.gitattributes
Safe
1.52 kB
initial commit
over 2 years ago
README.md
Safe
1.33 kB
End of training
over 2 years ago
all_results.json
Safe
194 Bytes
End of training
over 2 years ago
config.json
Safe
708 Bytes
Model save
over 2 years ago
generation_config.json
111 Bytes
Model save
over 2 years ago
model.safetensors
880 MB
xet
Adding `safetensors` variant of this model
about 1 year ago
pytorch_model.bin
Safe
pickle
Detected Pickle imports (3)
"collections.OrderedDict"
,
"torch.BFloat16Storage"
,
"torch._utils._rebuild_tensor_v2"
What is a pickle import?
880 MB
xet
Model save
over 2 years ago
special_tokens_map.json
Safe
414 Bytes
Model save
over 2 years ago
tokenizer.json
Safe
1.35 MB
Model save
over 2 years ago
tokenizer_config.json
Safe
18.6 kB
Model save
over 2 years ago
train_results.json
Safe
194 Bytes
End of training
over 2 years ago
trainer_state.json
Safe
5.33 kB
End of training
over 2 years ago
training_args.bin
pickle
Detected Pickle imports (11)
"transformers.integrations.deepspeed.HfTrainerDeepSpeedConfig"
,
"accelerate.state.PartialState"
,
"transformers.trainer_utils.SchedulerType"
,
"transformers.trainer_utils.HubStrategy"
,
"accelerate.utils.dataclasses.DeepSpeedPlugin"
,
"transformers.training_args.OptimizerNames"
,
"accelerate.utils.dataclasses.DistributedType"
,
"transformers.trainer_utils.IntervalStrategy"
,
"transformers.training_args.TrainingArguments"
,
"torch.device"
,
"torch.bfloat16"
How to fix it?
5.05 kB
xet
Training in progress, step 100
over 2 years ago