Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
gmongaras
/
medium_8192sl_gpu_64bs__softmax
like
0
Safetensors
llama
arxiv:
2602.17363
Model card
Files
Files and versions
xet
Community
main
medium_8192sl_gpu_64bs__softmax
8.83 GB
1 contributor
History:
4 commits
gmongaras
Update README.md
0d2afad
verified
9 days ago
.gitattributes
Safe
1.52 kB
initial commit
24 days ago
README.md
508 Bytes
Update README.md
9 days ago
config.json
Safe
753 Bytes
Upload folder using huggingface_hub
24 days ago
config.pt
Safe
pickle
Pickle imports
No problematic imports detected
What is a pickle import?
1.31 kB
xet
Upload folder using huggingface_hub
24 days ago
generation_config.json
Safe
111 Bytes
Upload folder using huggingface_hub
24 days ago
model.safetensors
Safe
2.94 GB
xet
Upload folder using huggingface_hub
24 days ago
optimizer.pt
Safe
pickle
Detected Pickle imports (3)
"torch.FloatStorage"
,
"torch._utils._rebuild_tensor_v2"
,
"collections.OrderedDict"
What is a pickle import?
5.88 GB
xet
Upload folder using huggingface_hub
24 days ago
scaler.pt
pickle
Detected Pickle imports (3)
"collections.defaultdict"
,
"torch.amp.grad_scaler.GradScaler"
,
"torch.amp.grad_scaler._refresh_per_optimizer_state"
How to fix it?
1.24 kB
xet
Upload folder using huggingface_hub
24 days ago
scheduler.pt
Safe
pickle
Pickle imports
No problematic imports detected
What is a pickle import?
1 kB
xet
Upload folder using huggingface_hub
24 days ago
special_tokens_map.json
Safe
437 Bytes
Upload folder using huggingface_hub
24 days ago
tokenizer.model
Safe
500 kB
xet
Upload folder using huggingface_hub
24 days ago
tokenizer.pt
pickle
Detected Pickle imports (5)
"transformers.models.llama.tokenization_llama.LlamaTokenizer"
,
"__builtin__.set"
,
"transformers.tokenization_utils.Trie"
,
"_codecs.encode"
,
"tokenizers.AddedToken"
How to fix it?
651 kB
xet
Upload folder using huggingface_hub
24 days ago
tokenizer_config.json
Safe
993 Bytes
Upload folder using huggingface_hub
24 days ago