Yewei-Liu/SHINE-ift_mqa_1qa
Tags: Text Generation, Transformers, arxiv:2602.06358
License: mit
Branch: main
Repository size: 3.55 GB
2 contributors, History: 11 commits
Latest commit: Yewei-Liu, "Delete upload.py" (79707b4, verified), 9 days ago
Files:

.gitattributes (1.52 kB, Safe)
    Last commit: "initial commit", about 2 months ago

README.md (1.6 kB, Safe)
    Last commit: "Update README.md", about 2 months ago

mem_tokens.pt (2.43 MB, xet, pickle)
    Detected Pickle imports (4): "torch._utils._rebuild_tensor_v2", "torch.FloatStorage", "torch._utils._rebuild_parameter_with_state", "collections.OrderedDict"
    Last commit: "Move checkpoint-epoch-1/mem_tokens.pt to root", 9 days ago

metalora.pth (1.4 GB, xet, pickle, Safe)
    Detected Pickle imports (3): "collections.OrderedDict", "torch.FloatStorage", "torch._utils._rebuild_tensor_v2"
    Last commit: "Move checkpoint-epoch-1/metalora.pth to root", 9 days ago

metanetwork.pth (2.15 GB, xet, pickle, Safe)
    Detected Pickle imports (3): "torch._utils._rebuild_tensor_v2", "torch.FloatStorage", "collections.OrderedDict"
    Last commit: "Move checkpoint-epoch-1/metanetwork.pth to root", 9 days ago

trainer_state.json (26 Bytes, Safe)
    Last commit: "Move checkpoint-epoch-1/trainer_state.json to root", 9 days ago

trainer_state.pt (16.1 kB, xet, pickle)
    Detected Pickle imports (7): "numpy.ndarray", "collections.OrderedDict", "_codecs.encode", "numpy.dtype", "torch._utils._rebuild_tensor_v2", "torch.ByteStorage", "numpy._core.multiarray._reconstruct"
    Last commit: "Move checkpoint-epoch-1/trainer_state.pt to root", 9 days ago
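
The "Detected Pickle imports" entries in the listing name the globals that each pickle file would import when it is loaded. The following is a minimal sketch of how such a list could be derived from a raw pickle stream using Python's standard `pickletools` module. It is not the Hub's actual scanner: `list_pickle_imports` is a hypothetical helper name, and the `STACK_GLOBAL` handling is a simplifying heuristic that assumes the module and attribute names are the two most recently pushed strings (it can miss names that were fetched from the pickle memo instead). Checkpoints written by recent versions of `torch.save` are typically zip archives with the pickle stream stored in a `data.pkl` member, so for files like the ones listed here the bytes would need to be extracted from the archive first.

```python
import pickle
import pickletools
from collections import OrderedDict


def list_pickle_imports(data: bytes) -> set:
    """Return the 'module.name' globals a pickle stream refers to (hypothetical helper)."""
    imports = set()
    strings = []  # string operands seen so far, used to resolve STACK_GLOBAL
    for opcode, arg, _pos in pickletools.genops(data):
        if opcode.name == "GLOBAL":
            # Protocols 0-3: the operand is reported as "module name".
            module, name = arg.split(" ", 1)
            imports.add(f"{module}.{name}")
        elif opcode.name == "STACK_GLOBAL":
            # Protocol 4+: heuristic, take the two most recent string operands.
            if len(strings) >= 2:
                imports.add(f"{strings[-2]}.{strings[-1]}")
        elif isinstance(arg, str):
            strings.append(arg)
    return imports


# Example: an OrderedDict pickles as a reference to collections.OrderedDict.
example = pickle.dumps(OrderedDict(a=1), protocol=4)
print(sorted(list_pickle_imports(example)))  # prints ['collections.OrderedDict']
```

Because `pickletools.genops` only parses opcodes and never executes them, a scan like this can be run on an untrusted file without triggering the imports it reports.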