yzjwork/MiniMind2_Pytorch_test
arxiv: 2405.04434, 2402.14905, 2401.04088
Branch: main
Total size: 3.93 GB
1 contributor
History: 2 commits
Latest commit: yzjwork, "Upload 29 files" (3f45b00, verified), 10 months ago
| Name | Size | Scan / storage | Last commit | Updated |
|---|---|---|---|---|
| images | | | Upload 29 files | 10 months ago |
| .gitattributes | 2.34 kB | Safe | Upload 29 files | 10 months ago |
| README.md | 92.4 kB | Safe | Upload 29 files | 10 months ago |
| README_en.md | 102 kB | Safe | Upload 29 files | 10 months ago |
| full_sft_512.pth | 103 MB | xet | Upload 29 files | 10 months ago |
| full_sft_512_zero.pth | 103 MB | xet | Upload 29 files | 10 months ago |
| full_sft_640_moe.pth | 580 MB | xet | Upload 29 files | 10 months ago |
| full_sft_768.pth | 416 MB | xet | Upload 29 files | 10 months ago |
| pretrain_512.pth | 103 MB | xet | Upload 29 files | 10 months ago |
| pretrain_640_moe.pth | 580 MB | xet | Upload 29 files | 10 months ago |
| pretrain_768.pth | 416 MB | xet | Upload 29 files | 10 months ago |
| reason_512.pth | 103 MB | xet | Upload 29 files | 10 months ago |
| reason_768.pth | 416 MB | xet | Upload 29 files | 10 months ago |
| rlhf_512.pth | 103 MB | pickle, xet | Upload 29 files | 10 months ago |
| rlhf_640_moe.pth | 580 MB | xet | Upload 29 files | 10 months ago |
| rlhf_768.pth | 416 MB | pickle, xet | Upload 29 files | 10 months ago |

Detected Pickle imports (3) in rlhf_512.pth and rlhf_768.pth: "torch._utils._rebuild_tensor_v2", "torch.FloatStorage", "collections.OrderedDict"
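
The "Detected Pickle imports" entries for rlhf_512.pth and rlhf_768.pth name the globals that the pickle stream would import when the file is loaded. As a minimal sketch of how such a list could be produced without executing the pickle, the following uses Python's standard `pickletools` module. `list_pickle_imports` is a hypothetical helper written for illustration, not the scanner that the Hub runs, and it makes simplifying assumptions: it does not resolve memoized strings and ignores the `INST` and `OBJ` opcodes.

```python
import collections
import pickle
import pickletools


def list_pickle_imports(data: bytes) -> set:
    """Return the 'module.name' globals referenced by a pickle stream.

    The stream is only disassembled, never unpickled, so no code runs.
    """
    imports = set()
    recent_strings = []  # string operands seen so far, used for STACK_GLOBAL
    for opcode, arg, _pos in pickletools.genops(data):
        if opcode.name == "GLOBAL":
            # Protocols 0-3: the operand is "module name", separated by a space.
            module, name = arg.split(" ", 1)
            imports.add(f"{module}.{name}")
        elif opcode.name == "STACK_GLOBAL":
            # Protocol 4+: module and name are the two strings pushed just before.
            # Assumption: neither string was fetched from the memo instead.
            if len(recent_strings) >= 2:
                imports.add(f"{recent_strings[-2]}.{recent_strings[-1]}")
        elif isinstance(arg, str):
            recent_strings.append(arg)
    return imports


if __name__ == "__main__":
    # Illustrative data only; this is not one of the checkpoints listed above.
    payload = pickle.dumps(collections.OrderedDict(a=1), protocol=4)
    print(sorted(list_pickle_imports(payload)))  # ['collections.OrderedDict']
```

Checkpoints written by recent versions of `torch.save` are zip archives, so the pickle stream would first have to be read from the archive member that holds it (typically named `data.pkl`); that step is omitted here.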