jingyaogong/MiniMind2-Pytorch
arxiv: 2405.04434
arxiv: 2402.14905
arxiv: 2401.04088
MiniMind2-Pytorch at revision 6d835dd, 3.83 GB total
1 contributor, History: 7 commits
Latest commit: jingyaogong, "Delete rlhf_512.pth" (6d835dd, verified), 12 months ago
| Name | Size | Storage | Scan | Last commit | Updated |
|---|---|---|---|---|---|
| images | | | | Delete images/1.png | about 1 year ago |
| .gitattributes | 2.29 kB | | Safe | Upload 14 files | about 1 year ago |
| README.md | 91.1 kB | | Safe | Upload 2 files | about 1 year ago |
| README_en.md | 101 kB | | Safe | Upload 2 files | about 1 year ago |
| full_sft_512.pth | 103 MB | xet | | Upload 12 files | about 1 year ago |
| full_sft_512_zero.pth | 103 MB | xet | | Upload 12 files | about 1 year ago |
| full_sft_640_moe.pth | 580 MB | xet | | Upload 12 files | about 1 year ago |
| full_sft_768.pth | 416 MB | xet | | Upload 12 files | about 1 year ago |
| pretrain_512.pth | 103 MB | xet | | Upload 12 files | about 1 year ago |
| pretrain_640_moe.pth | 580 MB | xet | | Upload 12 files | about 1 year ago |
| pretrain_768.pth | 416 MB | xet | | Upload 12 files | about 1 year ago |
| reason_512.pth | 103 MB | xet | | Upload 12 files | about 1 year ago |
| reason_768.pth | 416 MB | xet | | Upload 12 files | about 1 year ago |
| rlhf_640_moe.pth | 580 MB | xet | | Upload 12 files | about 1 year ago |
| rlhf_768.pth | 416 MB | xet | Safe (pickle) | Upload 12 files | about 1 year ago |

Detected pickle imports (3) in rlhf_768.pth: "torch._utils._rebuild_tensor_v2", "torch.FloatStorage", "collections.OrderedDict"
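
The "Detected Pickle imports" entry for rlhf_768.pth lists the globals that the checkpoint's pickle stream references. As a rough sketch of how such a list can be produced without unpickling (and so without running any code from the file), the standard-library `pickletools` module can walk the opcodes and collect the `GLOBAL` / `STACK_GLOBAL` targets. This is not the Hub's scanner: the function names `list_pickle_imports` and `checkpoint_imports` are hypothetical, the `STACK_GLOBAL` handling is a simple heuristic, and the file helper assumes the checkpoint uses PyTorch's zip-based layout with a `data.pkl` member, which the listing does not confirm.

```python
import collections
import pickle
import pickletools
import zipfile

# Opcodes that push a text string; STACK_GLOBAL consumes two of them.
_STRING_OPS = {"SHORT_BINUNICODE", "BINUNICODE", "BINUNICODE8", "UNICODE"}


def list_pickle_imports(data: bytes) -> set:
    """Return the 'module.name' globals referenced by one pickle stream."""
    imports = set()
    strings = []  # text strings seen so far, in order
    for opcode, arg, _pos in pickletools.genops(data):
        if opcode.name == "GLOBAL":
            # pickletools reports the argument as "module name".
            module, name = arg.split(" ", 1)
            imports.add(f"{module}.{name}")
        elif opcode.name == "STACK_GLOBAL" and len(strings) >= 2:
            # Heuristic: assume the two most recent strings are module and
            # name. Memoized (reused) strings are not tracked here.
            imports.add(f"{strings[-2]}.{strings[-1]}")
        elif opcode.name in _STRING_OPS:
            strings.append(arg)
    return imports


def checkpoint_imports(path: str) -> set:
    """Scan a zip-format PyTorch checkpoint without unpickling it."""
    if not zipfile.is_zipfile(path):
        raise ValueError("only the zip-based checkpoint layout is handled here")
    with zipfile.ZipFile(path) as archive:
        # Assumption: the pickle payload is the member named .../data.pkl.
        member = next(n for n in archive.namelist() if n.endswith("data.pkl"))
        return list_pickle_imports(archive.read(member))


# Self-contained demonstration on an in-memory pickle.
demo = pickle.dumps(collections.OrderedDict(a=1), protocol=4)
print(sorted(list_pickle_imports(demo)))
```

For a downloaded checkpoint, `checkpoint_imports("rlhf_768.pth")` would be the corresponding call. A result limited to tensor-rebuilding helpers, storage classes, and `collections.OrderedDict` is what a plain state dict usually references; anything else deserves a closer look before loading the file.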