bd4sur/Nano-168M
License: mit
00a39c8
Nano-168M
8.62 GB
1 contributor
History: 19 commits
Latest commit: bd4sur, "Upload deepseek_qwen25_tokenizer.bin" (00a39c8, verified), about 1 year ago
.gitattributes
Safe
1.52 kB
initial commit
over 1 year ago
README.md
Safe
24 Bytes
initial commit
over 1 year ago
config_nano_168m_625000_sft_875000_20241220.json
Safe
999 Bytes
Upload 2 files
over 1 year ago
config_nano_168m_625000_sft_875000_amateur_radio_890000.json
Safe
999 Bytes
Upload config_nano_168m_625000_sft_875000_amateur_radio_890000.json
over 1 year ago
config_pretrain.json
Safe
998 Bytes
Upload 6 files
over 1 year ago
config_sft.json
Safe
921 Bytes
Upload 6 files
over 1 year ago
deepseek_qwen25_tokenizer.bin
2.19 MB
xet
Upload deepseek_qwen25_tokenizer.bin
about 1 year ago
nano_168m_625000.pt
2.05 GB
xet
Upload nano_168m_625000.pt
over 1 year ago
nano_168m_625000_sft_20241220.log
Safe
4.88 MB
Upload 2 files
over 1 year ago
nano_168m_625000_sft_786000.bin
674 MB
xet
Upload 2 files
over 1 year ago
nano_168m_625000_sft_786000.pt
pickle
Detected Pickle imports (5): "torch.FloatStorage", "model.ModelConfig", "collections.OrderedDict", "torch._utils._rebuild_tensor_v2", "model.TrainConfig"
2.05 GB
xet
Upload 2 files
over 1 year ago
nano_168m_625000_sft_875000_amateur_radio_890000.bin
674 MB
xet
Upload nano_168m_625000_sft_875000_amateur_radio_890000.bin
over 1 year ago
nano_168m_625000_sft_947000.bin
674 MB
xet
Upload nano_168m_625000_sft_947000.bin
over 1 year ago
nano_168m_625000_sft_947000.pt
pickle
Detected Pickle imports (5): "torch.FloatStorage", "torch._utils._rebuild_tensor_v2", "model.TrainConfig", "model.ModelConfig", "collections.OrderedDict"
2.05 GB
xet
Upload nano_168m_625000_sft_947000.pt
over 1 year ago
nano_168m_pt_1130.log
Safe
9.4 MB
Upload nano_168m_pt_1130.log
over 1 year ago
qwen25-0b5-instruct.bin
437 MB
xet
Upload 2 files
over 1 year ago
qwen25-tokenizer.bin
2.19 MB
xet
Upload 2 files
over 1 year ago
sft.log
Safe
1.1 MB
Upload 6 files
over 1 year ago