bd4sur/Nano-168M
License: mit
Branch: main (18.2 GB)
1 contributor
History: 27 commits
Latest commit: bd4sur, "Upload nano_168m_625000_sft_947000_q80.bin" (88a40f5, verified), 4 months ago
| File | Size | Badges | Last commit | Updated |
| --- | --- | --- | --- | --- |
| .gitattributes | 1.52 kB | Safe | initial commit | over 1 year ago |
| README.md | 24 Bytes | Safe | initial commit | over 1 year ago |
| config_nano_168m_625000_sft_875000_20241220.json | 999 Bytes | Safe | Upload 2 files | about 1 year ago |
| config_nano_168m_625000_sft_875000_amateur_radio_890000.json | 999 Bytes | Safe | Upload config_nano_168m_625000_sft_875000_amateur_radio_890000.json | over 1 year ago |
| config_pretrain.json | 998 Bytes | Safe | Upload 6 files | over 1 year ago |
| config_sft.json | 921 Bytes | Safe | Upload 6 files | over 1 year ago |
| deepseek-r1-qwen25-1b5.bin | 7.18 GB | xet | Upload deepseek-r1-qwen25-1b5.bin | about 1 year ago |
| deepseek_qwen25_tokenizer.bin | 2.19 MB | xet | Upload deepseek_qwen25_tokenizer.bin | about 1 year ago |
| nano_168m_625000.pt | 2.05 GB | xet | Upload nano_168m_625000.pt | over 1 year ago |
| nano_168m_625000_sft_20241220.log | 4.88 MB | Safe | Upload 2 files | about 1 year ago |
| nano_168m_625000_sft_786000.bin | 674 MB | xet | Upload 2 files | over 1 year ago |
| nano_168m_625000_sft_786000.pt | 2.05 GB | pickle, xet | Upload 2 files | over 1 year ago |
| nano_168m_625000_sft_875000_amateur_radio_890000.bin | 674 MB | xet | Upload nano_168m_625000_sft_875000_amateur_radio_890000.bin | over 1 year ago |
| nano_168m_625000_sft_947000.pt | 2.05 GB | pickle, xet | Upload nano_168m_625000_sft_947000.pt | about 1 year ago |
| nano_168m_625000_sft_947000_2508.bin | 674 MB | xet | Upload 2 files | 4 months ago |
| nano_168m_625000_sft_947000_2512.bin | 674 MB | xet | Upload 2 files | 4 months ago |
| nano_168m_625000_sft_947000_q80.bin | 174 MB | xet | Upload nano_168m_625000_sft_947000_q80.bin | 4 months ago |
| nano_168m_pt_1130.log | 9.4 MB | Safe | Upload nano_168m_pt_1130.log | over 1 year ago |
| qwen25-0b5-instruct.bin | 1.98 GB | xet | Upload qwen25-0b5-instruct.bin | 11 months ago |
| qwen25-tokenizer.bin | 2.19 MB | xet | Upload 2 files | about 1 year ago |
| sft.log | 1.1 MB | Safe | Upload 6 files | over 1 year ago |

Detected Pickle imports (5) in nano_168m_625000_sft_786000.pt: "torch.FloatStorage", "model.ModelConfig", "collections.OrderedDict", "torch._utils._rebuild_tensor_v2", "model.TrainConfig"

Detected Pickle imports (5) in nano_168m_625000_sft_947000.pt: "torch.FloatStorage", "torch._utils._rebuild_tensor_v2", "model.TrainConfig", "model.ModelConfig", "collections.OrderedDict"