Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
bd4sur
/
Nano-168M
like
0
License:
mit
Model card
Files
Files and versions
xet
Community
main
Nano-168M
18.2 GB
1 contributor
History:
27 commits
bd4sur
Upload nano_168m_625000_sft_947000_q80.bin
88a40f5
verified
17 days ago
.gitattributes
1.52 kB
initial commit
about 1 year ago
README.md
24 Bytes
initial commit
about 1 year ago
config_nano_168m_625000_sft_875000_20241220.json
999 Bytes
Upload 2 files
12 months ago
config_nano_168m_625000_sft_875000_amateur_radio_890000.json
999 Bytes
Upload config_nano_168m_625000_sft_875000_amateur_radio_890000.json
12 months ago
config_pretrain.json
998 Bytes
Upload 6 files
about 1 year ago
config_sft.json
921 Bytes
Upload 6 files
about 1 year ago
deepseek-r1-qwen25-1b5.bin
7.18 GB
xet
Upload deepseek-r1-qwen25-1b5.bin
10 months ago
deepseek_qwen25_tokenizer.bin
2.19 MB
xet
Upload deepseek_qwen25_tokenizer.bin
10 months ago
nano_168m_625000.pt
2.05 GB
xet
Upload nano_168m_625000.pt
about 1 year ago
nano_168m_625000_sft_20241220.log
4.88 MB
Upload 2 files
12 months ago
nano_168m_625000_sft_786000.bin
674 MB
xet
Upload 2 files
12 months ago
nano_168m_625000_sft_786000.pt
2.05 GB
xet
Upload 2 files
12 months ago
nano_168m_625000_sft_875000_amateur_radio_890000.bin
674 MB
xet
Upload nano_168m_625000_sft_875000_amateur_radio_890000.bin
12 months ago
nano_168m_625000_sft_947000.pt
2.05 GB
xet
Upload nano_168m_625000_sft_947000.pt
12 months ago
nano_168m_625000_sft_947000_2508.bin
674 MB
xet
Upload 2 files
22 days ago
nano_168m_625000_sft_947000_2512.bin
674 MB
xet
Upload 2 files
22 days ago
nano_168m_625000_sft_947000_q80.bin
174 MB
xet
Upload nano_168m_625000_sft_947000_q80.bin
17 days ago
nano_168m_pt_1130.log
9.4 MB
Upload nano_168m_pt_1130.log
about 1 year ago
qwen25-0b5-instruct.bin
1.98 GB
xet
Upload qwen25-0b5-instruct.bin
8 months ago
qwen25-tokenizer.bin
2.19 MB
xet
Upload 2 files
12 months ago
sft.log
1.1 MB
Upload 6 files
about 1 year ago