Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
zen-E
/
SSA-1B
like
0
Safetensors
EleutherAI/SmolLM2-135M-100B
English
llama-nsa
sparse_attention
pretrain
custom_code
arxiv:
2511.20102
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
main
SSA-1B
2.97 GB
1 contributor
History:
9 commits
zen-E
Update README.md
e613e4b
verified
1 day ago
.gitattributes
1.57 kB
Upload 10 files
7 days ago
README.md
1.38 kB
Update README.md
1 day ago
config.json
908 Bytes
Upload 10 files
7 days ago
configuration_llama_nsa.py
12.4 kB
Update configuration_llama_nsa.py
7 days ago
generation_config.json
143 Bytes
Upload 10 files
7 days ago
model.safetensors
2.95 GB
xet
Upload 10 files
7 days ago
modeling_llama_nsa.py
25.7 kB
Update modeling_llama_nsa.py
7 days ago
special_tokens_map.json
335 Bytes
Upload 10 files
7 days ago
tokenizer.json
17.2 MB
xet
Upload 10 files
7 days ago
tokenizer_config.json
50.6 kB
Upload 10 files
7 days ago