Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
DevHunterAI
/
HSSM-v2-250M
like
1
Text Generation
PyTorch
HuggingFaceFW/fineweb-edu
English
hssm-v2
hierarchical-state-space-model
mixture-of-experts
autoregressive
fineweb-edu
250m-parameters
0.25B
Model card
Files
Files and versions
xet
Community
242bbe4
HSSM-v2-250M
780 kB
Ctrl+K
Ctrl+K
1 contributor
History:
5 commits
DevHunterAI
Upload hssm_pretrained_chat.py with huggingface_hub
242bbe4
verified
about 1 month ago
.gitattributes
1.58 kB
Upload HSSM_v2_architecture.png with huggingface_hub
about 1 month ago
HSSM_v2_architecture.png
728 kB
xet
Upload HSSM_v2_architecture.png with huggingface_hub
about 1 month ago
README.md
6.03 kB
Upload README.md with huggingface_hub
about 1 month ago
hssm_pretrained_chat.py
27.8 kB
Upload hssm_pretrained_chat.py with huggingface_hub
about 1 month ago
hssm_v2_gpu_pretrain.py
17 kB
Upload hssm_v2_gpu_pretrain.py with huggingface_hub
about 1 month ago