Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
jonathanjordan21
/
mos-mamba-6x130m-train
like
0
Text Generation
Transformers
Safetensors
MoSMamba
custom_code
arxiv:
1910.09700
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
mos-mamba-6x130m-train
795 MB
1 contributor
History:
8 commits
jonathanjordan21
Upload model
4b3eb9c
verified
over 1 year ago
.gitattributes
Safe
1.52 kB
initial commit
over 1 year ago
README.md
Safe
5.17 kB
Upload MoSMambaForCausalLM
over 1 year ago
adapter_config.json
Safe
782 Bytes
Upload model
over 1 year ago
adapter_model.safetensors
217 MB
xet
Upload model
over 1 year ago
config.json
Safe
1.27 kB
Upload MoSMambaForCausalLM
over 1 year ago
generation_config.json
Safe
132 Bytes
Upload MoSMambaForCausalLM
over 1 year ago
model.safetensors
Safe
576 MB
xet
Upload MoSMambaForCausalLM
over 1 year ago
special_tokens_map.json
Safe
587 Bytes
Upload tokenizer
over 1 year ago
tokenizer.json
Safe
2.11 MB
Upload tokenizer
over 1 year ago
tokenizer_config.json
Safe
4.95 kB
Upload tokenizer
over 1 year ago