nvidia/mamba2-8b-3t-4k
Tags: Text Generation, English, Megatron-LM, nvidia, Mamba, Mamba-2, SSM, 8B
arXiv: 2406.07887, 2405.21060
License: apache-2.0
Branch: main (mamba2-8b-3t-4k, 16.5 GB)
1 contributor, history: 2 commits
Latest commit: b915550, "Upload model", by rwaleffe, almost 2 years ago
Files:
release (folder): last commit "Upload model", almost 2 years ago
.gitattributes: 1.52 kB, Safe, last commit "initial commit", almost 2 years ago
README.md: 2.16 kB, Safe, last commit "Upload model", almost 2 years ago
latest_checkpointed_iteration.txt: 8 Bytes, Safe, last commit "Upload model", almost 2 years ago
mt_nlg_plus_multilingual_ja_zh_the_stack_frac_015_256k.model: 4.57 MB, Safe, stored with xet, last commit "Upload model", almost 2 years ago