Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Zhayr1
/
BitMamba-2-1B
like
6
Text Generation
JAX
HuggingFaceFW/fineweb-edu
bigcode/the-stack-dedup
HuggingFaceTB/cosmopedia
English
bitmamba
bitnet
mamba
ssm
1.58-bit
ternary
efficient-inference
License:
mit
Model card
Files
Files and versions
xet
Community
main
BitMamba-2-1B
4.72 GB
1 contributor
History:
7 commits
Zhayr1
Update README.md
30440c0
verified
3 days ago
bitmamba_cpp
Initial commit: Upload BitMamba-1B model, weights and benchmarks
7 days ago
jax_weights
Initial commit: Upload BitMamba-1B model, weights and benchmarks
7 days ago
.gitattributes
1.75 kB
Upload BitMamba-2: Efficient Scaling of 1.58-bit State Space Models.pdf
7 days ago
BitMamba-2: Efficient Scaling of 1.58-bit State Space Models.pdf
726 kB
xet
Upload BitMamba-2: Efficient Scaling of 1.58-bit State Space Models.pdf
7 days ago
README.md
3.11 kB
Update README.md
3 days ago
config.json
408 Bytes
Initial commit: Upload BitMamba-1B model, weights and benchmarks
7 days ago
tokenizer.json
3.56 MB
Initial commit: Upload BitMamba-1B model, weights and benchmarks
7 days ago
tokenizer_config.json
286 Bytes
Initial commit: Upload BitMamba-1B model, weights and benchmarks
7 days ago
training_loss_1b.png
189 kB
xet
Initial commit: Upload BitMamba-1B model, weights and benchmarks
7 days ago