Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Zhayr1
/
BitMamba-2-1B

Text Generation
JAX
English
bitmamba
bitnet
mamba
ssm
1.58-bit
ternary
efficient-inference
Model card Files Files and versions
xet
Community
BitMamba-2-1B
4.72 GB
  • 1 contributor
History: 7 commits
Zhayr1's picture
Zhayr1
Update README.md
30440c0 verified 3 days ago
  • bitmamba_cpp
    Initial commit: Upload BitMamba-1B model, weights and benchmarks 7 days ago
  • jax_weights
    Initial commit: Upload BitMamba-1B model, weights and benchmarks 7 days ago
  • .gitattributes
    1.75 kB
    Upload BitMamba-2: Efficient Scaling of 1.58-bit State Space Models.pdf 7 days ago
  • BitMamba-2: Efficient Scaling of 1.58-bit State Space Models.pdf
    726 kB
    xet
    Upload BitMamba-2: Efficient Scaling of 1.58-bit State Space Models.pdf 7 days ago
  • README.md
    3.11 kB
    Update README.md 3 days ago
  • config.json
    408 Bytes
    Initial commit: Upload BitMamba-1B model, weights and benchmarks 7 days ago
  • tokenizer.json
    3.56 MB
    Initial commit: Upload BitMamba-1B model, weights and benchmarks 7 days ago
  • tokenizer_config.json
    286 Bytes
    Initial commit: Upload BitMamba-1B model, weights and benchmarks 7 days ago
  • training_loss_1b.png
    189 kB
    xet
    Initial commit: Upload BitMamba-1B model, weights and benchmarks 7 days ago