File size: 301 Bytes
e3d3417
 
64b7ce2
 
 
 
 
e3d3417
 
64b7ce2
e3d3417
64b7ce2
e3d3417
64b7ce2
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
---
library_name: transformers
datasets:
- monology/pile-uncopyrighted
base_model:
- state-spaces/mamba-2.8b-hf
pipeline_tag: text-generation
---

# MambaConstrict

Best performing MambaConstrict model trained using $\ell_2$-norm regularization.

For inquiring, please contact teilers@student.ethz.ch