---
library_name: transformers
datasets:
- monology/pile-uncopyrighted
base_model:
- state-spaces/mamba-2.8b-hf
pipeline_tag: text-generation
---

# MambaConstrict TD

A MambaConstrict model trained with temporal difference regularization, using a regularization weight of $\lambda = 0.0001$.
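
This card does not define the regularizer, so the following is only a minimal sketch of one plausible reading: a penalty on the squared difference between consecutive state vectors, scaled by $\lambda$ and added to the task loss. The function names `td_penalty` and `regularized_loss`, the choice of states, and the penalty form itself are assumptions made for illustration, not the confirmed training objective.

```python
LAMBDA = 1e-4  # regularization weight stated in this card


def td_penalty(states):
    """Mean squared difference between consecutive state vectors.

    `states` is a list of equal-length lists of floats, one per time step.
    The form of this penalty is an assumption, not taken from the card.
    """
    total = 0.0
    count = 0
    for prev, curr in zip(states, states[1:]):
        for a, b in zip(prev, curr):
            total += (b - a) ** 2
            count += 1
    # Fewer than two steps (or empty vectors) means there is nothing to penalize.
    return total / count if count else 0.0


def regularized_loss(task_loss, states, lam=LAMBDA):
    """Task loss plus the weighted penalty (hypothetical combination)."""
    return task_loss + lam * td_penalty(states)
```

With a weight this small, the penalty only nudges the objective: it changes the loss noticeably only when consecutive states differ by a large amount.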

For inquiries, please contact teilers@student.ethz.ch.