Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
serda-dev
/
mamba-370m-hf-turkish
like
0
Text Generation
Transformers
Safetensors
Turkish
English
mamba
state-space-model
ssm
causal-lm
continued-pretraining
text-generation-inference
arxiv:
2312.00752
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
mamba-370m-hf-turkish
2.24 GB
Ctrl+K
Ctrl+K
1 contributor
History:
9 commits
serda-dev
Update README.md
d84ec78
verified
28 days ago
.gitattributes
Safe
1.52 kB
initial commit
about 2 months ago
README.md
8.06 kB
Update README.md
28 days ago
config.json
Safe
931 Bytes
new trained tensors.
28 days ago
generation_config.json
Safe
132 Bytes
new trained tensors.
28 days ago
model.safetensors
746 MB
xet
new trained tensors.
28 days ago
tokenizer.json
Safe
3.56 MB
Upload tokenizer
about 2 months ago
tokenizer_config.json
Safe
391 Bytes
Upload tokenizer
about 2 months ago
training_state.pt
pickle
Detected Pickle imports (4)
"collections.OrderedDict"
,
"torch._utils._rebuild_tensor_v2"
,
"torch.BFloat16Storage"
,
"torch.FloatStorage"
What is a pickle import?
1.49 GB
xet
new trained tensors.
28 days ago