Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
batteryphil
/
mamba-2.8b-latent
like
3
Text Generation
Safetensors
English
mamba
state-space-model
latent-reasoning
reinforcement-learning
mathematical-logic
oo-domain
lora
proprioception
arxiv:
6 papers
License:
mit
Model card
Files
Files and versions
xet
Community
main
mamba-2.8b-latent
5.56 GB
Ctrl+K
Ctrl+K
1 contributor
History:
19 commits
batteryphil
Phase 10: benchmark_pre_hf.py
438b6bd
verified
4 days ago
.gitattributes
Safe
1.52 kB
initial commit
15 days ago
README.md
Safe
9.37 kB
Phase 10: README.md
4 days ago
benchmark_pre_hf.py
25.6 kB
Phase 10: benchmark_pre_hf.py
4 days ago
config.json
Safe
920 Bytes
Upload mamba-2.8b-latent engine (Phase 1-7 pipeline)
15 days ago
engine_manifest.json
Safe
1.11 kB
Upload mamba-2.8b-latent engine (Phase 1-7 pipeline)
15 days ago
generation_config.json
Safe
131 Bytes
Upload mamba-2.8b-latent engine (Phase 1-7 pipeline)
15 days ago
halting_head.pt
Safe
pickle
Detected Pickle imports (3)
"torch.FloatStorage"
,
"torch._utils._rebuild_tensor_v2"
,
"collections.OrderedDict"
What is a pickle import?
5.38 MB
xet
Upload mamba-2.8b-latent engine (Phase 1-7 pipeline)
15 days ago
halting_head_v2.pt
pickle
Detected Pickle imports (3)
"torch.FloatStorage"
,
"torch._utils._rebuild_tensor_v2"
,
"collections.OrderedDict"
What is a pickle import?
5.38 MB
xet
Phase 10: halting_head_v2.pt
4 days ago
lora_mamba.py
4.28 kB
Phase 10: lora_mamba.py
4 days ago
lora_oo_r16_final.pt
pickle
Detected Pickle imports (3)
"collections.OrderedDict"
,
"torch._utils._rebuild_tensor_v2"
,
"torch.FloatStorage"
What is a pickle import?
2.1 MB
xet
Phase 10: lora_oo_r16_final.pt
4 days ago
model.safetensors
Safe
5.55 GB
xet
Upload model.safetensors with huggingface_hub
12 days ago
proprio_gate_2.8b.pt
pickle
Detected Pickle imports (3)
"torch.BFloat16Storage"
,
"collections.OrderedDict"
,
"torch._utils._rebuild_tensor_v2"
What is a pickle import?
17.1 kB
xet
Phase 10: proprio_gate_2.8b.pt
4 days ago
proprioception_gate.py
3.6 kB
Phase 10: proprioception_gate.py
4 days ago
run.py
Safe
12 kB
Add CPU mode: native HF Mamba, no mamba-ssm required, float32 fallback
15 days ago
tokenizer.json
Safe
3.56 MB
Upload tokenizer
13 days ago
tokenizer_config.json
Safe
410 Bytes
Upload tokenizer
12 days ago
training_args.bin
pickle
Detected Pickle imports (10)
"transformers.training_args.OptimizerNames"
,
"transformers.trainer_utils.SchedulerType"
,
"transformers.trainer_pt_utils.AcceleratorConfig"
,
"torch.device"
,
"transformers.trainer_utils.HubStrategy"
,
"transformers.training_args.TrainingArguments"
,
"transformers.trainer_utils.IntervalStrategy"
,
"accelerate.state.PartialState"
,
"accelerate.utils.dataclasses.DistributedType"
,
"transformers.trainer_utils.SaveStrategy"
How to fix it?
5.2 kB
xet
Upload mamba-2.8b-latent engine (Phase 1-7 pipeline)
15 days ago