Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
gmongaras
/
medium_8192sl_gpu_64bs__mamba
like
0
Safetensors
llama
arxiv:
2602.17363
Model card
Files
Files and versions
xet
Community
main
medium_8192sl_gpu_64bs__mamba
8.19 GB
1 contributor
History:
3 commits
gmongaras
Create README.md
f3e45cc
verified
9 days ago
.gitattributes
1.52 kB
initial commit
24 days ago
README.md
528 Bytes
Create README.md
9 days ago
config.json
751 Bytes
Upload folder using huggingface_hub
24 days ago
config.pt
1.24 kB
xet
Upload folder using huggingface_hub
24 days ago
generation_config.json
111 Bytes
Upload folder using huggingface_hub
24 days ago
model.safetensors
2.73 GB
xet
Upload folder using huggingface_hub
24 days ago
optimizer.pt
5.46 GB
xet
Upload folder using huggingface_hub
24 days ago
scaler.pt
1.24 kB
xet
Upload folder using huggingface_hub
24 days ago
scheduler.pt
1 kB
xet
Upload folder using huggingface_hub
24 days ago
special_tokens_map.json
437 Bytes
Upload folder using huggingface_hub
24 days ago
tokenizer.model
500 kB
xet
Upload folder using huggingface_hub
24 days ago
tokenizer.pt
651 kB
xet
Upload folder using huggingface_hub
24 days ago
tokenizer_config.json
993 Bytes
Upload folder using huggingface_hub
24 days ago