Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
pomilon-lab
/
Aetheris-MoE-300M-A125M-base
like
0
Follow
Pomilon Intelligence Lab
2
Text Generation
PyTorch
cerebras/SlimPajama-627B
English
mamba
Mixture of Experts
hybrid-architecture
causal-lm
experimental
License:
mit
Model card
Files
Files and versions
xet
Community
main
Aetheris-MoE-300M-A125M-base
/
checkpoints
14.1 GB
1 contributor
History:
4 commits
Pomilon
Upload folder using huggingface_hub
3080185
verified
3 months ago
checkpoint_10000_step.pth
pickle
Detected Pickle imports (3)
"torch._utils._rebuild_tensor_v2"
,
"torch.FloatStorage"
,
"collections.OrderedDict"
What is a pickle import?
3.53 GB
xet
Upload folder using huggingface_hub
3 months ago
checkpoint_11000_step.pth
pickle
Detected Pickle imports (3)
"torch._utils._rebuild_tensor_v2"
,
"collections.OrderedDict"
,
"torch.FloatStorage"
What is a pickle import?
3.53 GB
xet
Upload folder using huggingface_hub
3 months ago
checkpoint_12000.pth
pickle
Detected Pickle imports (3)
"collections.OrderedDict"
,
"torch._utils._rebuild_tensor_v2"
,
"torch.FloatStorage"
What is a pickle import?
3.53 GB
xet
Upload folder using huggingface_hub
3 months ago
checkpoint_17000_step.pth
pickle
Detected Pickle imports (3)
"collections.OrderedDict"
,
"torch.FloatStorage"
,
"torch._utils._rebuild_tensor_v2"
What is a pickle import?
3.53 GB
xet
Upload folder using huggingface_hub
3 months ago