Youwongai/theo-ultimate
Tags: Text Generation, English, custom, theo, y-ai, hymba, mamba, mamba3, ssm, Mixture of Experts, mixture-of-experts, hybrid, recurrent, conversational, chatbot, bfloat16
License: apache-2.0
main / theo-ultimate, 22.2 GB
1 contributor
History: 25 commits
Latest commit: Youwongai, "Cell5: Theo surgery on Qwen3.5-0.8B" (8d22c48, verified), about 20 hours ago
File | Scan | Size | Storage | Last commit | Updated
checkpoints/ | - | - | - | Upload checkpoints/theo_epoch10.pt with huggingface_hub | about 22 hours ago
.gitattributes | Safe | 1.57 kB | - | Cell5: Theo surgery on Qwen3.5-0.8B | about 20 hours ago
README.md | - | 6.9 kB | - | Create README.md | about 22 hours ago
chat_template.jinja | Safe | 7.76 kB | - | Cell5: Theo surgery on Qwen3.5-0.8B | about 20 hours ago
config.json | - | 398 Bytes | - | Cell5: Theo surgery on Qwen3.5-0.8B | about 20 hours ago
corpus.txt | - | 23.2 kB | - | Upload corpus.txt with huggingface_hub | about 22 hours ago
theo_best.pt | pickle | 2.24 GB | xet | Cell5: Theo surgery on Qwen3.5-0.8B | about 20 hours ago
theo_final.pt | pickle | 1.61 GB | xet | Upload theo_final.pt with huggingface_hub | about 22 hours ago
theo_qwen_surgery.pt | pickle | 2.24 GB | xet | Cell5: Theo surgery on Qwen3.5-0.8B | about 20 hours ago
tokenizer.json | Safe | 20 MB | xet | Cell5: Theo surgery on Qwen3.5-0.8B | about 20 hours ago
tokenizer_config.json | Safe | 1.13 kB | - | Cell5: Theo surgery on Qwen3.5-0.8B | about 20 hours ago

Detected pickle imports (3) in each of theo_best.pt, theo_final.pt, and theo_qwen_surgery.pt: "collections.OrderedDict", "torch.BFloat16Storage", "torch._utils._rebuild_tensor_v2"
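
The "Detected Pickle imports" entries in the listing name the module-level objects each .pt file's pickle stream refers to. As a rough illustration of where such a list can come from, the sketch below disassembles a pickle with the standard-library pickletools module and collects the names it references, without ever unpickling the data. This is not the Hub's scanner: the helper name list_pickle_imports is hypothetical, and the sketch resolves only GLOBAL opcodes (pickle protocols 0 to 3). A .pt file written by torch.save is, to my understanding, normally a zip archive whose pickle stream sits in a member named data.pkl, so that member would have to be extracted first; treat that as an assumption as well.

```python
import collections
import pickle
import pickletools


def list_pickle_imports(data: bytes) -> list[str]:
    """Return the sorted 'module.name' globals referenced by a pickle stream.

    The stream is only disassembled, never unpickled, so no code in it runs.
    Assumption: only GLOBAL opcodes (protocols 0-3) are resolved; protocol 4+
    emits STACK_GLOBAL, which this sketch does not handle.
    """
    imports = set()
    for opcode, arg, _pos in pickletools.genops(data):
        if opcode.name == "GLOBAL":
            # pickletools reports the GLOBAL argument as "module name".
            module, name = arg.split(" ", 1)
            imports.add(f"{module}.{name}")
    return sorted(imports)


if __name__ == "__main__":
    # Small stand-in payload; the real checkpoints are not read here.
    payload = pickle.dumps(
        collections.OrderedDict(a=1), protocol=2, fix_imports=False
    )
    print(list_pickle_imports(payload))
```

A short list limited to tensor-rebuilding helpers and OrderedDict, like the one shown for these three files, is what a plain state-dict checkpoint would be expected to produce. Loading such a file still executes the unpickler, so it is worth checking the list before loading a checkpoint from an untrusted source.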