Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up

Youwongai
/
theo-ultimate

Text Generation
English
custom
theo
y-ai
hymba
mamba
mamba3
ssm
Mixture of Experts
mixture-of-experts
hybrid
recurrent
conversational
chatbot
bfloat16
Model card Files Files and versions
xet
Community
theo-ultimate
22.2 GB
Ctrl+K
Ctrl+K
  • 1 contributor
History: 25 commits
Youwongai's picture
Youwongai
Cell5: Theo surgery on Qwen3.5-0.8B
8d22c48 verified about 20 hours ago
  • checkpoints
    Upload checkpoints/theo_epoch10.pt with huggingface_hub about 22 hours ago
  • .gitattributes
    1.57 kB
    Cell5: Theo surgery on Qwen3.5-0.8B about 20 hours ago
  • README.md
    6.9 kB
    Create README.md about 22 hours ago
  • chat_template.jinja
    7.76 kB
    Cell5: Theo surgery on Qwen3.5-0.8B about 20 hours ago
  • config.json
    398 Bytes
    Cell5: Theo surgery on Qwen3.5-0.8B about 20 hours ago
  • corpus.txt
    23.2 kB
    Upload corpus.txt with huggingface_hub about 22 hours ago
  • theo_best.pt

    Detected Pickle imports (3)

    • "torch.BFloat16Storage",
    • "torch._utils._rebuild_tensor_v2",
    • "collections.OrderedDict"

    What is a pickle import?

    2.24 GB
    xet
    Cell5: Theo surgery on Qwen3.5-0.8B about 20 hours ago
  • theo_final.pt

    Detected Pickle imports (3)

    • "collections.OrderedDict",
    • "torch.BFloat16Storage",
    • "torch._utils._rebuild_tensor_v2"

    What is a pickle import?

    1.61 GB
    xet
    Upload theo_final.pt with huggingface_hub about 22 hours ago
  • theo_qwen_surgery.pt

    Detected Pickle imports (3)

    • "collections.OrderedDict",
    • "torch._utils._rebuild_tensor_v2",
    • "torch.BFloat16Storage"

    What is a pickle import?

    2.24 GB
    xet
    Cell5: Theo surgery on Qwen3.5-0.8B about 20 hours ago
  • tokenizer.json
    20 MB
    xet
    Cell5: Theo surgery on Qwen3.5-0.8B about 20 hours ago
  • tokenizer_config.json
    1.13 kB
    Cell5: Theo surgery on Qwen3.5-0.8B about 20 hours ago