prometheus04/matilda-mini-v2

matilda-mini-v2 / scripts
14.5 kB
  • 2 contributors
History: 3 commits
Latest commit by prometheus04 (2b4c3b8, verified, about 1 month ago):
cleanup: drop abandoned 350M config and v1-only data prep
  • ablate.py
    7.08 kB
    v2: 363M hero run (Muon hybrid, WSD, Liger, SmolLM 75/15/10 mix) about 1 month ago
  • launch_vast.sh
    2.8 kB
    v1.5 pivot: 152M (18L x 768d) hero config, ReLU2 FFN, final logit soft-cap. 350M kept as reference. about 1 month ago
  • prepare_smollm_data.py
    4.63 kB
    v2: 363M hero run (Muon hybrid, WSD, Liger, SmolLM 75/15/10 mix) about 1 month ago