Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

IvanHU
/
hyw_moe

Model card Files Files and versions
xet
Community
hyw_moe / model
24.5 GB
  • 1 contributor
History: 6 commits
IvanHU's picture
IvanHU
Upload folder using huggingface_hub
931e105 verified 7 months ago
  • dev-0.5b-q16-kv2-ep-16-sep--top2-bias-1e-3-bf16-ep4-mp2-pp1-lr-2e-3-minlr-7e-7-bs-1024-gpus-8-seqlen-8192
    Upload folder using huggingface_hub 7 months ago
  • dev-32e-dsv3-0.5b-q16-kv2-ep-32-sep-0-top2-cf-0-bias-1e-3-bf16-ep4-mp2-pp1-lr-2e-3-minlr-7e-7-bs-1024-gpus-8-seqlen-8192
    Upload folder using huggingface_hub 7 months ago
  • dev-4k-dsv3-0.5b-q16-kv2-ep-16-sep-0-top2-cf-0-bias-1e-3-bf16-ep4-mp2-pp1-lr-2e-3-minlr-7e-7-bs-1024-gpus-8-seqlen-4096
    Upload folder using huggingface_hub 7 months ago
  • dev-a3e-dsv3-0.5b-q16-kv2-ep-16-sep-0-top3-cf-0-bias-1e-3-bf16-ep4-mp2-pp1-lr-2e-3-minlr-7e-7-bs-1024-gpus-8-seqlen-8192
    Upload folder using huggingface_hub 7 months ago
  • dev-auxfree-0.5b-q10-kv2-ep-64-sep-2-top6-cf-0-bias-1e-3-bf16-ep8-mp2-pp1-lr-7.8e-4-minlr-7e-7-bs-1024-gpus-16-seqlen-8192
    Upload folder using huggingface_hub 7 months ago
  • dev-auxfree-0.5b-q10-kv2-ep-64-sep-2-top6-cf-0-bias-1e-3-bf16-ep8-mp2-pp1-lr-7.8e-4-minlr-7e-7-bs-1024-gpus-32-seqlen-8192
    Upload folder using huggingface_hub 7 months ago