Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

O2iginal
/
dsv3_0.5b

yulanmini
hybrid
mamba
Model card Files Files and versions
xet
Community
dsv3_0.5b
628 MB
  • 1 contributor
History: 12 commits
O2iginal's picture
O2iginal
Upload __17_0.distcp to dsv3_0.5b
ec168a9 verified 5 months ago
  • .gitattributes
    1.77 kB
    Upload __17_0.distcp to dsv3_0.5b 5 months ago
  • LOG_NODE_RANK_3.log
    5.55 MB
    Upload LOG_NODE_RANK_3.log to dsv3_0.5b 5 months ago
  • README.md
    348 Bytes
    Add model card with tags for dsv3_0.5b 5 months ago
  • __11_0.distcp
    125 MB
    xet
    Upload __11_0.distcp to dsv3_0.5b 5 months ago
  • __17_0.distcp
    124 MB
    xet
    Upload __17_0.distcp to dsv3_0.5b 5 months ago
  • __20_0.distcp
    124 MB
    xet
    Upload __20_0.distcp to dsv3_0.5b 5 months ago
  • __2_0.distcp
    125 MB
    xet
    Upload __2_0.distcp to dsv3_0.5b 5 months ago
  • __31_1.distcp
    125 MB
    xet
    Upload __31_1.distcp to dsv3_0.5b 5 months ago
  • common.pt

    Detected Pickle imports (6)

    • "megatron.core.transformer.enums.AttnBackend",
    • "megatron.core.enums.ModelType",
    • "torch.bfloat16",
    • "torch.float32",
    • "megatron.core.rerun_state_machine.RerunMode",
    • "argparse.Namespace"

    How to fix it?

    19 kB
    xet
    Upload common.pt to dsv3_0.5b 5 months ago
  • dsv3_0.5b_pretrain_template.sh
    8.63 kB
    Upload dsv3_0.5b_pretrain_template.sh to dsv3_0.5b 5 months ago
  • run_2node_dsv3_0.5b_pretrain.sh
    2.07 kB
    Upload run_2node_dsv3_0.5b_pretrain.sh to dsv3_0.5b 5 months ago
  • upload_status.json
    781 Bytes
    Upload upload_status.json to dsv3_0.5b 5 months ago