Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
O2iginal
/
dsv3_0.5b
like
0
yulanmini
hybrid
mamba
Model card
Files
Files and versions
xet
Community
bebc181
dsv3_0.5b
1 GB
1 contributor
History:
15 commits
O2iginal
Upload __25_0.distcp to dsv3_0.5b
bebc181
verified
5 months ago
.gitattributes
1.92 kB
Upload __25_0.distcp to dsv3_0.5b
5 months ago
LOG_NODE_RANK_3.log
Safe
5.55 MB
Upload LOG_NODE_RANK_3.log to dsv3_0.5b
5 months ago
README.md
Safe
348 Bytes
Add model card with tags for dsv3_0.5b
5 months ago
__11_0.distcp
125 MB
xet
Upload __11_0.distcp to dsv3_0.5b
5 months ago
__17_0.distcp
124 MB
xet
Upload __17_0.distcp to dsv3_0.5b
5 months ago
__19_1.distcp
Safe
124 MB
xet
Upload __19_1.distcp to dsv3_0.5b
5 months ago
__20_0.distcp
Safe
124 MB
xet
Upload __20_0.distcp to dsv3_0.5b
5 months ago
__25_0.distcp
Safe
124 MB
xet
Upload __25_0.distcp to dsv3_0.5b
5 months ago
__2_0.distcp
Safe
125 MB
xet
Upload __2_0.distcp to dsv3_0.5b
5 months ago
__31_1.distcp
Safe
125 MB
xet
Upload __31_1.distcp to dsv3_0.5b
5 months ago
__6_0.distcp
Safe
125 MB
xet
Upload __6_0.distcp to dsv3_0.5b
5 months ago
common.pt
pickle
Detected Pickle imports (6)
"megatron.core.transformer.enums.AttnBackend"
,
"megatron.core.enums.ModelType"
,
"torch.bfloat16"
,
"torch.float32"
,
"megatron.core.rerun_state_machine.RerunMode"
,
"argparse.Namespace"
How to fix it?
19 kB
xet
Upload common.pt to dsv3_0.5b
5 months ago
dsv3_0.5b_pretrain_template.sh
Safe
8.63 kB
Upload dsv3_0.5b_pretrain_template.sh to dsv3_0.5b
5 months ago
run_2node_dsv3_0.5b_pretrain.sh
Safe
2.07 kB
Upload run_2node_dsv3_0.5b_pretrain.sh to dsv3_0.5b
5 months ago
upload_status.json
Safe
781 Bytes
Upload upload_status.json to dsv3_0.5b
5 months ago