Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
O2iginal
/
dsv3_0.5b
like
0
yulanmini
hybrid
mamba
Model card
Files
Files and versions
xet
Community
Copy to bucket
new
e9efaa5
dsv3_0.5b
753 MB
Ctrl+K
Ctrl+K
1 contributor
History:
13 commits
O2iginal
Upload __19_1.distcp to dsv3_0.5b
e9efaa5
verified
8 months ago
.gitattributes
1.82 kB
Upload __19_1.distcp to dsv3_0.5b
8 months ago
LOG_NODE_RANK_3.log
Safe
5.55 MB
Upload LOG_NODE_RANK_3.log to dsv3_0.5b
8 months ago
README.md
Safe
348 Bytes
Add model card with tags for dsv3_0.5b
8 months ago
__11_0.distcp
125 MB
xet
Upload __11_0.distcp to dsv3_0.5b
8 months ago
__17_0.distcp
124 MB
xet
Upload __17_0.distcp to dsv3_0.5b
8 months ago
__19_1.distcp
124 MB
xet
Upload __19_1.distcp to dsv3_0.5b
8 months ago
__20_0.distcp
124 MB
xet
Upload __20_0.distcp to dsv3_0.5b
8 months ago
__2_0.distcp
125 MB
xet
Upload __2_0.distcp to dsv3_0.5b
8 months ago
__31_1.distcp
125 MB
xet
Upload __31_1.distcp to dsv3_0.5b
8 months ago
common.pt
pickle
Detected Pickle imports (6)
"megatron.core.transformer.enums.AttnBackend"
,
"megatron.core.enums.ModelType"
,
"torch.bfloat16"
,
"torch.float32"
,
"megatron.core.rerun_state_machine.RerunMode"
,
"argparse.Namespace"
How to fix it?
19 kB
xet
Upload common.pt to dsv3_0.5b
8 months ago
dsv3_0.5b_pretrain_template.sh
Safe
8.63 kB
Upload dsv3_0.5b_pretrain_template.sh to dsv3_0.5b
8 months ago
run_2node_dsv3_0.5b_pretrain.sh
Safe
2.07 kB
Upload run_2node_dsv3_0.5b_pretrain.sh to dsv3_0.5b
8 months ago
upload_status.json
Safe
781 Bytes
Upload upload_status.json to dsv3_0.5b
8 months ago