Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
kamanphoebe
/
moe_surpass_dense
like
0
arxiv:
2506.12119
License:
mit
Model card
Files
Files and versions
xet
Community
main
moe_surpass_dense
/
SFT_models
177 GB
1 contributor
History:
13 commits
kamanphoebe
Upload SFT_models/SFT_7B_strict_reuse_ar0_3007.pt with huggingface_hub
a5d5853
verified
8 months ago
SFT_7B_dense_baseline_130B.pt
pickle
Detected Pickle imports (9)
"_codecs.encode"
,
"numpy.ndarray"
,
"numpy.core.multiarray._reconstruct"
,
"torch.ByteStorage"
,
"numpy.dtype"
,
"torch.bfloat16"
,
"collections.OrderedDict"
,
"torch._utils._rebuild_tensor_v2"
,
"torch.BFloat16Storage"
How to fix it?
14 GB
xet
Upload SFT_models/SFT_7B_dense_baseline_130B.pt with huggingface_hub
8 months ago
SFT_7B_dense_baseline_68B.pt
pickle
Detected Pickle imports (9)
"numpy.dtype"
,
"torch.BFloat16Storage"
,
"torch.ByteStorage"
,
"collections.OrderedDict"
,
"numpy.core.multiarray._reconstruct"
,
"torch.bfloat16"
,
"_codecs.encode"
,
"torch._utils._rebuild_tensor_v2"
,
"numpy.ndarray"
How to fix it?
14 GB
xet
Upload SFT_models/SFT_7B_dense_baseline_68B.pt with huggingface_hub
8 months ago
SFT_7B_strict_reuse_ar0_1119.pt
pickle
Detected Pickle imports (9)
"torch._utils._rebuild_tensor_v2"
,
"numpy.ndarray"
,
"numpy.dtype"
,
"collections.OrderedDict"
,
"torch.BFloat16Storage"
,
"_codecs.encode"
,
"torch.ByteStorage"
,
"numpy.core.multiarray._reconstruct"
,
"torch.bfloat16"
How to fix it?
13.6 GB
xet
Upload SFT_models/SFT_7B_strict_reuse_ar0_1119.pt with huggingface_hub
8 months ago
SFT_7B_strict_reuse_ar0_1563.pt
pickle
Detected Pickle imports (9)
"torch.bfloat16"
,
"collections.OrderedDict"
,
"numpy.core.multiarray._reconstruct"
,
"torch.BFloat16Storage"
,
"numpy.dtype"
,
"torch.ByteStorage"
,
"torch._utils._rebuild_tensor_v2"
,
"_codecs.encode"
,
"numpy.ndarray"
How to fix it?
13.6 GB
xet
Upload SFT_models/SFT_7B_strict_reuse_ar0_1563.pt with huggingface_hub
8 months ago
SFT_7B_strict_reuse_ar0_2007.pt
pickle
Detected Pickle imports (9)
"torch._utils._rebuild_tensor_v2"
,
"numpy.ndarray"
,
"numpy.dtype"
,
"collections.OrderedDict"
,
"torch.BFloat16Storage"
,
"_codecs.encode"
,
"torch.ByteStorage"
,
"numpy.core.multiarray._reconstruct"
,
"torch.bfloat16"
How to fix it?
13.6 GB
xet
Upload SFT_models/SFT_7B_strict_reuse_ar0_2007.pt with huggingface_hub
8 months ago
SFT_7B_strict_reuse_ar0_2618.pt
pickle
Detected Pickle imports (9)
"numpy.dtype"
,
"torch.bfloat16"
,
"collections.OrderedDict"
,
"numpy.core.multiarray._reconstruct"
,
"torch._utils._rebuild_tensor_v2"
,
"_codecs.encode"
,
"torch.BFloat16Storage"
,
"numpy.ndarray"
,
"torch.ByteStorage"
How to fix it?
13.6 GB
xet
Upload SFT_models/SFT_7B_strict_reuse_ar0_2618.pt with huggingface_hub
8 months ago
SFT_7B_strict_reuse_ar0_3007.pt
pickle
Detected Pickle imports (9)
"collections.OrderedDict"
,
"torch.bfloat16"
,
"_codecs.encode"
,
"numpy.core.multiarray._reconstruct"
,
"numpy.dtype"
,
"torch._utils._rebuild_tensor_v2"
,
"torch.ByteStorage"
,
"numpy.ndarray"
,
"torch.BFloat16Storage"
How to fix it?
13.6 GB
xet
Upload SFT_models/SFT_7B_strict_reuse_ar0_3007.pt with huggingface_hub
8 months ago
SFT_7B_unique_data_ar0_1119.pt
pickle
Detected Pickle imports (9)
"numpy.core.multiarray._reconstruct"
,
"torch._utils._rebuild_tensor_v2"
,
"torch.bfloat16"
,
"numpy.ndarray"
,
"torch.BFloat16Storage"
,
"_codecs.encode"
,
"torch.ByteStorage"
,
"collections.OrderedDict"
,
"numpy.dtype"
How to fix it?
13.6 GB
xet
Upload SFT_models/SFT_7B_unique_data_ar0_1119.pt with huggingface_hub
8 months ago
SFT_7B_unique_data_ar0_1563.pt
pickle
Detected Pickle imports (9)
"torch.ByteStorage"
,
"numpy.core.multiarray._reconstruct"
,
"torch.bfloat16"
,
"numpy.dtype"
,
"numpy.ndarray"
,
"torch.BFloat16Storage"
,
"torch._utils._rebuild_tensor_v2"
,
"collections.OrderedDict"
,
"_codecs.encode"
How to fix it?
13.6 GB
xet
Upload SFT_models/SFT_7B_unique_data_ar0_1563.pt with huggingface_hub
8 months ago
SFT_7B_unique_data_ar0_2007.pt
pickle
Detected Pickle imports (9)
"collections.OrderedDict"
,
"_codecs.encode"
,
"torch.bfloat16"
,
"numpy.core.multiarray._reconstruct"
,
"torch.BFloat16Storage"
,
"numpy.dtype"
,
"torch._utils._rebuild_tensor_v2"
,
"numpy.ndarray"
,
"torch.ByteStorage"
How to fix it?
13.6 GB
xet
Upload SFT_models/SFT_7B_unique_data_ar0_2007.pt with huggingface_hub
8 months ago
SFT_7B_unique_data_ar0_2618.pt
pickle
Detected Pickle imports (9)
"numpy.core.multiarray._reconstruct"
,
"collections.OrderedDict"
,
"torch._utils._rebuild_tensor_v2"
,
"numpy.ndarray"
,
"_codecs.encode"
,
"torch.BFloat16Storage"
,
"torch.ByteStorage"
,
"torch.bfloat16"
,
"numpy.dtype"
How to fix it?
13.6 GB
xet
Upload SFT_models/SFT_7B_unique_data_ar0_2618.pt with huggingface_hub
8 months ago
SFT_7B_unique_data_ar0_3007.pt
pickle
Detected Pickle imports (9)
"numpy.dtype"
,
"torch.ByteStorage"
,
"numpy.core.multiarray._reconstruct"
,
"collections.OrderedDict"
,
"_codecs.encode"
,
"torch.BFloat16Storage"
,
"numpy.ndarray"
,
"torch.bfloat16"
,
"torch._utils._rebuild_tensor_v2"
How to fix it?
13.6 GB
xet
Upload SFT_models/SFT_7B_unique_data_ar0_3007.pt with huggingface_hub
8 months ago
SFT_7B_unique_data_ar0_5338.pt
pickle
Detected Pickle imports (9)
"collections.OrderedDict"
,
"numpy.dtype"
,
"torch.BFloat16Storage"
,
"torch._utils._rebuild_tensor_v2"
,
"torch.bfloat16"
,
"numpy.ndarray"
,
"torch.ByteStorage"
,
"numpy.core.multiarray._reconstruct"
,
"_codecs.encode"
How to fix it?
13.6 GB
xet
Upload SFT_models/SFT_7B_unique_data_ar0_5338.pt with huggingface_hub
8 months ago