Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
kamanphoebe
/
moe_surpass_dense
like
0
arxiv:
2506.12119
License:
mit
Model card
Files
Files and versions
xet
Community
main
moe_surpass_dense
893 GB
1 contributor
History:
132 commits
kamanphoebe
Upload 7B_strict_reuse/7B_strict_reuse_ar0_1563.pt with huggingface_hub
d1879ee
verified
8 months ago
2B_unique_data_fixAR
Upload 2B_unique_data_fixAR/2B_unique_data_fixAR_ar0_5079_D128B.pt with huggingface_hub
8 months ago
2B_unique_data_fixC
Upload 2B_unique_data_fixC/2B_unique_data_fixC_ar0_5773.pt with huggingface_hub
8 months ago
7B_loose_reuse
Upload 7B_loose_reuse/7B_loose_reuse_ar0_3007.pt with huggingface_hub
8 months ago
7B_strict_reuse
Upload 7B_strict_reuse/7B_strict_reuse_ar0_1563.pt with huggingface_hub
8 months ago
7B_unique_data_fixC
Rename 7B_unique_data_fixC/7B_unique_data_ar0_5338.pt to 7B_unique_data_fixC/7B_unique_data_fixC_ar0_5338.pt
8 months ago
7B_unique_data_fixD
Upload 7B_unique_data_fixD/7B_unique_data_fixD_ar0_5330.pt with huggingface_hub
8 months ago
SFT_models
Upload SFT_models/SFT_7B_strict_reuse_ar0_3007.pt with huggingface_hub
8 months ago
dense_baseline
Rename dense_baseline/2B_unique_data_fixAR_ar0_0874_D114B.pt to 2B_unique_data_fixAR/2B_unique_data_fixAR_ar0_0874_D114B.pt
8 months ago
gate_score_normalization
Upload gate_score_normalization/gate_score_normalization_E37_normN.pt with huggingface_hub
8 months ago
layer_arrangement
Upload layer_arrangement/layer_arrangement_1dense_H22_Dse1600.pt with huggingface_hub
8 months ago
model_shape_ratios
Upload model_shape_ratios/model_shape_ratios_Dm896_L37.pt with huggingface_hub
8 months ago
topK_setting
Upload topK_setting/topK_setting_33in88.pt with huggingface_hub
8 months ago
.gitattributes
1.52 kB
initial commit
8 months ago
README.md
1.23 kB
Update README.md
8 months ago