DE_models
Qwen/Qwen1.5-MoE-A2.7B • Text Generation • 14B • Updated Apr 18, 2024 • 201k • 227
deepseek-ai/DeepSeek-V2-Lite • Text Generation • 16B • Updated Jun 25, 2024 • 445k • 180
mistralai/Mixtral-8x7B-v0.1 • 47B • Updated Jul 24, 2025 • 46.5k • 1.81k

MoE
Open source MoE
IEITYuan/Yuan2-M32-hf • Text Generation • Updated May 30, 2024 • 795 • 62
allenai/OLMoE-1B-7B-0924 • Text Generation • 7B • Updated Oct 19, 2024 • 125k • 145
microsoft/Phi-3.5-MoE-instruct • Text Generation • 42B • Updated Dec 10, 2025 • 141k • 574
Qwen/Qwen1.5-MoE-A2.7B • Text Generation • 14B • Updated Apr 18, 2024 • 201k • 227

LGViT
The checkpoints of LGViT. Paper link: https://arxiv.org/abs/2308.00255
LGViT: Dynamic Early Exiting for Accelerating Vision Transformer • Paper • 2308.00255 • Published Aug 1, 2023
FALcon6/LGViT-ViT-Cifar100 • Image Classification • Updated Oct 30, 2024 • 26
FALcon6/LGViT-DeiT-Cifar100 • Image Classification • Updated Oct 30, 2024 • 20
FALcon6/LGViT-Swin-Cifar100 • Image Classification • Updated Oct 30, 2024 • 21