Rethinking the Multilingual Reasoning Gap with Layer Swap Rethinking the Multilingual Reasoning Gap with Layer Swap Paper • 2605.26735 • Published 26 days ago • 3 lightonai/Dolci-Think-SFT-32B-Multilingual Viewer • Updated 24 days ago • 2.89M • 752 • 2 lightonai/Qwen3-8B-SW Text Generation • 8B • Updated 24 days ago • 92 lightonai/Qwen3-8B-FR Text Generation • 8B • Updated 24 days ago • 97
Rethinking the Multilingual Reasoning Gap with Layer Swap Paper • 2605.26735 • Published 26 days ago • 3
Luth x Qwen3 kurakurai/Luth-1.7B-Instruct Text Generation • 2B • Updated Oct 12, 2025 • 131 • • 15 kurakurai/Luth-0.6B-Instruct Text Generation • 0.6B • Updated Oct 12, 2025 • 645 • • 9 kurakurai/luth-sft Viewer • Updated Oct 12, 2025 • 571k • 297 • 15 kurakurai/Luth-1.7B-Instruct-GGUF Text Generation • 2B • Updated Aug 24, 2025 • 43 • 4
Splitformer MaxLSB/Splitformer Automatic Speech Recognition • Updated Jun 24, 2025 • 1 Splitformer: An improved early-exit architecture for automatic speech recognition on edge devices Paper • 2506.18035 • Published Jun 22, 2025
Splitformer: An improved early-exit architecture for automatic speech recognition on edge devices Paper • 2506.18035 • Published Jun 22, 2025
Luth x LFM2 kurakurai/Luth-LFM2-1.2B Text Generation • 1B • Updated Oct 12, 2025 • 205 • 26 kurakurai/Luth-LFM2-700M Text Generation • 0.7B • Updated Oct 12, 2025 • 149 • 16 kurakurai/Luth-LFM2-350M Text Generation • 0.4B • Updated Oct 12, 2025 • 98 • 16 kurakurai/Luth-LFM2-1.2B-GGUF Text Generation • 1B • Updated Aug 24, 2025 • 77 • 9
LeCarnet MaxLSB/LeCarnet Viewer • Updated Jul 29, 2025 • 2.03M • 138 MaxLSB/LeCarnet-3M Text Generation • 3.74M • Updated Jun 28, 2025 • 6 MaxLSB/LeCarnet-8M Text Generation • 8.53M • Updated Jun 28, 2025 • 4 MaxLSB/LeCarnet-21M Text Generation • 21.3M • Updated Jun 28, 2025 • 2
Rethinking the Multilingual Reasoning Gap with Layer Swap Rethinking the Multilingual Reasoning Gap with Layer Swap Paper • 2605.26735 • Published 26 days ago • 3 lightonai/Dolci-Think-SFT-32B-Multilingual Viewer • Updated 24 days ago • 2.89M • 752 • 2 lightonai/Qwen3-8B-SW Text Generation • 8B • Updated 24 days ago • 92 lightonai/Qwen3-8B-FR Text Generation • 8B • Updated 24 days ago • 97
Rethinking the Multilingual Reasoning Gap with Layer Swap Paper • 2605.26735 • Published 26 days ago • 3
Luth x LFM2 kurakurai/Luth-LFM2-1.2B Text Generation • 1B • Updated Oct 12, 2025 • 205 • 26 kurakurai/Luth-LFM2-700M Text Generation • 0.7B • Updated Oct 12, 2025 • 149 • 16 kurakurai/Luth-LFM2-350M Text Generation • 0.4B • Updated Oct 12, 2025 • 98 • 16 kurakurai/Luth-LFM2-1.2B-GGUF Text Generation • 1B • Updated Aug 24, 2025 • 77 • 9
Luth x Qwen3 kurakurai/Luth-1.7B-Instruct Text Generation • 2B • Updated Oct 12, 2025 • 131 • • 15 kurakurai/Luth-0.6B-Instruct Text Generation • 0.6B • Updated Oct 12, 2025 • 645 • • 9 kurakurai/luth-sft Viewer • Updated Oct 12, 2025 • 571k • 297 • 15 kurakurai/Luth-1.7B-Instruct-GGUF Text Generation • 2B • Updated Aug 24, 2025 • 43 • 4
LeCarnet MaxLSB/LeCarnet Viewer • Updated Jul 29, 2025 • 2.03M • 138 MaxLSB/LeCarnet-3M Text Generation • 3.74M • Updated Jun 28, 2025 • 6 MaxLSB/LeCarnet-8M Text Generation • 8.53M • Updated Jun 28, 2025 • 4 MaxLSB/LeCarnet-21M Text Generation • 21.3M • Updated Jun 28, 2025 • 2
Splitformer MaxLSB/Splitformer Automatic Speech Recognition • Updated Jun 24, 2025 • 1 Splitformer: An improved early-exit architecture for automatic speech recognition on edge devices Paper • 2506.18035 • Published Jun 22, 2025
Splitformer: An improved early-exit architecture for automatic speech recognition on edge devices Paper • 2506.18035 • Published Jun 22, 2025