Char LLMs inkoziev/charllama-35M Text Generation • Updated Sep 4, 2023 • 18 • 6 inkoziev/chargpt-96M Text Generation • Updated Sep 2, 2023 • 50 • 3 TencentARC/LLaMA-Pro-8B Text Generation • 8B • Updated Jan 8, 2024 • 868 • 170 BlinkDL/rwkv-5-world Text Generation • Updated Apr 3, 2024 • 270
LLMs tricks LLaMA Pro: Progressive LLaMA with Block Expansion Paper • 2401.02415 • Published Jan 4, 2024 • 54
RU Sentence Encoding ai-forever/sbert_large_mt_nlu_ru Feature Extraction • 0.4B • Updated Apr 17, 2025 • 1.16k • • 26 ai-forever/sbert_large_nlu_ru Feature Extraction • 0.4B • Updated Apr 18, 2025 • 43.8k • • 100
RU models Vikhrmodels/Vikhr-7b-0.1 Text Generation • 7B • Updated Mar 11, 2024 • 17 • 57 CohereLabs/aya-101 13B • Updated Sep 10, 2025 • 1.74k • 664 sambanovasystems/SambaLingo-Russian-Chat Text Generation • 7B • Updated Apr 16, 2024 • 53 • 54 ai-forever/sbert_large_nlu_ru Feature Extraction • 0.4B • Updated Apr 18, 2025 • 43.8k • • 100
Long Context LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens Paper • 2402.13753 • Published Feb 21, 2024 • 116
LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens Paper • 2402.13753 • Published Feb 21, 2024 • 116
Char LLMs inkoziev/charllama-35M Text Generation • Updated Sep 4, 2023 • 18 • 6 inkoziev/chargpt-96M Text Generation • Updated Sep 2, 2023 • 50 • 3 TencentARC/LLaMA-Pro-8B Text Generation • 8B • Updated Jan 8, 2024 • 868 • 170 BlinkDL/rwkv-5-world Text Generation • Updated Apr 3, 2024 • 270
RU Sentence Encoding ai-forever/sbert_large_mt_nlu_ru Feature Extraction • 0.4B • Updated Apr 17, 2025 • 1.16k • • 26 ai-forever/sbert_large_nlu_ru Feature Extraction • 0.4B • Updated Apr 18, 2025 • 43.8k • • 100
LLMs tricks LLaMA Pro: Progressive LLaMA with Block Expansion Paper • 2401.02415 • Published Jan 4, 2024 • 54
RU models Vikhrmodels/Vikhr-7b-0.1 Text Generation • 7B • Updated Mar 11, 2024 • 17 • 57 CohereLabs/aya-101 13B • Updated Sep 10, 2025 • 1.74k • 664 sambanovasystems/SambaLingo-Russian-Chat Text Generation • 7B • Updated Apr 16, 2024 • 53 • 54 ai-forever/sbert_large_nlu_ru Feature Extraction • 0.4B • Updated Apr 18, 2025 • 43.8k • • 100
Long Context LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens Paper • 2402.13753 • Published Feb 21, 2024 • 116
LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens Paper • 2402.13753 • Published Feb 21, 2024 • 116