view article Article How I contributed a new model to the Transformers library using Codex nielsr β’ Mar 30 β’ 51
π€ SmolLM2 Automatic Essay Grading Collection Automatic Essay Grading - SmolLM2 β’ 15 items β’ Updated Jun 9, 2025 β’ 1
πͺ Qwen2.5 Automatic Essay Grading Collection Automatic Essay Grading - Qwen2.5 β’ 15 items β’ Updated Jun 9, 2025 β’ 1
Kimi k1.5: Scaling Reinforcement Learning with LLMs Paper β’ 2501.12599 β’ Published Jan 22, 2025 β’ 130
view article Article Efficient LLM Pretraining: Packed Sequences and Masked Attention sirluk β’ Oct 7, 2024 β’ 71
view article Article Mixture of Experts Explained +4 osanseviero, lewtun, philschmid, smangrul, ybelkada, pcuenq β’ Dec 11, 2023 β’ 1.12k
view article Article DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge NormalUhr β’ Feb 7, 2025 β’ 292
Multilingual LLM Evaluation Collection Multilingual Evaluation Benchmarks β’ 8 items β’ Updated Jul 31, 2025 β’ 34
view article Article Open-R1: a fully open reproduction of DeepSeek-R1 +1 eliebak, lvwerra, lewtun β’ Jan 28, 2025 β’ 888
view article Article Towards a Fully Arabic Retrieval-Augmented Generation (RAG) Pipeline: Omartificial-Intelligence-Space β’ Nov 30, 2024 β’ 28
MA-LMM: Memory-Augmented Large Multimodal Model for Long-Term Video Understanding Paper β’ 2404.05726 β’ Published Apr 8, 2024 β’ 23
Model Merging Collection Model Merging is a very popular technique nowadays in LLM. Here is a chronological list of papers on the space that will help you get started with it! β’ 30 items β’ Updated Jun 12, 2024 β’ 253
Gemini in Reasoning: Unveiling Commonsense in Multimodal Large Language Models Paper β’ 2312.17661 β’ Published Dec 29, 2023 β’ 15
LLM in a flash: Efficient Large Language Model Inference with Limited Memory Paper β’ 2312.11514 β’ Published Dec 12, 2023 β’ 264
Distributed Representations of Words and Phrases and their Compositionality Paper β’ 1310.4546 β’ Published Oct 16, 2013 β’ 3
Efficient Estimation of Word Representations in Vector Space Paper β’ 1301.3781 β’ Published Jan 16, 2013 β’ 8
LoRA: Low-Rank Adaptation of Large Language Models Paper β’ 2106.09685 β’ Published Jun 17, 2021 β’ 60
Intrinsic Dimensionality Explains the Effectiveness of Language Model Fine-Tuning Paper β’ 2012.13255 β’ Published Dec 22, 2020 β’ 5