Collections
Discover the best community collections!
Collections including paper arxiv:2601.03425
-
AI Meets Brain: Memory Systems from Cognitive Neuroscience to Autonomous Agents
Paper • 2512.23343 • Published • 25 -
Valori: A Deterministic Memory Substrate for AI Systems
Paper • 2512.22280 • Published • 3 -
Improving Multi-step RAG with Hypergraph-based Memory for Long-Context Complex Relational Modeling
Paper • 2512.23959 • Published • 96 -
Nested Learning: The Illusion of Deep Learning Architectures
Paper • 2512.24695 • Published • 34
-
Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free
Paper • 2410.10814 • Published • 51 -
Make LoRA Great Again: Boosting LoRA with Adaptive Singular Values and Mixture-of-Experts Optimization Alignment
Paper • 2502.16894 • Published • 32 -
Ring-lite: Scalable Reasoning via C3PO-Stabilized Reinforcement Learning for LLMs
Paper • 2506.14731 • Published • 8 -
SlimMoE: Structured Compression of Large MoE Models via Expert Slimming and Distillation
Paper • 2506.18349 • Published • 13
-
lusxvr/nanoVLM-222M
Image-Text-to-Text • 0.2B • Updated • 220 • 98 -
Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning
Paper • 2503.09516 • Published • 36 -
AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time
Paper • 2505.24863 • Published • 97 -
QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement Learning
Paper • 2505.17667 • Published • 88
-
AI Meets Brain: Memory Systems from Cognitive Neuroscience to Autonomous Agents
Paper • 2512.23343 • Published • 25 -
Valori: A Deterministic Memory Substrate for AI Systems
Paper • 2512.22280 • Published • 3 -
Improving Multi-step RAG with Hypergraph-based Memory for Long-Context Complex Relational Modeling
Paper • 2512.23959 • Published • 96 -
Nested Learning: The Illusion of Deep Learning Architectures
Paper • 2512.24695 • Published • 34
-
lusxvr/nanoVLM-222M
Image-Text-to-Text • 0.2B • Updated • 220 • 98 -
Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning
Paper • 2503.09516 • Published • 36 -
AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time
Paper • 2505.24863 • Published • 97 -
QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement Learning
Paper • 2505.17667 • Published • 88
-
Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free
Paper • 2410.10814 • Published • 51 -
Make LoRA Great Again: Boosting LoRA with Adaptive Singular Values and Mixture-of-Experts Optimization Alignment
Paper • 2502.16894 • Published • 32 -
Ring-lite: Scalable Reasoning via C3PO-Stabilized Reinforcement Learning for LLMs
Paper • 2506.14731 • Published • 8 -
SlimMoE: Structured Compression of Large MoE Models via Expert Slimming and Distillation
Paper • 2506.18349 • Published • 13