- DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
  Paper • 2501.12948 • Published • 435
- Recursive Language Models
  Paper • 2512.24601 • Published • 63
- Geospatial Mechanistic Interpretability of Large Language Models
  Paper • 2505.03368 • Published • 12
- GEO-Bench-2: From Performance to Capability, Rethinking Evaluation in Geospatial AI
  Paper • 2511.15658 • Published • 1
Collections
Collections including paper arxiv:2512.24601
- Dynamic Large Concept Models: Latent Reasoning in an Adaptive Semantic Space
  Paper • 2512.24617 • Published • 56
- Recursive Language Models
  Paper • 2512.24601 • Published • 63
- Nested Learning: The Illusion of Deep Learning Architectures
  Paper • 2512.24695 • Published • 35
- DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models
  Paper • 2512.02556 • Published • 251

- Low-probability Tokens Sustain Exploration in Reinforcement Learning with Verifiable Reward
  Paper • 2510.03222 • Published • 75
- In-the-Flow Agentic System Optimization for Effective Planning and Tool Use
  Paper • 2510.05592 • Published • 106
- Less is More: Recursive Reasoning with Tiny Networks
  Paper • 2510.04871 • Published • 504
- Multi-Agent Tool-Integrated Policy Optimization
  Paper • 2510.04678 • Published • 30

- Forgetting Transformer: Softmax Attention with a Forget Gate
  Paper • 2503.02130 • Published • 32
- L^2M: Mutual Information Scaling Law for Long-Context Language Modeling
  Paper • 2503.04725 • Published • 21
- Transformers without Normalization
  Paper • 2503.10622 • Published • 170
- I-Con: A Unifying Framework for Representation Learning
  Paper • 2504.16929 • Published • 30

- NitroGen: An Open Foundation Model for Generalist Gaming Agents
  Paper • 2601.02427 • Published • 41
- mHC: Manifold-Constrained Hyper-Connections
  Paper • 2512.24880 • Published • 254
- DiffThinker: Towards Generative Multimodal Reasoning with Diffusion Models
  Paper • 2512.24165 • Published • 48
- Entropy-Adaptive Fine-Tuning: Resolving Confident Conflicts to Mitigate Forgetting
  Paper • 2601.02151 • Published • 98

- Neural Machine Translation by Jointly Learning to Align and Translate
  Paper • 1409.0473 • Published • 7
- Attention Is All You Need
  Paper • 1706.03762 • Published • 110
- BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
  Paper • 1810.04805 • Published • 25
- Hierarchical Reasoning Model
  Paper • 2506.21734 • Published • 46

- The Leaderboard Illusion
  Paper • 2504.20879 • Published • 72
- SmolVLM: Redefining small and efficient multimodal models
  Paper • 2504.05299 • Published • 203
- Seedance 1.0: Exploring the Boundaries of Video Generation Models
  Paper • 2506.09113 • Published • 105
- Small Language Models are the Future of Agentic AI
  Paper • 2506.02153 • Published • 23

- LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens
  Paper • 2402.13753 • Published • 116
- Quiet-STaR: Language Models Can Teach Themselves to Think Before Speaking
  Paper • 2403.09629 • Published • 79
- Larimar: Large Language Models with Episodic Memory Control
  Paper • 2403.11901 • Published • 33
- Evolutionary Optimization of Model Merging Recipes
  Paper • 2403.13187 • Published • 58