Garrosh Icecream
GarroshIcecream
AI & ML interests
From tiny SLMs to massive LLMs. I’m all about text-to-text fun.
Organizations
None yet
READ ON TOILET
- Speed Always Wins: A Survey on Efficient Architectures for Large Language Models
  Paper • 2508.09834 • Published • 53
- The Landscape of Agentic Reinforcement Learning for LLMs: A Survey
  Paper • 2509.02547 • Published • 233
- DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree Search
  Paper • 2509.25454 • Published • 146
- DeMo: Decoupled Momentum Optimization
  Paper • 2411.19870 • Published • 6
P(DOOM) = 1.0
- FutureX: An Advanced Live Benchmark for LLM Agents in Future Prediction
  Paper • 2508.11987 • Published • 72
- InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency
  Paper • 2508.18265 • Published • 214
- Less is More: Recursive Reasoning with Tiny Networks
  Paper • 2510.04871 • Published • 509
models 0
None public yet
datasets 0
None public yet