Collector 2 - a scottrx11 Collection

updated 7 days ago

GigaBrain-0.5M*: a VLA That Learns From World Model-Based Reinforcement Learning

Paper • 2602.12099 • Published Feb 12 • 62
When to Memorize and When to Stop: Gated Recurrent Memory for Long-Context Reasoning

Paper • 2602.10560 • Published Feb 11 • 31
G-LNS: Generative Large Neighborhood Search for LLM-Based Automatic Heuristic Design

Paper • 2602.08253 • Published Feb 9 • 27
ROCKET: Rapid Optimization via Calibration-guided Knapsack Enhanced Truncation for Efficient Model Compression

Paper • 2602.11008 • Published Feb 11 • 18
SkillRL: Evolving Agents via Recursive Skill-Augmented Reinforcement Learning

Paper • 2602.08234 • Published Feb 9 • 76
MetaClaw: Just Talk -- An Agent That Meta-Learns and Evolves in the Wild

Paper • 2603.17187 • Published Mar 17 • 139
Generation Models Know Space: Unleashing Implicit 3D Priors for Scene Understanding

Paper • 2603.19235 • Published Mar 19 • 95
Memento-Skills: Let Agents Design Agents

Paper • 2603.18743 • Published Mar 19 • 58
MHPO: Modulated Hazard-aware Policy Optimization for Stable Reinforcement Learning

Paper • 2603.16929 • Published Mar 14 • 13
Prompt-Free Universal Region Proposal Network

Paper • 2603.17554 • Published Mar 18 • 3
COT-FM: Cluster-wise Optimal Transport Flow Matching

Paper • 2603.13395 • Published Mar 11 • 2
A Subgoal-driven Framework for Improving Long-Horizon LLM Agents

Paper • 2603.19685 • Published Mar 20 • 22
Reasoning as Compression: Unifying Budget Forcing via the Conditional Information Bottleneck

Paper • 2603.08462 • Published Mar 9 • 22
Breaking the Capability Ceiling of LLM Post-Training by Reintroducing Markov States

Paper • 2603.19987 • Published Mar 20 • 9
EgoForge: Goal-Directed Egocentric World Simulator

Paper • 2603.20169 • Published Mar 20 • 10
UniGRPO: Unified Policy Optimization for Reasoning-Driven Visual Generation

Paper • 2603.23500 • Published Mar 24 • 36
2Xplat: Two Experts Are Better Than One Generalist

Paper • 2603.21064 • Published Mar 22 • 25
Self-Improving World Modelling with Latent Actions

Paper • 2602.06130 • Published Feb 5 • 32
InftyThink+: Effective and Efficient Infinite-Horizon Reasoning via Reinforcement Learning

Paper • 2602.06960 • Published Feb 6 • 14
Less Gaussians, Texture More: 4K Feed-Forward Textured Splatting

Paper • 2603.25745 • Published Mar 26 • 16
Emergent Introspective Awareness in Large Language Models

Paper • 2601.01828 • Published Jan 5
4D-LRM: Large Space-Time Reconstruction Model From and To Any View at Any Time

Paper • 2506.18890 • Published Jun 23, 2025 • 6
Light of Normals: Unified Feature Representation for Universal Photometric Stereo

Paper • 2506.18882 • Published Jun 23, 2025 • 89
OmniSVG: A Unified Scalable Vector Graphics Generation Model

Paper • 2504.06263 • Published Apr 8, 2025 • 186