Foresight: Failure Detection for Long-Horizon Robotic Manipulation with Action-Conditioned World Model Latents Paper • 2606.23085 • Published 13 days ago • 14
Grouped Query Experts: Mixture-of-Experts on GQA Self-Attention Paper • 2606.20945 • Published 17 days ago • 76
StepPO: Step-Aligned Policy Optimization for Agentic Reinforcement Learning Paper • 2604.18401 • Published about 1 month ago • 7
GD^2PO: Mitigating Multi-Reward Conflicts via Group-Dynamic reward-Decoupled Policy Optimization Paper • 2606.16771 • Published 20 days ago • 13
RefGC-SR^2: Reference-guided Generated Content Super-Resolution and Refinement Paper • 2606.15158 • Published 22 days ago • 9
Reinforcement Learning-Guided Retrieval with Soft Fusion for Robust Multimodal Imitation Learning under Missing Modalities Paper • 2606.15514 • Published 22 days ago • 3
Rethinking Shrinkage Bias in LLM FP4 Pretraining: Geometric Origin, Systemic Impact, and UFP4 Recipe Paper • 2606.20381 • Published 17 days ago • 10
WorldLines: Benchmarking and Modeling Long-Horizon Stateful Embodied Agents Paper • 2606.18847 • Published 18 days ago • 5
Web Retrieval-Aware Chunking (W-RAC) for Efficient and Cost-Effective Retrieval-Augmented Generation Systems Paper • 2604.04936 • Published Jan 8 • 26
Komodo: A Linguistic Expedition into Indonesia's Regional Languages Paper • 2403.09362 • Published Mar 14, 2024 • 11
Beyond Extraction: Contextualising Tabular Data for Efficient Summarisation by Language Models Paper • 2401.02333 • Published Jan 4, 2024 • 7
Vision-Guided Chunking Is All You Need: Enhancing RAG with Multimodal Document Understanding Paper • 2506.16035 • Published Jun 19, 2025 • 89
ZClip: Adaptive Spike Mitigation for LLM Pre-Training Paper • 2504.02507 • Published Apr 3, 2025 • 90