QwenLong-L1.5: Post-Training Recipe for Long-Context Reasoning and Memory Management Paper • 2512.12967 • Published Dec 15, 2025 • 107
IDMR: Towards Instance-Driven Precise Visual Correspondence in Multimodal Retrieval Paper • 2504.00954 • Published Apr 1, 2025 • 2
Beyond the Trade-off: Self-Supervised Reinforcement Learning for Reasoning Models' Instruction Following Paper • 2508.02150 • Published Aug 4, 2025 • 37