26 46 11

Xilin Wei

Wiselnn

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 days ago

Beyond the Current Observation: Evaluating Multimodal Large Language Models in Controllable Non-Markov Games

upvoted a paper 24 days ago

DenoiseRL: Bootstrapping Reasoning Models to Recover from Noisy Prefixes

upvoted a paper about 1 month ago

WildClawBench: A Benchmark for Real-World, Long-Horizon Agent Evaluation

View all activity

Organizations

upvoted a paper 3 days ago

Beyond the Current Observation: Evaluating Multimodal Large Language Models in Controllable Non-Markov Games

Paper • 2606.19338 • Published 5 days ago • 46

upvoted a paper 24 days ago

DenoiseRL: Bootstrapping Reasoning Models to Recover from Noisy Prefixes

Paper • 2605.28421 • Published 26 days ago • 47

upvoted a paper about 1 month ago

WildClawBench: A Benchmark for Real-World, Long-Horizon Agent Evaluation

Paper • 2605.10912 • Published May 11 • 46

liked a dataset 3 months ago

internlm/WildClawBench

Benchmark • Updated May 15 • 11.2k • 62

upvoted a paper 3 months ago

Visual-ERM: Reward Modeling for Visual Equivalence

Paper • 2603.13224 • Published Mar 13 • 21

authored a paper 3 months ago

EndoCoT: Scaling Endogenous Chain-of-Thought Reasoning in Diffusion Models

Paper • 2603.12252 • Published Mar 12 • 12

upvoted a paper 3 months ago

EndoCoT: Scaling Endogenous Chain-of-Thought Reasoning in Diffusion Models

Paper • 2603.12252 • Published Mar 12 • 12

upvoted a collection 4 months ago

ARM-Thinker

Collection

[CVPR2026] Official Implementation of "ARM-Thinker: Reinforcing Multimodal Generative Reward Models with Agentic Tool Use and Visual Reasoning" • 1 item • Updated Feb 26 • 1

upvoted a paper 4 months ago

DeepGen 1.0: A Lightweight Unified Multimodal Model for Advancing Image Generation and Editing

Paper • 2602.12205 • Published Feb 13 • 83

upvoted 2 papers 6 months ago

V-RGBX: Video Editing with Accurate Controls over Intrinsic Properties

Paper • 2512.11799 • Published Dec 12, 2025 • 30

Beyond Real: Imaginary Extension of Rotary Position Embeddings for Long-Context LLMs

Paper • 2512.07525 • Published Dec 8, 2025 • 60

upvoted a paper 7 months ago

ARM-Thinker: Reinforcing Multimodal Generative Reward Models with Agentic Tool Use and Visual Reasoning

Paper • 2512.05111 • Published Dec 4, 2025 • 50

upvoted a paper 8 months ago

MM-HELIX: Boosting Multimodal Long-Chain Reflective Reasoning with Holistic Platform and Adaptive Hybrid Policy Optimization

Paper • 2510.08540 • Published Oct 9, 2025 • 110

upvoted 2 papers 9 months ago

SPARK: Synergistic Policy And Reward Co-Evolving Framework

Paper • 2509.22624 • Published Sep 26, 2025 • 19

CapRL: Stimulating Dense Image Caption Capabilities via Reinforcement Learning

Paper • 2509.22647 • Published Sep 26, 2025 • 37

commented a paper 9 months ago

CapRL: Stimulating Dense Image Caption Capabilities via Reinforcement Learning

Paper • 2509.22647 • Published Sep 26, 2025 • 37 •

updated a collection 9 months ago

SIM-CoT: Supervised Implicit Chain-of-Thought

Collection

Official checkpoint repository of "SIM-CoT: Supervised Implicit Chain-of-Thought" • 6 items • Updated Sep 28, 2025 • 2

Xilin Wei

AI & ML interests

Recent Activity

Organizations

Wiselnn's activity