Yongliang Wu

Liang0223

https://yongliang-wu.github.io/

Recent Activity

upvoted a paper about 1 month ago

Stop When Reasoning Converges: Semantic-Preserving Early Exit for Reasoning Models

Paper • 2605.17672 • Published May 17 • 23

upvoted 3 papers about 2 months ago

EvolveMem: Self-Evolving Memory Architecture via AutoResearch for LLM Agents

Paper • 2605.13941 • Published May 13 • 24

Covering Human Action Space for Computer Use: Data Synthesis and Benchmark

Paper • 2605.12501 • Published May 12 • 16

Auto-Rubric as Reward: From Implicit Preferences to Explicit Multimodal Generative Criteria

Paper • 2605.08354 • Published May 8 • 23

upvoted 2 papers 3 months ago

BandPO: Bridging Trust Regions and Ratio Clipping via Probability-Aware Bounds for LLM Reinforcement Learning

Paper • 2603.04918 • Published Mar 5 • 56

GEditBench v2: A Human-Aligned Benchmark for General Image Editing

Paper • 2603.28547 • Published Mar 30 • 32

upvoted a paper 6 months ago

MMGR: Multi-Modal Generative Reasoning

Paper • 2512.14691 • Published Dec 16, 2025 • 121

upvoted a paper 7 months ago

Envision: Benchmarking Unified Understanding & Generation for Causal World Process Insights

Paper • 2512.01816 • Published Dec 1, 2025 • 94

upvoted a paper 9 months ago

QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs

Paper • 2510.11696 • Published Oct 13, 2025 • 183

upvoted 2 papers 10 months ago

From Editor to Dense Geometry Estimator

Paper • 2509.04338 • Published Sep 4, 2025 • 96

Towards a Unified View of Large Language Model Post-Training

Paper • 2509.04419 • Published Sep 4, 2025 • 77

upvoted 5 papers 11 months ago

On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification

Paper • 2508.05629 • Published Aug 7, 2025 • 190

Phi-Ground Tech Report: Advancing Perception in GUI Grounding

Paper • 2507.23779 • Published Jul 31, 2025 • 46

MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding Benchmark

Paper • 2406.01574 • Published Jun 3, 2024 • 55

OneIG-Bench: Omni-dimensional Nuanced Evaluation for Image Generation

Paper • 2506.07977 • Published Jun 9, 2025 • 40

Step-Audio 2 Technical Report

Paper • 2507.16632 • Published Jul 22, 2025 • 76

upvoted 4 papers 12 months ago

SpeakerVid-5M: A Large-Scale High-Quality Dataset for Audio-Visual Dyadic Interactive Human Generation

Paper • 2507.09862 • Published Jul 14, 2025 • 52

LayerCake: Token-Aware Contrastive Decoding within Large Language Model Layers

Paper • 2507.04404 • Published Jul 6, 2025 • 22

Vision Foundation Models as Effective Visual Tokenizers for Autoregressive Image Generation

Paper • 2507.08441 • Published Jul 11, 2025 • 63

A Minimalist Approach to LLM Reasoning: from Rejection Sampling to Reinforce

Paper • 2504.11343 • Published Apr 15, 2025 • 20