Bingzheng Wei

Bingzheng

AI & ML interests

None yet

Recent Activity

upvoted a paper about 6 hours ago

ProRL: Effective Reinforcement Learning for Proactive Recommendation via Rectified Policy Gradient Estimation

upvoted a paper 1 day ago

Agent Explorative Policy Optimization for Multimodal Agentic Reasoning

upvoted a paper 1 day ago

LocateAnything: Fast and High-Quality Vision-Language Grounding with Parallel Box Decoding

Organizations

None yet

upvoted a paper about 6 hours ago

ProRL: Effective Reinforcement Learning for Proactive Recommendation via Rectified Policy Gradient Estimation

Paper • 2605.28293 • Published 3 days ago • 78

upvoted 5 papers 1 day ago

Agent Explorative Policy Optimization for Multimodal Agentic Reasoning

Paper • 2605.28774 • Published 3 days ago • 76

LocateAnything: Fast and High-Quality Vision-Language Grounding with Parallel Box Decoding

Paper • 2605.27365 • Published 4 days ago • 115

OSP-Next: Efficient High-Quality Video Generation with Sparse Sequence Parallelism, HiF8 Quantization, and Reinforcement Learning

Paper • 2605.28691 • Published 3 days ago • 19

Gamma-World: Generative Multi-Agent World Modeling Beyond Two Players

Paper • 2605.28816 • Published 3 days ago • 230

ResearchMath-14K: Scaling Research-Level Mathematics via Agents

Paper • 2605.28003 • Published 3 days ago • 43

upvoted 2 papers 2 days ago

DVAO: Dynamic Variance-adaptive Advantage Optimization for Multi-reward Reinforcement Learning

Paper • 2605.25604 • Published 5 days ago • 130

SkillOpt: Executive Strategy for Self-Evolving Agent Skills

Paper • 2605.23904 • Published 8 days ago • 201

upvoted 5 papers 5 days ago

See What I Mean: Aligning Vision and Language Representations for Video Fine-grained Object Understanding

Paper • 2605.18018 • Published 12 days ago • 32

DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards

Paper • 2605.21467 • Published 10 days ago • 204

upvoted 3 papers 7 days ago

Enhancing Train-Free Infinite-Frame Generation for Consistent Long Videos

Paper • 2605.18233 • Published 12 days ago • 91

LatentOmni: Rethinking Omni-Modal Understanding via Unified Audio-Visual Latent Reasoning

Paper • 2605.22012 • Published 9 days ago • 46

π-Bench: Evaluating Proactive Personal Assistant Agents in Long-Horizon Workflows

Paper • 2605.14678 • Published 11 days ago • 102

upvoted 4 papers 8 days ago

FlowLong: Inference-time Long Video Generation via Manifold-constrained Tweedie Matching

Paper • 2605.20910 • Published 10 days ago • 29

Spreadsheet-RL: Advancing Large Language Model Agents on Realistic Spreadsheet Tasks via Reinforcement Learning

Paper • 2605.22642 • Published 9 days ago • 35

Video2GUI: Synthesizing Large-Scale Interaction Trajectories for Generalized GUI Agent Pretraining

Paper • 2605.14747 • Published 16 days ago • 145

OScaR: The Occam's Razor for Extreme KV Cache Quantization in LLMs and Beyond

Paper • 2605.19660 • Published 11 days ago • 39
