Zhaowei Wang

ZhaoweiWang

https://zhaowei-wang-nlp.github.io/

AI & ML interests

NLP

Recent Activity

upvoted a paper 14 days ago

Beyond the Current Observation: Evaluating Multimodal Large Language Models in Controllable Non-Markov Games

commented on a paper 16 days ago

Training Long-Context Vision-Language Models Effectively with Generalization Beyond 128K Context

upvoted a paper 16 days ago

Pythagoras-Prover: Advancing Efficient Formal Proving via Augmented Lean Formalisation

upvoted a paper 14 days ago

Beyond the Current Observation: Evaluating Multimodal Large Language Models in Controllable Non-Markov Games

Paper • 2606.19338 • Published 15 days ago • 49

upvoted a paper 16 days ago

Pythagoras-Prover: Advancing Efficient Formal Proving via Augmented Lean Formalisation

Paper • 2606.12594 • Published 22 days ago • 17

upvoted a paper about 1 month ago

SCOPE: Self-Play via Co-Evolving Policies for Open-Ended Tasks

Paper • 2605.31433 • Published May 29 • 28

upvoted 4 papers about 2 months ago

WildClawBench: A Benchmark for Real-World, Long-Horizon Agent Evaluation

Paper • 2605.10912 • Published May 11 • 46

MemLens: Benchmarking Multimodal Long-Term Memory in Large Vision-Language Models

Paper • 2605.14906 • Published May 14 • 79

STALE: Can LLM Agents Know When Their Memories Are No Longer Valid?

Paper • 2605.06527 • Published May 7 • 47

Training Long-Context Vision-Language Models Effectively with Generalization Beyond 128K Context

Paper • 2605.13831 • Published May 13 • 89

upvoted a paper 6 months ago

Memory-T1: Reinforcement Learning for Temporal Reasoning in Multi-session Agents

Paper • 2512.20092 • Published Dec 23, 2025 • 9

upvoted a paper 9 months ago

Learning GUI Grounding with Spatial Reasoning from Visual Feedback

Paper • 2509.21552 • Published Sep 25, 2025 • 11

upvoted 2 papers 10 months ago

Loong: Synthesize Long Chain-of-Thoughts at Scale through Verifiers

Paper • 2509.03059 • Published Sep 3, 2025 • 25

UI-TARS-2 Technical Report: Advancing GUI Agent with Multi-Turn Reinforcement Learning

Paper • 2509.02544 • Published Sep 2, 2025 • 128

upvoted 6 papers about 1 year ago

How Alignment Shrinks the Generative Horizon

Paper • 2506.17871 • Published Jun 22, 2025 • 6

A Controllable Examination for Long-Context Language Models

Paper • 2506.02921 • Published Jun 3, 2025 • 34

ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models

Paper • 2505.24864 • Published May 30, 2025 • 146

Neurosymbolic Diffusion Models

Paper • 2505.13138 • Published May 19, 2025 • 36

MMLongBench: Benchmarking Long-Context Vision-Language Models Effectively and Thoroughly

Paper • 2505.10610 • Published May 15, 2025 • 56

CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training

Paper • 2504.13161 • Published Apr 17, 2025 • 98

upvoted a paper over 2 years ago

Contrastive Chain-of-Thought Prompting

Paper • 2311.09277 • Published Nov 15, 2023 • 35