Zhaowei Wang

ZhaoweiWang

https://zhaowei-wang-nlp.github.io/

AI & ML interests

NLP

Recent Activity

upvoted a paper 10 days ago

Beyond the Current Observation: Evaluating Multimodal Large Language Models in Controllable Non-Markov Games

Paper • 2606.19338 • Published 12 days ago • 48

commented on a paper 12 days ago

Training Long-Context Vision-Language Models Effectively with Generalization Beyond 128K Context

Paper • 2605.13831 • Published May 13 • 89

upvoted a paper 13 days ago

Pythagoras-Prover: Advancing Efficient Formal Proving via Augmented Lean Formalisation

Paper • 2606.12594 • Published 19 days ago • 17

updated a dataset 22 days ago

ZhaoweiWang/MMLongBench

Preview • Updated 22 days ago • 1.36k • 5

upvoted a paper 27 days ago

SCOPE: Self-Play via Co-Evolving Policies for Open-Ended Tasks

Paper • 2605.31433 • Published about 1 month ago • 28

submitted a paper to Daily Papers about 1 month ago

STALE: Can LLM Agents Know When Their Memories Are No Longer Valid?

Paper • 2605.06527 • Published May 7 • 47

upvoted 2 papers about 1 month ago

WildClawBench: A Benchmark for Real-World, Long-Horizon Agent Evaluation

Paper • 2605.10912 • Published May 11 • 46

MemLens: Benchmarking Multimodal Long-Term Memory in Large Vision-Language Models

Paper • 2605.14906 • Published May 14 • 79

submitted a paper to Daily Papers about 1 month ago

MemLens: Benchmarking Multimodal Long-Term Memory in Large Vision-Language Models

Paper • 2605.14906 • Published May 14 • 79

upvoted a paper about 1 month ago

STALE: Can LLM Agents Know When Their Memories Are No Longer Valid?

Paper • 2605.06527 • Published May 7 • 47

updated a collection about 1 month ago

MMProLong

Collection

A 7B LVLM with 128K context window and 512K generalization through long-context continued pre-training • 1 item • Updated May 15

authored 2 papers about 2 months ago

CritiCal: Can Critique Help LLM Uncertainty or Confidence Calibration?

Paper • 2510.24505 • Published Oct 28, 2025 • 5

Training Long-Context Vision-Language Models Effectively with Generalization Beyond 128K Context

Paper • 2605.13831 • Published May 13 • 89

upvoted a paper about 2 months ago

Training Long-Context Vision-Language Models Effectively with Generalization Beyond 128K Context

Paper • 2605.13831 • Published May 13 • 89

submitted a paper to Daily Papers about 2 months ago

Training Long-Context Vision-Language Models Effectively with Generalization Beyond 128K Context

Paper • 2605.13831 • Published May 13 • 89

authored 5 papers 6 months ago

Rethinking Memory in AI: Taxonomy, Operations, Topics, and Future Directions

Paper • 2505.00675 • Published May 1, 2025 • 3

AbsPyramid: Benchmarking the Abstraction Ability of Language Models with a Unified Entailment Graph

Paper • 2311.09174 • Published Nov 15, 2023

AbsInstruct: Eliciting Abstraction Ability from LLMs through Explanation Tuning with Plausibility Estimation

Paper • 2402.10646 • Published Feb 16, 2024

Loong: Synthesize Long Chain-of-Thoughts at Scale through Verifiers

Paper • 2509.03059 • Published Sep 3, 2025 • 25

NewtonBench: Benchmarking Generalizable Scientific Law Discovery in LLM Agents

Paper • 2510.07172 • Published Oct 8, 2025 • 28
