jiahaowang

wang-jiahao

https://wang-jiahao.github.io/

AI & ML interests

None yet

Recent Activity

updated a dataset 17 days ago

NJU-LINK/AVSCapBench

authored a paper about 2 months ago

OmniCap-IF: Benchmarking and Improving Instruction Following Abilities for Omni-Video Captioning

upvoted a paper about 2 months ago

SWE-Explore: Benchmarking How Coding Agents Explore Repositories

upvoted 6 papers about 2 months ago

SWE-Explore: Benchmarking How Coding Agents Explore Repositories

Paper • 2606.07297 • Published Jun 5 • 123

OmniCap-IF: Benchmarking and Improving Instruction Following Abilities for Omni-Video Captioning

Paper • 2606.08572 • Published Jun 7 • 14

CoVEBench: Can Video Editing Models Handle Complex Instructions?

Paper • 2606.08415 • Published Jun 7 • 52

TVIR: Building Deep Research Agents Towards Text–Visual Interleaved Report Generation

Paper • 2606.02320 • Published Jun 1 • 15

MMG2Skill: Can Agents Distill In-the-Wild Guides into Self-Evolving Skills?

Paper • 2606.01993 • Published Jun 1 • 15

Where Do Deep-Research Agents Go Wrong? Span-Level Error Localization in Agent Trajectories

Paper • 2606.02060 • Published Jun 1 • 58

upvoted a paper 2 months ago

Solvita: Enhancing Large Language Models for Competitive Programming via Agentic Evolution

Paper • 2605.15301 • Published May 14 • 22

upvoted a paper 4 months ago

CodeTracer: Towards Traceable Agent States

Paper • 2604.11641 • Published Apr 13 • 38

upvoted a paper 5 months ago

InCoder-32B: Code Foundation Model for Industrial Scenarios

Paper • 2603.16790 • Published Mar 17 • 312

upvoted a paper 7 months ago

T2AV-Compass: Towards Unified Evaluation for Text-to-Audio-Video Generation

Paper • 2512.21094 • Published Dec 24, 2025 • 25

upvoted 3 papers 8 months ago

AVoCaDO: An Audiovisual Video Captioner Driven by Temporal Orchestration

Paper • 2510.10395 • Published Oct 12, 2025 • 32

ViDiC: Video Difference Captioning

Paper • 2512.03405 • Published Dec 3, 2025 • 29

From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence

Paper • 2511.18538 • Published Nov 23, 2025 • 306

upvoted a paper 9 months ago

MT-Video-Bench: A Holistic Video Understanding Benchmark for Evaluating Multimodal LLMs in Multi-Turn Dialogues

Paper • 2510.17722 • Published Oct 20, 2025 • 20

upvoted a paper 10 months ago

OmniVideoBench: Towards Audio-Visual Understanding Evaluation for Omni MLLMs

Paper • 2510.10689 • Published Oct 12, 2025 • 46