🤝 Open to Collab

Kevin Lin

KevinQHLin

42 94 44

https://qhlin.me/

AI & ML interests

Vision-Language Model, Video Understanding, Agent

Recent Activity

upvoted a paper 14 days ago

Data Journalist Agent: Transforming Data into Verifiable Multimodal Stories

submitted a paper 14 days ago

Data Journalist Agent: Transforming Data into Verifiable Multimodal Stories

upvoted a paper 14 days ago

RhymeFlow: Training-Free Acceleration for Video Generation with Asynchronous Denoising Flow Scheduling

View all activity

Organizations

upvoted 3 papers 14 days ago

Data Journalist Agent: Transforming Data into Verifiable Multimodal Stories

Paper • 2606.11176 • Published 21 days ago • 127

RhymeFlow: Training-Free Acceleration for Video Generation with Asynchronous Denoising Flow Scheduling

Paper • 2606.06309 • Published 26 days ago • 11

Measuring Epistemic Resilience of LLMs Under Misleading Medical Context

Paper • 2606.12291 • Published 20 days ago • 60

upvoted a paper 18 days ago

TRL-Bench: Standardizing Cross-Paradigm Representation-Level Evaluation of Tabular Encoders

Paper • 2606.09323 • Published 22 days ago • 53

upvoted a paper 20 days ago

Agents' Last Exam

Paper • 2606.05405 • Published 27 days ago • 369

upvoted a paper 24 days ago

Dream.exe: Can Video Generation Models Dream Executable Robot Manipulation?

Paper • 2606.04811 • Published 26 days ago • 17

upvoted 2 papers 25 days ago

The Alignment Curse: Modality Alignment Supercharges Audio Attacks via Text Transfer

Paper • 2602.02557 • Published May 29 • 21

Cosmos 3: Omnimodal World Models for Physical AI

Paper • 2606.02800 • Published 29 days ago • 136

upvoted 5 papers about 1 month ago

D^2-Monitor: Dynamic Safety Monitoring for Diffusion LLMs via Hesitation-Aware Routing

Paper • 2605.25893 • Published May 25 • 39

Soap2Soap: Long Cinematic Video Remaking via Multi-Agent Collaboration

Paper • 2605.17423 • Published May 17 • 34

Forecasting Scientific Progress with Artificial Intelligence

Paper • 2605.22681 • Published May 21 • 45

CutVerse: A Compositional GUI Agents Benchmark for Media Post-Production Editing

Paper • 2605.19484 • Published May 19 • 21

AI for Auto-Research: Roadmap & User Guide

Paper • 2605.18661 • Published May 18 • 69

upvoted 2 papers about 2 months ago

AnyFlow: Any-Step Video Diffusion Model with On-Policy Flow Map Distillation

Paper • 2605.13724 • Published May 13 • 105

MolmoAct2: Action Reasoning Models for Real-world Deployment

Paper • 2605.02881 • Published May 4 • 355

upvoted a paper 2 months ago

Recursive Multi-Agent Systems

Paper • 2604.25917 • Published Apr 28 • 287

upvoted a collection 2 months ago

TON

Collection

Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models. • 7 items • Updated May 23, 2025 • 2

upvoted 2 papers 2 months ago

Agentic World Modeling: Foundations, Capabilities, Laws, and Beyond

Paper • 2604.22748 • Published Apr 24 • 231

GameWorld: Towards Standardized and Verifiable Evaluation of Multimodal Game Agents

Paper • 2604.07429 • Published Apr 8 • 123

upvoted a paper 3 months ago

FORGE:Fine-grained Multimodal Evaluation for Manufacturing Scenarios

Paper • 2604.07413 • Published Apr 8 • 97

Kevin Lin

AI & ML interests

Recent Activity

Organizations

KevinQHLin's activity