Jaemin Cho

j-min

https://j-min.io

AI & ML interests

None yet

Recent Activity

submitted a paper about 12 hours ago

Physics Question Scene Graph: Fine-grained Evaluation of Physical Plausibility in Text-to-Video Generation


Organizations

upvoted a paper about 12 hours ago

Physics Question Scene Graph: Fine-grained Evaluation of Physical Plausibility in Text-to-Video Generation

Paper • 2606.25306 • Published 2 days ago • 1

upvoted a paper 8 days ago

MolmoMotion: Forecasting Point Trajectories in 3D with Language Instruction

Paper • 2606.18558 • Published 9 days ago • 50

upvoted a collection about 2 months ago

MolmoAct2 Models

Collection

Collection of the base models for MolmoAct2 • 6 items • Updated May 5 • 23

upvoted 2 papers 3 months ago

MolmoWeb: Open Visual Web Agent and Open Data for the Open Web

Paper • 2604.08516 • Published Apr 9 • 47

VFIG: Vectorizing Complex Figures in SVG with Vision-Language Models

Paper • 2603.24575 • Published Mar 25 • 19

upvoted a paper 11 months ago

Bifrost-1: Bridging Multimodal LLMs and Diffusion Models with Patch-level CLIP Latents

Paper • 2508.05954 • Published Aug 8, 2025 • 6

upvoted a paper 12 months ago

A Survey on Long-Video Storytelling Generation: Architectures, Consistency, and Cinematic Quality

Paper • 2507.07202 • Published Jul 9, 2025 • 25

upvoted 3 papers about 1 year ago

EPiC: Efficient Video Camera Control Learning with Precise Anchor-Video Guidance

Paper • 2505.21876 • Published May 28, 2025 • 9

CAPTURe: Evaluating Spatial Reasoning in Vision Language Models via Occluded Object Counting

Paper • 2504.15485 • Published Apr 21, 2025 • 4

Executable Functional Abstractions: Inferring Generative Programs for Advanced Math Problems

Paper • 2504.09763 • Published Apr 14, 2025 • 12

upvoted a collection about 1 year ago

EFAGen

Collection

4 items • Updated Apr 15, 2025 • 2

upvoted a paper about 1 year ago

Training-free Guidance in Text-to-Video Generation via Multimodal Planning and Structured Noise Initialization

Paper • 2504.08641 • Published Apr 11, 2025 • 6

upvoted 2 papers over 1 year ago

DreamRunner: Fine-Grained Storytelling Video Generation with Retrieval-Augmented Motion Adaptation

Paper • 2411.16657 • Published Nov 25, 2024 • 19

VideoRepair: Improving Text-to-Video Generation via Misalignment Evaluation and Localized Refinement

Paper • 2411.15115 • Published Nov 22, 2024 • 10

upvoted a paper over 2 years ago

VideoDirectorGPT: Consistent Multi-scene Video Generation via LLM-Guided Planning

Paper • 2309.15091 • Published Sep 26, 2023 • 35
