arxiv:2402.03746
Yura Choi
Yuuraa
AI & ML interests
Large Multimodal Models, Video Understanding
Recent Activity
submitted a paper about 3 hours ago
Do You See What I Am Pointing At? Gesture-Based Egocentric Video Question Answering
upvoted a paper 3 days ago
OpenClaw-RL: Train Any Agent Simply by Talking
upvoted a paper 4 months ago
Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm
Organizations
None yet