
Park Jae Hyun

Jvehyun

AI & ML interests

None yet

Organizations

None yet

Recent Activity

upvoted a paper 2 days ago

Imaginative Perception Tokens Enhance Spatial Reasoning in Multimodal Language Models

Paper • 2606.03988 • Published 25 days ago • 126

liked a model 11 days ago

OpenGVLab/InternVideo2_5_Chat_8B

Video-Text-to-Text • 8B • Updated Aug 4, 2025 • 5.47k • 91

liked a model 12 days ago

allenai/MolmoPoint-Vid-4B

Video-Text-to-Text • 5B • Updated Mar 30 • 477 • 12

liked a model 24 days ago

nvidia/Cosmos3-Super-Image2Video

Image-to-Video • 65B • Updated 12 days ago • 39.5k • 130

upvoted an article 27 days ago

Towards Speed-of-Light Text Generation with Nemotron-Labs Diffusion Language Models

Article • nvidia • May 23 • 34

liked a model 27 days ago

nvidia/Nemotron-Labs-Diffusion-VLM-8B

Image-Text-to-Text • 9B • Updated 24 days ago • 3.84k • 26

liked a model about 2 months ago

WepeNerd/Obscura_Remova

Image-to-Video • Updated 17 days ago • 2.2k • 49

upvoted a collection about 2 months ago

ByteDance Papers

Collection

ByteDance papers collection • 142 items • Updated 5 days ago • 35

upvoted a collection 2 months ago

PixMo

Collection

A set of vision-language datasets built by Ai2 and used to train the Molmo family of models. Read more at https://molmo.allenai.org/blog • 9 items • Updated Mar 2 • 90

upvoted a paper 2 months ago

Seedance 2.0: Advancing Video Generation for World Complexity

Paper • 2604.14148 • Published Apr 15 • 167

liked 3 models 2 months ago

liked a dataset 3 months ago

lambda/hermes-agent-reasoning-traces

Viewer • Updated Apr 17 • 14.7k • 3.06k • 370

upvoted a paper 3 months ago

Video-MME-v2: Towards the Next Stage in Benchmarks for Comprehensive Video Understanding

Paper • 2604.05015 • Published Apr 6 • 237

liked 3 models 3 months ago

k2-fsa/OmniVoice

Text-to-Speech • 0.6B • Updated May 7 • 1.02M • 1.09k

Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled

Image-Text-to-Text • 28B • Updated Apr 6 • 151k • 2.89k

Lightricks/LTX-2-19b-LoRA-Camera-Control-Static

Text-to-Video • Updated Jan 5 • 28

upvoted a paper 3 months ago

CARLA-Air: Fly Drones Inside a CARLA World -- A Unified Infrastructure for Air-Ground Embodied Intelligence

Paper • 2603.28032 • Published Mar 30 • 344

liked a model 3 months ago

RealRestorer/RealRestorer

Updated Mar 29 • 253 • 66
