6 24 25

Jihwan Kim

jjihwannn

https://jjihwan.github.io/

AI & ML interests

Computer Vision, Diffusion Models, Generative Models

Recent Activity

upvoted an article about 1 month ago

Welcome Gemma 4: Frontier multimodal intelligence on device

liked a dataset 3 months ago

mvp-lab/LLaVA-OneVision-1.5-Instruct-Data

upvoted a paper 3 months ago

Language Self-Play For Data-Free Training

View all activity

Organizations

upvoted an article about 1 month ago

Article

Welcome Gemma 4: Frontier multimodal intelligence on device

merve, pcuenq, sergiopaniego, burtenshaw, Steveeeeeeen, alvarobartt, SaylorTwift

•

Apr 2

• 894

liked a dataset 3 months ago

mvp-lab/LLaVA-OneVision-1.5-Instruct-Data

Viewer • Updated Nov 21, 2025 • 21.9M • 85.3k • 71

upvoted a paper 3 months ago

Language Self-Play For Data-Free Training

Paper • 2509.07414 • Published Sep 9, 2025 • 31

upvoted an article 6 months ago

Article

Streaming datasets: 100x More Efficient

andito, lhoestq, burtenshaw, pcuenq, merve

•

Oct 27, 2025

• 86

liked a model 7 months ago

OpenGVLab/InternVL3_5-8B-Flash

Image-Text-to-Text • 9B • Updated Sep 28, 2025 • 501 • 5

upvoted a paper 7 months ago

VideoNSA: Native Sparse Attention Scales Video Understanding

Paper • 2510.02295 • Published Oct 2, 2025 • 10

upvoted a paper 9 months ago

InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency

Paper • 2508.18265 • Published Aug 25, 2025 • 221

New activity in ShareGPT4Video/ShareGPT4Video 9 months ago

4.8M videos

#26 opened 9 months ago by

jjihwannn

upvoted a collection 10 months ago

Qwen2.5-VL

Collection

Vision-language model series based on Qwen2.5 • 10 items • Updated Mar 2 • 562

liked 2 datasets 10 months ago

OpenGVLab/InternVideo2_Vid_Text

Viewer • Updated Jul 10, 2024 • 40.5M • 69 • 15

OpenGVLab/InternVid-Full

Viewer • Updated Jun 5, 2024 • 47.6M • 427 • 16

upvoted a paper 11 months ago

STR-Match: Matching SpatioTemporal Relevance Score for Training-Free Video Editing

Paper • 2506.22868 • Published Jun 28, 2025 • 5

upvoted a paper 12 months ago

Diffusion vs. Autoregressive Language Models: A Text Embedding Perspective

Paper • 2505.15045 • Published May 21, 2025 • 56

upvoted a collection 12 months ago

Unofficial Mamba2 for Hf Transformers

Collection

Just the original weights converted to be compatible with transformers. • 5 items • Updated Oct 16, 2024 • 2

upvoted 3 papers about 1 year ago

liked a model about 1 year ago

LGAI-EXAONE/EXAONE-Deep-32B

Text Generation • 32B • Updated Feb 6 • 553 • 300

upvoted 2 papers over 1 year ago

VideoJAM: Joint Appearance-Motion Representations for Enhanced Motion Generation in Video Models

Paper • 2502.02492 • Published Feb 4, 2025 • 66

Weak-to-Strong Diffusion with Reflection

Paper • 2502.00473 • Published Feb 1, 2025 • 24

Jihwan Kim

AI & ML interests

Recent Activity

Organizations

jjihwannn's activity

Welcome Gemma 4: Frontier multimodal intelligence on device

Streaming datasets: 100x More Efficient

4.8M videos