hjkim

hojie11

·

hojie11

AI & ML interests

Computer Vision, 3D Vision, Anomaly Detection

Recent Activity

upvoted a paper 8 days ago

DragMesh-2: Physically Plausible Dexterous Hand-Object Interaction with Articulated Objects

upvoted a paper 8 days ago

Moebius: 0.2B Lightweight Image Inpainting Framework with 10B-Level Performance

upvoted a paper 10 days ago

Beyond Static Leaderboards: Predictive Validity for the Evaluation of LLM Agents

View all activity

Organizations

None yet

upvoted 2 papers 8 days ago

DragMesh-2: Physically Plausible Dexterous Hand-Object Interaction with Articulated Objects

Paper • 2606.15133 • Published 17 days ago • 74

Moebius: 0.2B Lightweight Image Inpainting Framework with 10B-Level Performance

Paper • 2606.19195 • Published 13 days ago • 137

upvoted a paper 10 days ago

Beyond Static Leaderboards: Predictive Validity for the Evaluation of LLM Agents

Paper • 2606.19704 • Published 12 days ago • 41

upvoted 4 papers 13 days ago

World Tracing: Generative Pixel-Aligned Geometry Beyond the Visible

Paper • 2606.13652 • Published 19 days ago • 15

μ_0: A Scalable 3D Interaction-Trace World Model

Paper • 2606.13769 • Published 19 days ago • 10

From AGI to ASI

Paper • 2606.12683 • Published 20 days ago • 35

OmniDirector: General Multi-Shot Camera Cloning without Cross-Paired Data

Paper • 2606.13432 • Published 19 days ago • 112

upvoted 2 papers 19 days ago

Latent Spatial Memory for Video World Models

Paper • 2606.09828 • Published 22 days ago • 71

Agents' Last Exam

Paper • 2606.05405 • Published 27 days ago • 368

upvoted 6 papers about 1 month ago

LocateAnything: Fast and High-Quality Vision-Language Grounding with Parallel Box Decoding

Paper • 2605.27365 • Published May 26 • 145

TideGS: Scalable Training of Over One Billion 3D Gaussian Splatting Primitives via Out-of-Core Optimization

Paper • 2605.20150 • Published May 19 • 7

RT-Splatting: Joint Reflection-Transmission Modeling with Gaussian Splatting

Paper • 2605.18263 • Published May 18 • 9

Aurora: Unified Video Editing with a Tool-Using Agent

Paper • 2605.18748 • Published May 18 • 29

When Vision Speaks for Sound

Paper • 2605.16403 • Published May 13 • 161

UniT: Unified Geometry Learning with Group Autoregressive Transformer

Paper • 2605.21131 • Published May 20 • 8

upvoted 2 papers about 2 months ago

RADIO-ViPE: Online Tightly Coupled Multi-Modal Fusion for Open-Vocabulary Semantic SLAM in Dynamic Environments

Paper • 2604.26067 • Published Apr 28 • 75

MoCapAnything V2: End-to-End Motion Capture for Arbitrary Skeletons

Paper • 2604.28130 • Published Apr 30 • 23

upvoted 3 papers 2 months ago

Sapiens2

Paper • 2604.21681 • Published Apr 23 • 22

World-R1: Reinforcing 3D Constraints for Text-to-Video Generation

Paper • 2604.24764 • Published Apr 27 • 119

Video Analysis and Generation via a Semantic Progress Function

Paper • 2604.22554 • Published Apr 24 • 64