hjkim

hojie11

·

hojie11

AI & ML interests

Computer Vision, 3D Vision, Anomaly Detection

Recent Activity

upvoted a paper about 5 hours ago

Domain Arithmetic: One-Shot VLA Adaptation under Environmental Shifts

upvoted a paper 1 day ago

MemLearner: Learning to Query Context memory for Video World Models

upvoted a paper 1 day ago

PolyFlow: Continuous Topology Embedding Flow Matching for Artist-style Mesh Generation

View all activity

Organizations

None yet

upvoted a paper about 5 hours ago

Domain Arithmetic: One-Shot VLA Adaptation under Environmental Shifts

Paper • 2607.00666 • Published 1 day ago • 15

upvoted 2 papers 1 day ago

MemLearner: Learning to Query Context memory for Video World Models

Paper • 2606.31734 • Published 2 days ago • 19

PolyFlow: Continuous Topology Embedding Flow Matching for Artist-style Mesh Generation

Paper • 2606.30673 • Published 7 days ago • 9

upvoted 2 papers 10 days ago

DragMesh-2: Physically Plausible Dexterous Hand-Object Interaction with Articulated Objects

Paper • 2606.15133 • Published 19 days ago • 74

Moebius: 0.2B Lightweight Image Inpainting Framework with 10B-Level Performance

Paper • 2606.19195 • Published 15 days ago • 139

upvoted a paper 13 days ago

Beyond Static Leaderboards: Predictive Validity for the Evaluation of LLM Agents

Paper • 2606.19704 • Published 14 days ago • 41

upvoted 4 papers 16 days ago

World Tracing: Generative Pixel-Aligned Geometry Beyond the Visible

Paper • 2606.13652 • Published 21 days ago • 15

μ_0: A Scalable 3D Interaction-Trace World Model

Paper • 2606.13769 • Published 21 days ago • 10

From AGI to ASI

Paper • 2606.12683 • Published 22 days ago • 35

OmniDirector: General Multi-Shot Camera Cloning without Cross-Paired Data

Paper • 2606.13432 • Published 21 days ago • 112

upvoted 2 papers 22 days ago

Latent Spatial Memory for Video World Models

Paper • 2606.09828 • Published 24 days ago • 71

Agents' Last Exam

Paper • 2606.05405 • Published 29 days ago • 372

upvoted 6 papers about 1 month ago

LocateAnything: Fast and High-Quality Vision-Language Grounding with Parallel Box Decoding

Paper • 2605.27365 • Published May 26 • 145

TideGS: Scalable Training of Over One Billion 3D Gaussian Splatting Primitives via Out-of-Core Optimization

Paper • 2605.20150 • Published May 19 • 7

RT-Splatting: Joint Reflection-Transmission Modeling with Gaussian Splatting

Paper • 2605.18263 • Published May 18 • 9

Aurora: Unified Video Editing with a Tool-Using Agent

Paper • 2605.18748 • Published May 18 • 29

When Vision Speaks for Sound

Paper • 2605.16403 • Published May 13 • 161

UniT: Unified Geometry Learning with Group Autoregressive Transformer

Paper • 2605.21131 • Published May 20 • 8

upvoted 2 papers about 2 months ago

RADIO-ViPE: Online Tightly Coupled Multi-Modal Fusion for Open-Vocabulary Semantic SLAM in Dynamic Environments

Paper • 2604.26067 • Published Apr 28 • 75

MoCapAnything V2: End-to-End Motion Capture for Arbitrary Skeletons

Paper • 2604.28130 • Published Apr 30 • 23