Hongxu Yin's picture

Hongxu Yin

yinhongxu

·

AI & ML interests

None yet

Recent Activity

new activity 2 months ago

nvidia/NVILA-8B-HD-Video:Update README.md

new activity 2 months ago

nvidia/NVILA-8B-HD-Video:Update README.md

new activity 2 months ago

nvidia/AutoGaze:Update README.md

View all activity

Organizations

New activity in nvidia/NVILA-8B-HD-Video 2 months ago

Update README.md

#2 opened 2 months ago by

Update README.md

#3 opened 2 months ago by

New activity in nvidia/AutoGaze 2 months ago

Update README.md

#2 opened 2 months ago by

New activity in nvidia/NVILA-8B-HD-Video 2 months ago

Update README.md

#1 opened 2 months ago by

published a model 2 months ago

nvidia/NVILA-8B-HD-Video

Updated Mar 19 • 411 • 39

updated a model 2 months ago

nvidia/NVILA-8B-HD-Video

Updated Mar 19 • 411 • 39

published a model 2 months ago

nvidia/AutoGaze

Updated Mar 19 • 6.41k • 25

updated a model 3 months ago

nvidia/AutoGaze

Updated Mar 19 • 6.41k • 25

upvoted a paper 4 months ago

GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

Paper • 2601.05242 • Published Jan 8 • 231

authored a paper 4 months ago

GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

Paper • 2601.05242 • Published Jan 8 • 231

authored 10 papers 5 months ago

CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training

Paper • 2504.13161 • Published Apr 17, 2025 • 98

Scaling RL to Long Videos

Paper • 2507.07966 • Published Jul 10, 2025 • 161

NaVILA: Legged Robot Vision-Language-Action Model for Navigation

Paper • 2412.04453 • Published Dec 5, 2024

EgoVLA: Learning Vision-Language-Action Models from Egocentric Human Videos

Paper • 2507.12440 • Published Jul 16, 2025

3D Aware Region Prompted Vision Language Model

Paper • 2509.13317 • Published Sep 16, 2025 • 14

Test-Time Scaling Strategies for Generative Retrieval in Multimodal Conversational Recommendations

Paper • 2508.18132 • Published Aug 25, 2025

QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs

Paper • 2510.11696 • Published Oct 13, 2025 • 182

OmniVinci: Enhancing Architecture and Data for Omni-Modal Understanding LLM

Paper • 2510.15870 • Published Oct 17, 2025 • 92

DLER: Doing Length pEnalty Right - Incentivizing More Intelligence per Token via Reinforcement Learning

Paper • 2510.15110 • Published Oct 16, 2025 • 18

SpatialRGPT: Grounded Spatial Reasoning in Vision Language Models

Paper • 2406.01584 • Published Jun 3, 2024