1 18 4

Hanrong Ye

leoye

AI & ML interests

None yet

Recent Activity

upvoted a collection about 1 month ago

Nemotron-Labs-Diffusion

upvoted a paper 3 months ago

Attend Before Attention: Efficient and Scalable Video Understanding via Autoregressive Gazing

upvoted a paper 5 months ago

DFlash: Block Diffusion for Flash Speculative Decoding

View all activity

Organizations

upvoted a collection about 1 month ago

Nemotron-Labs-Diffusion

Collection

A Tri-Mode Language Model Family Unifying Autoregressive, Diffusion, and Self-Speculation Decoding • 7 items • Updated 16 days ago • 50

upvoted a paper 3 months ago

Attend Before Attention: Efficient and Scalable Video Understanding via Autoregressive Gazing

Paper • 2603.12254 • Published Mar 12 • 23

upvoted a paper 5 months ago

DFlash: Block Diffusion for Flash Speculative Decoding

Paper • 2602.06036 • Published Feb 5 • 87

upvoted a paper 6 months ago

GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

Paper • 2601.05242 • Published Jan 8 • 233

upvoted 2 papers 7 months ago

ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration

Paper • 2511.21689 • Published Nov 26, 2025 • 128

Nemotron-Flash: Towards Latency-Optimal Hybrid Small Language Models

Paper • 2511.18890 • Published Nov 24, 2025 • 37

upvoted 3 papers 8 months ago

Pico-Banana-400K: A Large-Scale Dataset for Text-Guided Image Editing

Paper • 2510.19808 • Published Oct 22, 2025 • 30

DLER: Doing Length pEnalty Right - Incentivizing More Intelligence per Token via Reinforcement Learning

Paper • 2510.15110 • Published Oct 16, 2025 • 18

OmniVinci: Enhancing Architecture and Data for Omni-Modal Understanding LLM

Paper • 2510.15870 • Published Oct 17, 2025 • 93

upvoted a paper 9 months ago

QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs

Paper • 2510.11696 • Published Oct 13, 2025 • 183

upvoted a paper 12 months ago

Scaling RL to Long Videos

Paper • 2507.07966 • Published Jul 10, 2025 • 161

upvoted a paper about 1 year ago

CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training

Paper • 2504.13161 • Published Apr 17, 2025 • 98

upvoted 5 papers over 1 year ago

Scaling Vision Pre-Training to 4K Resolution

Paper • 2503.19903 • Published Mar 25, 2025 • 42

STIV: Scalable Text and Image Conditioned Video Generation

Paper • 2412.07730 • Published Dec 10, 2024 • 74

3DGS-DET: Empower 3D Gaussian Splatting with Boundary Guidance and Box-Focused Sampling for 3D Object Detection

Paper • 2410.01647 • Published Oct 2, 2024 • 31

Personalized Visual Instruction Tuning

Paper • 2410.07113 • Published Oct 9, 2024 • 70

MM-Ego: Towards Building Egocentric Multimodal LLMs

Paper • 2410.07177 • Published Oct 9, 2024 • 22

upvoted a paper over 2 years ago

SegGen: Supercharging Segmentation Models with Text2Mask and Mask2Img Synthesis

Paper • 2311.03355 • Published Nov 6, 2023 • 2

Hanrong Ye

AI & ML interests

Recent Activity

Organizations

leoye's activity