3 6 13

Jian Hu

jianh-nvidia

AI & ML interests

None yet

Recent Activity

liked a model 9 days ago

zai-org/GLM-5.2

liked a model 11 days ago

Hcompany/Holo-3.1-35B-A3B

liked a model 12 days ago

moonshotai/Kimi-K2.7-Code

View all activity

Organizations

liked a model 9 days ago

zai-org/GLM-5.2

Text Generation • 753B • Updated 3 days ago • 67.1k • • 2.47k

liked a model 11 days ago

Hcompany/Holo-3.1-35B-A3B

Image-Text-to-Text • 35B • Updated 23 days ago • 4.88k • 63

liked a model 12 days ago

moonshotai/Kimi-K2.7-Code

Image-Text-to-Text • 1.1T • Updated 11 days ago • 502k • • 992

liked a model 29 days ago

Hcompany/Holotron-3-Nano

Image-Text-to-Text • 33B • Updated Apr 28 • 605 • 25

upvoted a paper 30 days ago

LocateAnything: Fast and High-Quality Vision-Language Grounding with Parallel Box Decoding

Paper • 2605.27365 • Published about 1 month ago • 144

liked a model about 1 month ago

Hcompany/Holo3-35B-A3B

Image-Text-to-Text • 35B • Updated Apr 2 • 20.3k • 363

liked a model 2 months ago

deepseek-ai/DeepSeek-V4-Pro

Text Generation • 862B • Updated 3 days ago • 1.88M • • 5.06k

liked a model 3 months ago

Jackrong/Qwopus3.5-27B-v3

Image-Text-to-Text • 27B • Updated Apr 16 • 810 • 249

liked a dataset 3 months ago

ServiceNow/VideoCUA

Updated Mar 30 • 644 • 32

liked 2 models 3 months ago

#51 opened 3 months ago by

jianh-nvidia

upvoted a paper 3 months ago

ProRL Agent: Rollout-as-a-Service for RL Training of Multi-Turn LLM Agents

Paper • 2603.18815 • Published Mar 19 • 14

liked a model 4 months ago

TeichAI/Qwen3.5-27B-Claude-Opus-4.6-Distill

Image-Text-to-Text • 27B • Updated Mar 4 • 25 • 44

liked a dataset 4 months ago

HuggingFaceTB/smollm-corpus

Viewer • Updated Sep 6, 2024 • 237M • 32.5k • 467

upvoted 2 papers 5 months ago

Golden Goose: A Simple Trick to Synthesize Unlimited RLVR Tasks from Unverifiable Internet Text

Paper • 2601.22975 • Published Jan 30 • 113

Fast-ThinkAct: Efficient Vision-Language-Action Reasoning via Verbalizable Latent Planning

Paper • 2601.09708 • Published Jan 14 • 56

liked a dataset 5 months ago

billxbf/aimo_hard_bilingual

Viewer • Updated Mar 1, 2025 • 3.56k • 18 • 1

upvoted a paper 6 months ago

GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

Paper • 2601.05242 • Published Jan 8 • 233

upvoted a paper 7 months ago

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

Paper • 2512.01374 • Published Dec 1, 2025 • 107

Jian Hu

AI & ML interests

Recent Activity

Organizations

jianh-nvidia's activity

How do you hack the cot responses?