guoguoc PRO

woshichaoren123

28 10

AI & ML interests

None yet

Recent Activity

upvoted a paper 4 days ago

One Model, Many Latencies: Universal Speech Enhancement for Diverse Real-Time Applications

upvoted a paper 10 days ago

Text-Vision Co-Instructed Image Editing

upvoted a paper 13 days ago

Zone of Proximal Policy Optimization: Teacher in Prompts, Not Gradients

View all activity

Organizations

None yet

upvoted a paper 4 days ago

One Model, Many Latencies: Universal Speech Enhancement for Diverse Real-Time Applications

Paper • 2606.25621 • Published 6 days ago • 13

upvoted a paper 10 days ago

Text-Vision Co-Instructed Image Editing

Paper • 2606.16767 • Published 15 days ago • 19

upvoted a paper 13 days ago

Zone of Proximal Policy Optimization: Teacher in Prompts, Not Gradients

Paper • 2606.18216 • Published 14 days ago • 63

upvoted a paper 15 days ago

ThinkAct: Vision-Language-Action Reasoning via Reinforced Visual Latent Planning

Paper • 2507.16815 • Published Jul 22, 2025 • 43

upvoted 4 papers 17 days ago

LabVLA: Grounding Vision-Language-Action Models in Scientific Laboratories

Paper • 2606.13578 • Published 19 days ago • 56

FORT-Searcher: Synthesizing Shortcut-Resistant Search Tasks for Training Deep Search Agents

Paper • 2606.12087 • Published 20 days ago • 77

EvoArena: Tracking Memory Evolution for Robust LLM Agents in Dynamic Environments

Paper • 2606.13681 • Published 19 days ago • 142

InterleaveThinker: Reinforcing Agentic Interleaved Generation

Paper • 2606.13679 • Published 19 days ago • 82

upvoted a paper 18 days ago

SpatialClaw: Rethinking Action Interface for Agentic Spatial Reasoning

Paper • 2606.13673 • Published 19 days ago • 109

upvoted a paper 25 days ago

Cosmos 3: Omnimodal World Models for Physical AI

Paper • 2606.02800 • Published 29 days ago • 136

New activity in nvidia/LocateAnything-3B 27 days ago

Inference support for vLLM and SGLang OpenAI endpoints

➕ 14

#3 opened about 1 month ago by

Vishva007

liked a dataset 29 days ago

VCLab-PolyU/GGT-100K

Updated 28 days ago • 3.64k • 44

upvoted a paper about 1 month ago

Agent Explorative Policy Optimization for Multimodal Agentic Reasoning

Paper • 2605.28774 • Published May 27 • 93

liked a Space about 1 month ago

LocateAnything

💬

358

Detect and label objects in images and videos

liked a model about 1 month ago

nvidia/LocateAnything-3B

Image-Text-to-Text • 4B • Updated 17 days ago • 728k • 2.47k

upvoted 2 papers about 1 month ago

LocateAnything: Fast and High-Quality Vision-Language Grounding with Parallel Box Decoding

Paper • 2605.27365 • Published May 26 • 145

DVAO: Dynamic Variance-adaptive Advantage Optimization for Multi-reward Reinforcement Learning

Paper • 2605.25604 • Published May 25 • 138

updated a dataset about 1 month ago

woshichaoren123/vis_data_0424_data

Updated May 26 • 7

upvoted a paper about 1 month ago

Sensor2Sensor: Cross-Embodiment Sensor Conversion for Autonomous Driving

Paper • 2605.22809 • Published May 21 • 27

updated a Space about 1 month ago

ERQA v6 Error Browser

🚀

Explore and analyze ERQA v6 model errors