guoguoc PRO

woshichaoren123

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 6 days ago

Vesta: A Generalist Embodied Reasoning Model

upvoted a paper 11 days ago

One Model, Many Latencies: Universal Speech Enhancement for Diverse Real-Time Applications

upvoted a paper 17 days ago

Text-Vision Co-Instructed Image Editing

View all activity

Organizations

None yet

upvoted a paper 6 days ago

Vesta: A Generalist Embodied Reasoning Model

Paper • 2606.20905 • Published 18 days ago • 10

upvoted a paper 11 days ago

One Model, Many Latencies: Universal Speech Enhancement for Diverse Real-Time Applications

Paper • 2606.25621 • Published 12 days ago • 19

upvoted a paper 17 days ago

Text-Vision Co-Instructed Image Editing

Paper • 2606.16767 • Published 21 days ago • 19

upvoted a paper 19 days ago

Zone of Proximal Policy Optimization: Teacher in Prompts, Not Gradients

Paper • 2606.18216 • Published 20 days ago • 63

upvoted a paper 22 days ago

ThinkAct: Vision-Language-Action Reasoning via Reinforced Visual Latent Planning

Paper • 2507.16815 • Published Jul 22, 2025 • 43

upvoted 5 papers 24 days ago

LabVLA: Grounding Vision-Language-Action Models in Scientific Laboratories

Paper • 2606.13578 • Published 25 days ago • 56

FORT-Searcher: Synthesizing Shortcut-Resistant Search Tasks for Training Deep Search Agents

Paper • 2606.12087 • Published 26 days ago • 77

EvoArena: Tracking Memory Evolution for Robust LLM Agents in Dynamic Environments

Paper • 2606.13681 • Published 25 days ago • 142

InterleaveThinker: Reinforcing Agentic Interleaved Generation

Paper • 2606.13679 • Published 25 days ago • 82

SpatialClaw: Rethinking Action Interface for Agentic Spatial Reasoning

Paper • 2606.13673 • Published 25 days ago • 110

upvoted 5 papers about 1 month ago

Cosmos 3: Omnimodal World Models for Physical AI

Paper • 2606.02800 • Published Jun 1 • 140

Agent Explorative Policy Optimization for Multimodal Agentic Reasoning

Paper • 2605.28774 • Published May 27 • 93

LocateAnything: Fast and High-Quality Vision-Language Grounding with Parallel Box Decoding

Paper • 2605.27365 • Published May 26 • 145

DVAO: Dynamic Variance-adaptive Advantage Optimization for Multi-reward Reinforcement Learning

Paper • 2605.25604 • Published May 25 • 138

Sensor2Sensor: Cross-Embodiment Sensor Conversion for Autonomous Driving

Paper • 2605.22809 • Published May 21 • 27

upvoted a paper about 2 months ago

LongLive-2.0: An NVFP4 Parallel Infrastructure for Long Video Generation

Paper • 2605.18739 • Published May 18 • 116

upvoted 2 papers 2 months ago

Video Analysis and Generation via a Semantic Progress Function

Paper • 2604.22554 • Published Apr 24 • 64

Seeing Fast and Slow: Learning the Flow of Time in Videos

Paper • 2604.21931 • Published Apr 23 • 19

upvoted 2 papers 3 months ago

HiVLA: A Visual-Grounded-Centric Hierarchical Embodied Manipulation System

Paper • 2604.14125 • Published Apr 15 • 21

HY-Embodied-0.5: Embodied Foundation Models for Real-World Agents

Paper • 2604.07430 • Published Apr 8 • 182