7707 6

GuoLiangTang

Tommy930

https://github.com/TommyTang930

AI & ML interests

LLM，NLP，ML

Recent Activity

upvoted a paper about 4 hours ago

COrigami: An AI Pipeline for Co-Designing Flat-Foldable Visually Recognisable Origami

upvoted a paper about 4 hours ago

OPID: On-Policy Skill Distillation for Agentic Reinforcement Learning

upvoted a paper about 10 hours ago

Discretizing Reward Models

View all activity

Organizations

None yet

upvoted 2 papers about 4 hours ago

COrigami: An AI Pipeline for Co-Designing Flat-Foldable Visually Recognisable Origami

Paper • 2606.26299 • Published 3 days ago • 1

OPID: On-Policy Skill Distillation for Agentic Reinforcement Learning

Paper • 2606.26790 • Published 1 day ago • 31

upvoted a paper about 10 hours ago

Discretizing Reward Models

Paper • 2606.21795 • Published 8 days ago • 2

upvoted 3 papers about 11 hours ago

DanceOPD: On-Policy Generative Field Distillation

Paper • 2606.27377 • Published 1 day ago • 52

GUI vs. CLI: Execution Bottlenecks in Screen-Only and Skill-Mediated Computer-Use Agents

Paper • 2606.24551 • Published 5 days ago • 6

The Verification Horizon: No Silver Bullet for Coding Agent Rewards

Paper • 2606.26300 • Published 3 days ago • 24

upvoted 2 papers about 14 hours ago

Fast LeWorldModel

Paper • 2606.26217 • Published 3 days ago • 5

Qwen-Image-Agent: Bridging the Context Gap in Real-World Image Generation

Paper • 2606.26907 • Published 1 day ago • 31

upvoted 12 papers 1 day ago

Lite Any Stereo V2: Faster and Stronger Efficient Zero-Shot Stereo Matching

Paper • 2606.24457 • Published 4 days ago • 3

Beyond Reward Engineering: A Data Recipe for Long-Context Reinforcement Learning

Paper • 2606.18831 • Published 10 days ago • 6

Notes2Skills: From Lab Notebooks to Certainty-Aware Scientific Agent Skills

Paper • 2606.11897 • Published 17 days ago • 11

BioMatrix: Towards a Comprehensive Biological Foundation Model Spanning the Modality Matrix of Sequences, Structures, and Language

Paper • 2606.22138 • Published 7 days ago • 22

DataClaw0: Agentic Tailoring Multimodal Data from Raw Streams

Paper • 2606.21337 • Published 8 days ago • 70

ReMMD: Realistic Multilingual Multi-Image Agentic Verification for Multimodal Misinformation Detection

Paper • 2606.24112 • Published 4 days ago • 3

UnityShots: Memory-Driven Multi-Shot Audio-Video Generation with Boundary-Aware Gating

Paper • 2606.21661 • Published 8 days ago • 21

ShutterMuse: Capture-Time Photography Guidance with MLLMs

Paper • 2606.25763 • Published 3 days ago • 38

Causal-rCM: A Unified Teacher-Forcing and Self-Forcing Open Recipe for Autoregressive Diffusion Distillation in Streaming Video Generation and Interactive World Models

Paper • 2606.25473 • Published 3 days ago • 20

GuoLiangTang

AI & ML interests

Recent Activity

Organizations

Tommy930's activity