Cihang Xie

cihangxie

https://cihangxie.github.io/

Recent Activity

upvoted a paper 12 days ago

VisualClaw: A Real-Time, Personalized Agent for the Physical World

liked a dataset 12 days ago

UCSC-VLAA/VisualClawArena

upvoted a paper about 1 month ago

AutoResearchClaw: Self-Reinforcing Autonomous Research with Human-AI Collaboration

Organizations

upvoted a paper 12 days ago

VisualClaw: A Real-Time, Personalized Agent for the Physical World

Paper • 2606.16295 • Published 13 days ago • 28

upvoted 2 papers about 1 month ago

AutoResearchClaw: Self-Reinforcing Autonomous Research with Human-AI Collaboration

Paper • 2605.20025 • Published May 19 • 190

AudioMosaic: Contrastive Masked Audio Representation Learning

Paper • 2605.14231 • Published May 14 • 3

upvoted 2 papers 2 months ago

VLAA-GUI: Knowing When to Stop, Recover, and Search, A Modular Framework for GUI Automation

Paper • 2604.21375 • Published Apr 23 • 19

Target-Oriented Pretraining Data Selection via Neuron-Activated Graph

Paper • 2604.15706 • Published Apr 17 • 10

upvoted 4 papers 3 months ago

Your Agent, Their Asset: A Real-World Safety Analysis of OpenClaw

Paper • 2604.04759 • Published Apr 6 • 24

ClawArena: Benchmarking AI Agents in Evolving Information Environments

Paper • 2604.04202 • Published Apr 5 • 37

Omni-SimpleMem: Autoresearch-Guided Discovery of Lifelong Multimodal Agent Memory

Paper • 2604.01007 • Published Apr 2 • 31

MetaClaw: Just Talk -- An Agent That Meta-Learns and Evolves in the Wild

Paper • 2603.17187 • Published Mar 17 • 141

upvoted a paper 5 months ago

OpenVision 3: A Family of Unified Visual Encoder for Both Understanding and Generation

Paper • 2601.15369 • Published Jan 21 • 22

upvoted a paper 6 months ago

SimpleMem: Efficient Lifelong Memory for LLM Agents

Paper • 2601.02553 • Published Jan 5 • 38

upvoted a paper 7 months ago

SpatialThinker: Reinforcing 3D Reasoning in Multimodal LLMs via Spatial Rewards

Paper • 2511.07403 • Published Nov 10, 2025 • 14

upvoted a collection 7 months ago

SpatialThinker

Collection

This collection consists of SpatialThinker 3B and 7B model checkpoints, and STVQA-7K, a Spatial VQA dataset used for training the models. • 4 items • Updated Nov 12, 2025 • 1

upvoted 2 papers 8 months ago

When Visualizing is the First Step to Reasoning: MIRA, a Benchmark for Visual Chain-of-Thought

Paper • 2511.02779 • Published Nov 4, 2025 • 60

LightBagel: A Light-weighted, Double Fusion Framework for Unified Multimodal Understanding and Generation

Paper • 2510.22946 • Published Oct 27, 2025 • 18

upvoted a paper 10 months ago

OpenVision 2: A Family of Generative Pretrained Visual Encoders for Multimodal Learning

Paper • 2509.01644 • Published Sep 1, 2025 • 34

upvoted a paper 11 months ago

GPT-IMAGE-EDIT-1.5M: A Million-Scale, GPT-Generated Image Dataset

Paper • 2507.21033 • Published Jul 28, 2025 • 23

upvoted 3 papers about 1 year ago

OpenVision: A Fully-Open, Cost-Effective Family of Advanced Vision Encoders for Multimodal Learning

Paper • 2505.04601 • Published May 7, 2025 • 29

Complex-Edit: CoT-Like Instruction Generation for Complexity-Controllable Image Editing Benchmark

Paper • 2504.13143 • Published Apr 17, 2025 • 7

SFT or RL? An Early Investigation into Training R1-Like Reasoning Large Vision-Language Models

Paper • 2504.11468 • Published Apr 10, 2025 • 30
