5 30 31

Xiao Xu

LooperXX

https://github.com/LooperXX

AI & ML interests

Vision-Language Learning, Large Language Model.

Recent Activity

upvoted a paper about 9 hours ago

Qwen-Image-Agent: Bridging the Context Gap in Real-World Image Generation

authored a paper about 2 months ago

Qwen-Image-2.0 Technical Report

upvoted a paper about 2 months ago

Qwen-Image-2.0 Technical Report

View all activity

Organizations

upvoted a paper about 9 hours ago

Qwen-Image-Agent: Bridging the Context Gap in Real-World Image Generation

Paper • 2606.26907 • Published 1 day ago • 29

upvoted a paper about 2 months ago

Qwen-Image-2.0 Technical Report

Paper • 2605.10730 • Published May 11 • 114

upvoted a paper 5 months ago

Elastic Attention: Test-time Adaptive Sparsity Ratios for Efficient Transformers

Paper • 2601.17367 • Published Jan 24 • 33

upvoted a paper 11 months ago

Qwen-Image Technical Report

Paper • 2508.02324 • Published Aug 4, 2025 • 276

upvoted a paper 12 months ago

AI4Research: A Survey of Artificial Intelligence for Scientific Research

Paper • 2507.01903 • Published Jul 2, 2025 • 5

upvoted a paper over 1 year ago

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published Feb 19, 2025 • 219

upvoted a collection over 1 year ago

Deepseek Papers

Collection

Deepseek papers collection • 32 items • Updated 4 days ago • 352

upvoted 5 papers over 1 year ago

MiniMax-01: Scaling Foundation Models with Lightning Attention

Paper • 2501.08313 • Published Jan 14, 2025 • 305

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 380

Exploring Multi-Grained Concept Annotations for Multimodal Large Language Models

Paper • 2412.05939 • Published Dec 8, 2024 • 15

Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction

Paper • 2412.04454 • Published Dec 5, 2024 • 71

PaliGemma 2: A Family of Versatile VLMs for Transfer

Paper • 2412.03555 • Published Dec 4, 2024 • 136

upvoted 8 papers almost 2 years ago

Qwen2.5-Coder Technical Report

Paper • 2409.12186 • Published Sep 18, 2024 • 157

Qwen2-VL: Enhancing Vision-Language Model's Perception of the World at Any Resolution

Paper • 2409.12191 • Published Sep 18, 2024 • 80

LLaVA-NeXT-Interleave: Tackling Multi-image, Video, and 3D in Large Multimodal Models

Paper • 2407.07895 • Published Jul 10, 2024 • 42

Xiao Xu

AI & ML interests

Recent Activity

Organizations

LooperXX's activity