Ruilin's picture

Ruilin

Antimage01

·

AI & ML interests

None yet

Recent Activity

liked a model about 1 month ago

tencent/Hy3-preview

liked a model about 1 month ago

tencent/Hy-MT2-30B-A3B

authored a paper 4 months ago

From Narrow to Panoramic Vision: Attention-Guided Cold-Start Reshapes Multimodal Reasoning

View all activity

Organizations

upvoted a paper 4 months ago

From Narrow to Panoramic Vision: Attention-Guided Cold-Start Reshapes Multimodal Reasoning

Paper • 2603.03825 • Published Mar 4 • 11

upvoted a paper 11 months ago

AnyCap Project: A Unified Framework, Dataset, and Benchmark for Controllable Omni-modal Captioning

Paper • 2507.12841 • Published Jul 17, 2025 • 43

upvoted a paper about 1 year ago

Qwen2.5-Omni Technical Report

Paper • 2503.20215 • Published Mar 26, 2025 • 173

upvoted 4 papers over 1 year ago

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published Feb 19, 2025 • 219

Improving Video Generation with Human Feedback

Paper • 2501.13918 • Published Jan 23, 2025 • 54

EpiCoder: Encompassing Diversity and Complexity in Code Generation

Paper • 2501.04694 • Published Jan 8, 2025 • 18

URSA: Understanding and Verifying Chain-of-thought Reasoning in Multimodal Mathematics

Paper • 2501.04686 • Published Jan 8, 2025 • 53

upvoted a collection over 1 year ago

Qwen2-VL

Vision-language model series based on Qwen2 • 15 items • Updated Mar 2 • 233

upvoted 2 papers over 1 year ago

Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling

Paper • 2412.05271 • Published Dec 6, 2024 • 162

Critical Tokens Matter: Token-Level Contrastive Estimation Enhence LLM's Reasoning Capability

Paper • 2411.19943 • Published Nov 29, 2024 • 63