AI & ML interests

None yet

Recent Activity

liked a model 7 days ago

MiniT2I/MiniT2I

liked a model 7 days ago

BiliSakura/JiT-diffusers

upvoted a paper 7 days ago

RepFusion: Leveraging Multimodal Priors for Denoising in Representation Space

View all activity

Organizations

upvoted a paper 7 days ago

RepFusion: Leveraging Multimodal Priors for Denoising in Representation Space

Paper • 2606.14700 • Published 13 days ago • 18

upvoted a paper about 1 month ago

Qwen-Image-2.0 Technical Report

Paper • 2605.10730 • Published May 11 • 114

upvoted 2 papers about 2 months ago

HumanNet: Scaling Human-centric Video Learning to One Million Hours

Paper • 2605.06747 • Published May 7 • 54

Efficient Training on Multiple Consumer GPUs with RoundPipe

Paper • 2604.27085 • Published Apr 29 • 47

upvoted 2 papers 3 months ago

One Layer Is Enough: Adapting Pretrained Visual Encoders for Image Generation

Paper • 2512.07829 • Published Dec 8, 2025 • 25

SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features

Paper • 2502.14786 • Published Feb 20, 2025 • 166

upvoted an article 5 months ago

Article

Visualize and understand GPU memory in PyTorch

qgallouedec

•

Dec 24, 2024

• 273

upvoted a paper 5 months ago

MHLA: Restoring Expressivity of Linear Attention via Token-Level Multi-Head

Paper • 2601.07832 • Published Jan 12 • 53

upvoted an article 6 months ago

Article

混合专家模型（MoE）详解

osanseviero, lewtun, philschmid, smangrul, ybelkada, pcuenq

•

Dec 11, 2023

• 87

upvoted an article 7 months ago

Article

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

ariG23498, merve, pcuenq, reach-vb

•

Mar 12, 2025

• 497

GarvinRay

AI & ML interests

Recent Activity

Organizations

GarvinRay's activity

Visualize and understand GPU memory in PyTorch

混合专家模型（MoE）详解

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM