AoqiWu

wswaq

AI & ML interests

None yet

Recent Activity

upvoted a paper about 13 hours ago

Scaling Native Multimodal Pre-Training From Scratch

upvoted a paper 2 months ago

LocateAnything: Fast and High-Quality Vision-Language Grounding with Parallel Box Decoding

upvoted a paper 2 months ago

SkillOpt: Executive Strategy for Self-Evolving Agent Skills


Organizations

None yet

upvoted a paper about 13 hours ago

Scaling Native Multimodal Pre-Training From Scratch

Paper • 2607.22043 • Published 4 days ago • 13

upvoted 2 papers 2 months ago

LocateAnything: Fast and High-Quality Vision-Language Grounding with Parallel Box Decoding

Paper • 2605.27365 • Published May 26 • 146

SkillOpt: Executive Strategy for Self-Evolving Agent Skills

Paper • 2605.23904 • Published May 22 • 263

upvoted a paper 3 months ago

World-R1: Reinforcing 3D Constraints for Text-to-Video Generation

Paper • 2604.24764 • Published Apr 27 • 119

upvoted 4 papers 5 months ago

Pretraining A Large Language Model using Distributed GPUs: A Memory-Efficient Decentralized Paradigm

Paper • 2602.11543 • Published Feb 12 • 6

OPUS: Towards Efficient and Principled Data Selection in Large Language Model Pre-training in Every Iteration

Paper • 2602.05400 • Published Feb 5 • 354

CoCo: Code as CoT for Text-to-Image Preview and Rare Concept Generation

Paper • 2603.08652 • Published Mar 9 • 40

Proact-VL: A Proactive VideoLLM for Real-Time AI Companions

Paper • 2603.03447 • Published Mar 3 • 37

upvoted 2 papers 6 months ago

GEBench: Benchmarking Image Generation Models as GUI Environments

Paper • 2602.09007 • Published Feb 9 • 39

Diversity-Preserved Distribution Matching Distillation for Fast Visual Synthesis

Paper • 2602.03139 • Published Feb 3 • 45

upvoted an article 12 months ago

Mixture of Experts Explained

Article • osanseviero, lewtun, philschmid, smangrul, ybelkada, pcuenq • Dec 11, 2023 • 1.16k

upvoted a paper over 1 year ago

SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features

Paper • 2502.14786 • Published Feb 20, 2025 • 167

upvoted a collection over 1 year ago

LLM2CLIP

Collection

LLM2CLIP makes the SOTA pretrained CLIP model more SOTA than ever. • 10 items • Updated Mar 2 • 69
