madstuntman11 (s k)

upvoted a collection 7 months ago

Qwen3-Next

Collection

4 items • Updated Dec 31, 2025 • 187

upvoted an article 7 months ago

Article

Engineering Notes: Training a LoRA for Z-Image Turbo with the Ostris AI Toolkit

content-and-code • Dec 2, 2025 • 15

upvoted 2 articles 11 months ago

Article

Introducing Command A Vision: Multimodal AI built for Business

CohereLabs • Jul 31, 2025 • 64

Article

TimeScope: How Long Can Your Video Large Multimodal Model Go?

orrzohar, ruili0, andito, nicholswang • Jul 23, 2025 • 48

upvoted 3 articles 12 months ago

Article

Image compositing with diffusers

OzzyGT • Jul 17, 2025 • 6

Article

Should We Still Pretrain Encoders with Masked Language Modeling?

Nicolas-BZRD • Jul 2, 2025 • 22

Article

Open-source DeepResearch – Freeing our search agents

m-ric, albertvillanova, merve, thomwolf, clefourrier • Feb 4, 2025 • 1.32k

upvoted a paper about 1 year ago

OmniGen2: Exploration to Advanced Multimodal Generation

Paper • 2506.18871 • Published Jun 23, 2025 • 79

upvoted 2 collections about 1 year ago

Releases June 6

Collection

39 items • Updated May 13 • 6

Qwen2.5-Coder

Collection

Code-specific model series based on Qwen2.5 • 38 items • Updated Mar 2 • 372

upvoted 2 articles about 1 year ago

Article

Vision Language Models (Better, faster, stronger)

merve, sergiopaniego, ariG23498, pcuenq, andito • May 12, 2025 • 613

Article

Visual Document Retrieval Goes Multilingual

marco, cheesyFishes • Jan 10, 2025 • 77

upvoted 2 articles over 1 year ago

Article

DeepSearch Using Visual RAG in Agentic Frameworks 🔎

paultltc • Mar 21, 2025 • 38

Article

Document Similarity Search with ColPali

fsommers • Sep 21, 2024 • 52

upvoted 2 articles almost 2 years ago

Article

🤗 PEFT welcomes new merging methods

smangrul, sayakpaul • Feb 19, 2024 • 31

Article

ColPali: Efficient Document Retrieval with Vision Language Models 👀

manu • Jul 5, 2024 • 323

upvoted a paper almost 2 years ago

ColPali: Efficient Document Retrieval with Vision Language Models

Paper • 2407.01449 • Published Jun 27, 2024 • 51

upvoted a paper over 2 years ago

ResLoRA: Identity Residual Mapping in Low-Rank Adaption

Paper • 2402.18039 • Published Feb 28, 2024 • 11
