7 146 56

Frank Sommers PRO

fsommers

fsommers

AI & ML interests

None yet

Recent Activity

upvoted a paper 5 days ago

Unified Multimodal Autoregressive Modeling with Shared Context-Visual Tokenizer is Key to Unification

upvoted a collection 5 days ago

Qwen3.6

upvoted an article 6 days ago

DenseOn with the LateOn: Open State-of-the-Art Single and Multi-Vector Models

View all activity

Organizations

upvoted a paper 5 days ago

Unified Multimodal Autoregressive Modeling with Shared Context-Visual Tokenizer is Key to Unification

Paper • 2606.18249 • Published 9 days ago • 14

upvoted a collection 5 days ago

Qwen3.6

Collection

4 items • Updated Apr 22 • 418

upvoted an article 6 days ago

Article

DenseOn with the LateOn: Open State-of-the-Art Single and Multi-Vector Models

lightonai

•

Apr 21

• 42

upvoted 2 articles 9 days ago

Article

ColBERT-Zero: To Pre-train Or Not To Pre-train ColBERT models?

lightonai

•

Feb 19

• 22

Article

Party is over: regularizing ColBERT models to fix efficient ANN methods

lightonai

•

9 days ago

• 23

liked a model 20 days ago

google/gemma-4-12B

Any-to-Any • 12B • Updated 21 days ago • 338k • 608

upvoted a paper about 1 month ago

PaperFit: Vision-in-the-Loop Typesetting Optimization for Scientific Documents

Paper • 2605.10341 • Published May 11 • 35

upvoted a collection about 2 months ago

Gemma 4

Collection

15 items • Updated 14 days ago • 991

upvoted 2 papers about 2 months ago

Efficient Training on Multiple Consumer GPUs with RoundPipe

Paper • 2604.27085 • Published Apr 29 • 47

Repetition over Diversity: High-Signal Data Filtering for Sample-Efficient German Language Modeling

Paper • 2604.28075 • Published Apr 30 • 20

liked 4 models about 2 months ago

upvoted a collection about 2 months ago

DeepSeek-V4

Collection

4 items • Updated Apr 24 • 691

liked a model 2 months ago

Qwen/Qwen3.6-35B-A3B

Image-Text-to-Text • 36B • Updated Apr 24 • 5.5M • • 2.23k

liked a model 3 months ago

google/gemma-4-31B

Image-Text-to-Text • 33B • Updated 22 days ago • 513k • 429

upvoted a paper 3 months ago

MinerU2.5-Pro: Pushing the Limits of Data-Centric Document Parsing at Scale

Paper • 2604.04771 • Published Apr 6 • 124

liked a Space 3 months ago

Gemma 4 WebGPU

🚀

232

Run Gemma 4 locally in-browser on WebGPU w/ Transformers.js

upvoted an article 3 months ago

Article

Welcome Gemma 4: Frontier multimodal intelligence on device

merve, pcuenq, sergiopaniego, burtenshaw, Steveeeeeeen, alvarobartt, SaylorTwift

•

Apr 2

• 909

Frank Sommers PRO

AI & ML interests

Recent Activity

Organizations

fsommers's activity

DenseOn with the LateOn: Open State-of-the-Art Single and Multi-Vector Models

**ColBERT-Zero: To Pre-train Or Not To Pre-train ColBERT models?**

Party is over: regularizing ColBERT models to fix efficient ANN methods

Gemma 4 WebGPU

Welcome Gemma 4: Frontier multimodal intelligence on device

ColBERT-Zero: To Pre-train Or Not To Pre-train ColBERT models?