Emanuele Vivoli

emanuelevivoli

https://www.emanuelevivoli.me

AI & ML interests

I work on Comics/Manga :)

Recent Activity

new activity about 1 month ago

rednote-hilab/dots.mocr:[DRAFT] fix: transformers 5.x compat (cache_position + kwargs naming)

new activity about 1 month ago

rednote-hilab/dots.ocr:[DRAFT] fix: transformers 5.x compat (cache_position + kwargs naming)

new activity about 1 month ago

rednote-hilab/dots.ocr:[DRAFT] fix: transformers 5.x compat (cache_position + kwargs naming)

View all activity

Organizations

upvoted an article 2 months ago

Article

DeepSeek-V4: a million-token context that agents can actually use

burtenshaw

•

Apr 24

• 50

upvoted an article 3 months ago

Article

LightOnOCR-2-1B: a lightweight high-performance end-to-end OCR model family

lightonai

•

Jan 19

• 96

upvoted a collection 3 months ago

SigLIP2

Collection

36 items • Updated Mar 12 • 122

upvoted a paper 6 months ago

mHC: Manifold-Constrained Hyper-Connections

Paper • 2512.24880 • Published Dec 31, 2025 • 329

upvoted an article 7 months ago

Article

Continuous batching from first principles

ror, ArthurZ, mcpotato

•

Nov 25, 2025

• 411

upvoted 2 articles 8 months ago

Article

SmolVLM2: Bringing Video Understanding to Every Device

orrzohar, mfarre, andito, merve, pcuenq, cyrilzakka, Xenova

•

Feb 20, 2025

• 343

Article

Granite 4.0 Nano: Just how small can you go?

ibm-granite

•

Oct 28, 2025

• 125

upvoted a paper 8 months ago

DeepAgent: A General Reasoning Agent with Scalable Toolsets

Paper • 2510.21618 • Published Oct 24, 2025 • 103

upvoted an article 8 months ago

Article

Vision Language Models (Better, faster, stronger)

merve, sergiopaniego, ariG23498, pcuenq, andito

•

May 12, 2025

• 614

upvoted an article 9 months ago

Article

Preference Optimization for Vision Language Models

qgallouedec, vwxyzjn, merve, kashif

•

Jul 10, 2024

• 93

upvoted a collection 9 months ago

Qwen3-VL

Collection

37 items • Updated Dec 31, 2025 • 748

upvoted a paper 10 months ago

Intern-S1: A Scientific Multimodal Foundation Model

Paper • 2508.15763 • Published Aug 21, 2025 • 274

upvoted an article 11 months ago

Article

Welcome GPT OSS, the new open-source model family from OpenAI!

reach-vb, pcuenq, lewtun, clem, Rocketknight1, clefourrier, celinah, Wauplin, marcsun13, pagezyhf, ahadnagy, joaogante

•

Aug 5, 2025

• 513

upvoted 3 papers 11 months ago

Agentic Reinforced Policy Optimization

Paper • 2507.19849 • Published Jul 26, 2025 • 161

Group Sequence Policy Optimization

Paper • 2507.18071 • Published Jul 24, 2025 • 320

Vision as a Dialect: Unifying Visual Understanding and Generation via Text-Aligned Representations

Paper • 2506.18898 • Published Jun 23, 2025 • 35

upvoted 2 collections 11 months ago

Tar

Collection

[NeurIPS 2025] Unifying Visual Understanding and Generation via Text-Aligned Representations • 5 items • Updated Sep 20, 2025 • 18

Open LLM Leaderboard best models ❤️‍🔥

Collection

A daily uploaded list of models with best evaluations on the LLM leaderboard: • 50 items • Updated Mar 13 • 694

upvoted an article 12 months ago

Article

Efficient MultiModal Data Pipeline

ariG23498, lusxvr, andito, sergiopaniego, pcuenq

•

Jul 8, 2025

• 72

upvoted a paper about 1 year ago

MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention

Paper • 2506.13585 • Published Jun 16, 2025 • 278

Emanuele Vivoli

AI & ML interests

Recent Activity

Organizations

emanuelevivoli's activity

DeepSeek-V4: a million-token context that agents can actually use

LightOnOCR-2-1B: a lightweight high-performance end-to-end OCR model family

Continuous batching from first principles

SmolVLM2: Bringing Video Understanding to Every Device

Granite 4.0 Nano: Just how small can you go?

Vision Language Models (Better, faster, stronger)

Preference Optimization for Vision Language Models

Welcome GPT OSS, the new open-source model family from OpenAI!

Efficient MultiModal Data Pipeline