14 19

Nikolay Tynyanov

tynyanov

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

PaddleOCR-VL-1.6: Expanding the Frontier of Document Parsing with Under-Optimized Region Refinement and Progressive Post-Training

liked a model 29 days ago

unsloth/Qwen3.6-27B-NVFP4

liked a model about 1 month ago

AngelSlim/Hy-MT1.5-1.8B-1.25bit

View all activity

Organizations

None yet

upvoted a paper 1 day ago

PaddleOCR-VL-1.6: Expanding the Frontier of Document Parsing with Under-Optimized Region Refinement and Progressive Post-Training

Paper • 2606.03264 • Published 2 days ago • 11

upvoted an article 2 months ago

Article

Welcome Gemma 4: Frontier multimodal intelligence on device

merve, pcuenq, sergiopaniego, burtenshaw, Steveeeeeeen, alvarobartt, SaylorTwift

•

Apr 2

• 903

upvoted a collection 4 months ago

Qwen3.5

Collection

21 items • Updated Mar 9 • 1.67k

upvoted a paper 4 months ago

CodeOCR: On the Effectiveness of Vision Language Models in Code Understanding

Paper • 2602.01785 • Published Feb 2 • 97

upvoted an article 4 months ago

Article

LightOnOCR-1B: The Case for End-to-End and Efficient Domain-Specific Vision-Language Models for OCR

lightonai

•

Oct 23, 2025

• 74

upvoted an article 6 months ago

Article

Building Deep Research: How we Achieved State of the Art

Tavily

•

Nov 24, 2025

• 36

upvoted an article 7 months ago

Article

Supercharge your OCR Pipelines with Open Models

merve, ariG23498, davanstrien, hynky, andito, reach-vb, pcuenq

•

Oct 21, 2025

• 314

upvoted a paper 8 months ago

LongCodeZip: Compress Long Context for Code Language Models

Paper • 2510.00446 • Published Oct 1, 2025 • 108

upvoted a paper about 1 year ago

BLIP3-o: A Family of Fully Open Unified Multimodal Models-Architecture, Training and Dataset

Paper • 2505.09568 • Published May 14, 2025 • 99

upvoted an article about 1 year ago

Article

Vision Language Models (Better, faster, stronger)

merve, sergiopaniego, ariG23498, pcuenq, andito

•

May 12, 2025

• 613

upvoted 2 papers about 1 year ago

Open-Qwen2VL: Compute-Efficient Pre-Training of Fully-Open Multimodal LLMs on Academic Resources

Paper • 2504.00595 • Published Apr 1, 2025 • 37

SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion

Paper • 2503.11576 • Published Mar 14, 2025 • 161

upvoted an article about 1 year ago

Article

SigLIP 2: A better multilingual vision language encoder

ariG23498, merve, qubvel-hf

•

Feb 21, 2025

• 216

upvoted an article over 1 year ago

Article

SmolVLM2: Bringing Video Understanding to Every Device

orrzohar, mfarre, andito, merve, pcuenq, cyrilzakka, Xenova

•

Feb 20, 2025

• 340

Nikolay Tynyanov

AI & ML interests

Recent Activity

Organizations

tynyanov's activity

Welcome Gemma 4: Frontier multimodal intelligence on device

LightOnOCR-1B: The Case for End-to-End and Efficient Domain-Specific Vision-Language Models for OCR

Building Deep Research: How we Achieved State of the Art

Supercharge your OCR Pipelines with Open Models

Vision Language Models (Better, faster, stronger)

SigLIP 2: A better multilingual vision language encoder

SmolVLM2: Bringing Video Understanding to Every Device