🏗️ Building on HF

33 23

Yauhen Yavorski

slappatuski

AI & ML interests

image generation, image-to-image, text-to-image, inpainting, and video generation

Recent Activity

upvoted an article 18 days ago

Supercharge your OCR Pipelines with Open Models

liked a model 30 days ago

suno/bark

liked a model about 1 month ago

deepseek-ai/DeepSeek-V3.2

View all activity

Organizations

upvoted an article 18 days ago

Article

Supercharge your OCR Pipelines with Open Models

merve, ariG23498, davanstrien, hynky, andito, reach-vb, pcuenq

•

Oct 21, 2025

• 315

upvoted 2 papers about 1 month ago

RAG-Anything: All-in-One RAG Framework

Paper • 2510.12323 • Published Oct 14, 2025 • 82

SAM 3: Segment Anything with Concepts

Paper • 2511.16719 • Published Nov 20, 2025 • 137

upvoted 4 papers about 2 months ago

upvoted an article about 2 months ago

Article

LightOnOCR-2-1B: a lightweight high-performance end-to-end OCR model family

lightonai

•

Jan 19

• 96

upvoted 2 papers about 2 months ago

Recursive Multi-Agent Systems

Paper • 2604.25917 • Published Apr 28 • 286

LLaDA2.0-Uni: Unifying Multimodal Understanding and Generation with Diffusion Large Language Model

Paper • 2604.20796 • Published Apr 22 • 244

upvoted a paper 2 months ago

Operationalizing a National Digital Library: The Case for a Norwegian Transformer Model

Paper • 2104.09617 • Published Apr 19, 2021 • 2

upvoted an article 2 months ago

Article

Introducing the Synthetic Data Generator - Build Datasets with Natural Language

davidberenstein1957, sdiazlor, Leiyre, dvilasuero, Ameeeee, burtenshaw

•

Dec 16, 2024

• 163

upvoted a collection 2 months ago

BERT release

Collection

Regroups the original BERT models released by the Google team. Except for the models marked otherwise, the checkpoints support English. • 8 items • Updated Mar 12 • 44

upvoted a collection 3 months ago

Gemma 4

Collection

15 items • Updated 17 days ago • 997

upvoted an article 4 months ago

Article

Using LoRA for Efficient Stable Diffusion Fine-Tuning

pcuenq, sayakpaul

•

Jan 26, 2023

• 83

upvoted a paper 9 months ago

SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion

Paper • 2503.11576 • Published Mar 14, 2025 • 164

upvoted 2 articles 9 months ago

Article

Vision Language Models (Better, faster, stronger)

merve, sergiopaniego, ariG23498, pcuenq, andito

•

May 12, 2025

• 614

Article

TimeScope: How Long Can Your Video Large Multimodal Model Go?

orrzohar, ruili0, andito, nicholswang

•

Jul 23, 2025

• 48

upvoted 2 articles 10 months ago

Article

NVIDIA Releases 3 Million Sample Dataset for OCR, Visual Question Answering, and Captioning Tasks

nvidia

•

Aug 11, 2025

• 76

Article

Introducing Idefics2: A Powerful 8B Vision-Language Model for the community

Leyo, HugoLaurencon, VictorSanh

•

Apr 15, 2024

• 191

Yauhen Yavorski

AI & ML interests

Recent Activity

Organizations

slappatuski's activity

Supercharge your OCR Pipelines with Open Models

LightOnOCR-2-1B: a lightweight high-performance end-to-end OCR model family

Introducing the Synthetic Data Generator - Build Datasets with Natural Language

Using LoRA for Efficient Stable Diffusion Fine-Tuning

Vision Language Models (Better, faster, stronger)

TimeScope: How Long Can Your Video Large Multimodal Model Go?

NVIDIA Releases 3 Million Sample Dataset for OCR, Visual Question Answering, and Captioning Tasks

Introducing Idefics2: A Powerful 8B Vision-Language Model for the community