SeeFun's picture

🏗️ Building on HF

SeeFun

AI4Industry

·

seefun

AI & ML interests

None yet

Recent Activity

liked a model 5 days ago

baidu/Unlimited-OCR

liked a model 5 days ago

UniParser/EM3M-Gen

liked a dataset 5 days ago

View all activity

Organizations

upvoted a collection 3 months ago

Nemotron-Cascade 2

Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation • 4 items • Updated 16 days ago • 50

upvoted a paper 4 months ago

OmniScience: A Large-scale Multi-modal Dataset for Scientific Image Understanding

Paper • 2602.13758 • Published Feb 14 • 6

upvoted 2 articles 5 months ago

Article

流式数据集：效率提升 100 倍

+3

andito, lhoestq, burtenshaw, pcuenq, merve

•

Oct 27, 2025

• 7

Article

LightOnOCR-2-1B: a lightweight high-performance end-to-end OCR model family

lightonai

•

Jan 19

• 96

upvoted a collection 5 months ago

TranslateGemma

3 items • Updated Mar 12 • 245

upvoted 2 papers 6 months ago

Uni-Parser Technical Report

Paper • 2512.15098 • Published Dec 17, 2025 • 2

RxnBench: A Multimodal Benchmark for Evaluating Large Language Models on Chemical Reaction Understanding from Scientific Literature

Paper • 2512.23565 • Published Dec 29, 2025 • 1

upvoted a collection 6 months ago

RxnBench

Chemical Reaction VQA Benchmark • 2 items • Updated 5 days ago • 1

upvoted a paper 8 months ago

olmOCR 2: Unit Test Rewards for Document OCR

Paper • 2510.19817 • Published Oct 22, 2025 • 17

upvoted a paper 9 months ago

SAIL-VL2 Technical Report

Paper • 2509.14033 • Published Sep 17, 2025 • 46

upvoted 2 collections 10 months ago

MolDet

Molecule Image Detection • 4 items • Updated 5 days ago • 2

MolParser

Molecule Image Recognition • 2 items • Updated 5 days ago • 1

upvoted an article 11 months ago

Article

NVIDIA Releases 3 Million Sample Dataset for OCR, Visual Question Answering, and Captioning Tasks

nvidia

•

Aug 11, 2025

• 76

upvoted a paper about 1 year ago

SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion

Paper • 2503.11576 • Published Mar 14, 2025 • 164

upvoted a collection about 1 year ago

SigLIP 2

OpenCLIP and timm SigLIP 2 models • 49 items • Updated May 18 • 27

upvoted 3 articles over 1 year ago

Article

SmolVLM2: Bringing Video Understanding to Every Device

+5

orrzohar, mfarre, andito, merve, pcuenq, cyrilzakka, Xenova

•

Feb 20, 2025

• 343

Article

Open-source DeepResearch – Freeing our search agents

+3

m-ric, albertvillanova, merve, thomwolf, clefourrier

•

Feb 4, 2025

• 1.32k

Article

Timm ❤️ Transformers: Use any timm model with transformers

+3

ariG23498, rwightman, qubvel-hf, pcuenq, reach-vb

•

Jan 16, 2025

• 55

upvoted a paper over 1 year ago

MolParser: End-to-end Visual Recognition of Molecule Structures in the Wild

Paper • 2411.11098 • Published Nov 17, 2024 • 1

upvoted a collection almost 2 years ago

Qwen2-VL

Vision-language model series based on Qwen2 • 15 items • Updated Mar 2 • 233