1 10 7

sonho

tiiktak

AI & ML interests

None yet

Recent Activity

liked a dataset about 1 month ago

mvp-lab/LLaVA-OneVision-2-Data

liked a model 2 months ago

baidu/ERNIE-Image-Turbo

liked a model 2 months ago

baidu/ERNIE-Image

View all activity

Organizations

None yet

liked a dataset about 1 month ago

mvp-lab/LLaVA-OneVision-2-Data

Viewer • Updated May 11 • 24 • 162k • 30

liked 2 models 2 months ago

baidu/ERNIE-Image-Turbo

Text-to-Image • Updated Apr 17 • 4.33k • • 398

baidu/ERNIE-Image

Text-to-Image • Updated Apr 17 • 45.7k • • 658

upvoted a paper 4 months ago

Eureka-Audio: Triggering Audio Intelligence in Compact Language Models

Paper • 2602.13954 • Published Feb 15 • 4

upvoted 2 papers 5 months ago

ERNIE 5.0 Technical Report

Paper • 2602.04705 • Published Feb 4 • 269

ShowUI-Aloha: Human-Taught GUI Agent

Paper • 2601.07181 • Published Jan 12 • 4

New activity in baidu/ERNIE-4.5-VL-28B-A3B-Thinking 6 months ago

Render timestamps on video frames for vLLM inference

#13 opened 6 months ago by

tiiktak

liked a model 8 months ago

baidu/ERNIE-4.5-VL-28B-A3B-Thinking

Image-Text-to-Text • 30B • Updated Mar 6 • 175 • 541

upvoted a paper 8 months ago

PaddleOCR-VL: Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model

Paper • 2510.14528 • Published Oct 16, 2025 • 129

liked 2 models 10 months ago

baidu/ERNIE-4.5-VL-424B-A47B-PT

Image-Text-to-Text • 424B • Updated Jan 16 • 95 • 106

baidu/ERNIE-4.5-21B-A3B-Thinking

Text Generation • 22B • Updated Nov 26, 2025 • 15k • 786

upvoted a paper 12 months ago

Scaling RL to Long Videos

Paper • 2507.07966 • Published Jul 10, 2025 • 161

upvoted 3 papers about 1 year ago

Pixel Reasoner: Incentivizing Pixel-Space Reasoning with Curiosity-Driven Reinforcement Learning

Paper • 2505.15966 • Published May 21, 2025 • 53

Efficient Agent Training for Computer Use

Paper • 2505.13909 • Published May 20, 2025 • 44

ChartMuseum: Testing Visual Reasoning Capabilities of Large Vision-Language Models

Paper • 2505.13444 • Published May 19, 2025 • 16

upvoted a collection over 1 year ago

GUI Datasets

Collection

Datasets from the graphical user interfaces domain (screenshots). • 20 items • Updated Dec 3, 2024 • 8

upvoted a paper over 1 year ago

Contrastive Localized Language-Image Pre-Training

Paper • 2410.02746 • Published Oct 3, 2024 • 36

updated a Space almost 2 years ago

Tune An Ellipse

🏃

liked a Space about 3 years ago

VisorGPT

📉