🏗️ Building on HF

Taufiq Dwi Purnomo

taufiqdp

·

https://taufiqdp.com

AI & ML interests

SLM, VLM

Recent Activity

liked a model 5 days ago

deepseek-ai/DeepSeek-V4-Flash-DSpark

liked a model 9 days ago

baidu/Unlimited-OCR

liked a model 10 days ago

zai-org/GLM-5.2

View all activity

Organizations

upvoted a collection 27 days ago

Gemma 4 QAT Q4_0

19 items • Updated 27 days ago • 139

upvoted 2 papers about 1 month ago

SkillOpt: Executive Strategy for Self-Evolving Agent Skills

Paper • 2605.23904 • Published May 22 • 249

Code as Agent Harness

Paper • 2605.18747 • Published May 18 • 223

upvoted a paper about 2 months ago

Qwen-Image-2.0 Technical Report

Paper • 2605.10730 • Published May 11 • 116

upvoted an article 2 months ago

Article

Introducing NVIDIA Nemotron 3 Nano Omni: Long-Context Multimodal Intelligence for Documents, Audio and Video Agents

nvidia

•

Apr 28

• 62

upvoted 2 collections 2 months ago

MiMo-V2.5

4 items • Updated Apr 27 • 90

DeepSeek-V4

6 items • Updated 6 days ago • 710

upvoted a paper 2 months ago

Qwen3.5-Omni Technical Report

Paper • 2604.15804 • Published Apr 17 • 59

upvoted a collection 2 months ago

Qwen3.6

4 items • Updated Apr 22 • 425

upvoted a collection 3 months ago

Gemma 4

15 items • Updated 22 days ago • 1.01k

upvoted an article 3 months ago

Article

State of Open Source on Hugging Face: Spring 2026

huggingface

•

Mar 17

• 93

upvoted a paper 3 months ago

Attention Residuals

Paper • 2603.15031 • Published Mar 16 • 189

upvoted 2 collections 4 months ago

Qwen3.5

Qwen3.5 is Qwen's new model family including Qwen3.5 Small: 0.8B, 2B, 4B, 9B and Qwen3.5 Medium: 35B-A3B, 27B, 122B-A10B and 397B-A17B. • 25 items • Updated 17 days ago • 161

Qwen3.5

21 items • Updated Mar 9 • 1.7k

upvoted a paper 4 months ago

Experiential Reinforcement Learning

Paper • 2602.13949 • Published Feb 15 • 76

upvoted 3 papers 5 months ago

Baichuan-M3: Modeling Clinical Inquiry for Reliable Medical Decision-Making

Paper • 2602.06570 • Published Feb 6 • 61

LongCat-Flash-Thinking-2601 Technical Report

Paper • 2601.16725 • Published Jan 23 • 183

VIBEVOICE-ASR Technical Report

Paper • 2601.18184 • Published Mar 14 • 24

upvoted an article 5 months ago

Article

NVIDIA Earth-2 Open Models Span the Whole Weather Stack

nvidia

•

Jan 26

• 36

upvoted a paper 6 months ago

TranslateGemma Technical Report

Paper • 2601.09012 • Published Jan 13 • 22