SmolVLM: Redefining small and efficient multimodal models • Paper • 2504.05299 • Published Apr 7
Vision Language Models Quantization • Collection • Vision Language Models (VLMs) quantized by Neural Magic • 20 items • Updated Mar 4
MambaVision • Collection • MambaVision: A Hybrid Mamba-Transformer Vision Backbone; includes both 1K and 21K pretrained models • 13 items • Updated 4 days ago
MoshiVis v0.1 • Collection • MoshiVis is a vision-speech model built as a perceptually augmented version of Moshi v0.1 for conversing about image inputs • 9 items • Updated 4 days ago
Welcome Gemma 3: Google's all-new multimodal, multilingual, long-context open LLM • Article • Published Mar 12