🏗️ Building on HF

Adarsh Zolekar

adarshzolekar

adarsh_zolekar
AdarshZolekar
adarshzolekar
adarshzolekar.bsky.social

AI & ML interests

Passionate about AI & ML, Deep Learning and related AI domains. Exploring models, datasets and applications while contributing to the Hugging Face community.

Recent Activity

upvoted a paper 5 days ago

Looped World Models

liked a model 5 days ago

google/diffusiongemma-26B-A4B-it

liked a model 5 days ago

zai-org/GLM-5.2

adarshzolekar's collections (4)

Multimodal AI Models

Purpose: Models that understand text + image + audio together.

llava-hf/llava-1.5-7b-hf
Image-Text-to-Text • 7B • Updated Jun 6, 2025 • 3.13M downloads • 366 likes

Salesforce/blip-image-captioning-base
Image-to-Text • Updated Feb 3, 2025 • 2.1M downloads • 861 likes

google/pix2struct-base
Image-to-Text • 0.3B • Updated Dec 24, 2023 • 2.98k downloads • 79 likes

microsoft/kosmos-2-patch14-224
Image-to-Text • 2B • Updated Nov 28, 2023 • 161k downloads • 185 likes

Vision Models (Image & Video)

Purpose: Text-to-image, image classification, detection, segmentation.

stabilityai/stable-diffusion-xl-base-1.0
Text-to-Image • Updated Oct 30, 2023 • 1.37M downloads • 7.86k likes

rupeshs/LCM-runwayml-stable-diffusion-v1-5
Text-to-Image • Updated Nov 12, 2023 • 57 downloads • 30 likes

openai/clip-vit-base-patch32
Zero-Shot Image Classification • Updated Feb 29, 2024 • 23.2M downloads • 963 likes

facebook/detr-resnet-50
Object Detection • 41.6M • Updated Apr 10, 2024 • 745k downloads • 956 likes

Audio & Speech Models

Purpose: Speech recognition, text-to-speech, music, audio analysis.

openai/whisper-large-v3
Automatic Speech Recognition • 2B • Updated Aug 12, 2024 • 5.73M downloads • 5.88k likes

facebook/wav2vec2-base-960h
Automatic Speech Recognition • 94.4M • Updated Nov 14, 2022 • 1.31M downloads • 398 likes

coqui/XTTS-v2
Text-to-Speech • Updated Dec 11, 2023 • 9.69M downloads • 3.62k likes

microsoft/speecht5_tts
Text-to-Speech • Updated Nov 8, 2023 • 91.3k downloads • 835 likes

Text & Code Models (NLP)

Purpose: Text generation, summarization, translation, embeddings, coding.

meta-llama/Llama-3.1-8B-Instruct
Text Generation • 8B • Updated Sep 25, 2024 • 10M downloads • 6.15k likes

mistralai/Mistral-7B-Instruct-v0.3
7B • Updated Dec 3, 2025 • 2.81M downloads • 2.66k likes

google/gemma-7b
Text Generation • 9B • Updated Jun 27, 2024 • 20.7k downloads • 3.36k likes

bigscience/bloom
Text Generation • 176B • Updated Jul 28, 2023 • 5.46k downloads • 5.01k likes