- QED-Nano: Teaching a Tiny Model to Prove Hard Theorems (Running, Featured, 65 likes): Who needs 1T parameters? Olympiad proofs with a 4B model
- The Smol Training Playbook (Running on CPU Upgrade, Featured, 3.04k likes): The secrets to building world-class LLMs
- SmolVLM (Collection, 5 items, updated May 5, 2025, 42 likes): State-of-the-art compact VLMs for on-device applications: Base, Synthetic, and Instruct. Blog: https://huggingface.co/blog/smolvlm
- Open ASR Leaderboard (Running on CPU Upgrade, Featured, 1.25k likes): Explore ASR model performance across languages and datasets
- Prefill and Decode for Concurrent Requests - Optimizing LLM Performance (Article, Apr 16, 2025, 66 likes)
- The Transformers Library: standardizing model definitions (Article, May 15, 2025, 121 likes)