Suraj

ghishadow

AI & ML interests

None yet

Recent Activity

liked a model 15 days ago

deepseek-ai/DeepSeek-V4-Pro

Text Generation • 862B • Updated 3 days ago • 1.06M • 3.76k

liked a model 17 days ago

moonshotai/Kimi-K2.6

Image-Text-to-Text • 1.1T • Updated 9 days ago • 1.15M • 1.23k

liked a model about 1 month ago

LiquidAI/LFM2.5-350M

Text Generation • 0.4B • Updated Apr 1 • 100k • 296

upvoted a collection about 1 month ago

Bonsai

Collection

1-bit Bonsai models • 7 items • Updated 21 days ago • 191

liked a Space about 2 months ago

MinerU Document Extraction Tools

📚

595

Easily convert PDF and Office docs into Markdown and JSON

liked a dataset about 2 months ago

HuggingFaceFW/fineweb

Viewer • Updated Jul 11, 2025 • 52.5B • 467k • 2.78k

liked a model about 2 months ago

bharatgenai/Param2-17B-A2.4B-Thinking

Text Generation • 17B • Updated 29 days ago • 26.8k • 65

upvoted a paper 2 months ago

Unified Latents (UL): How to train your latents

Paper • 2602.17270 • Published Feb 19 • 60

liked a model 2 months ago

facebook/sam3

Mask Generation • 0.9B • Updated Nov 20, 2025 • 3.06M • 1.96k

upvoted an article 2 months ago

Small Language Models (SLM): A Comprehensive Overview

Article • Feb 22, 2025 • 149

liked a Space 2 months ago

QED-Nano: Teaching a Tiny Model to Prove Hard Theorems

📝

Who needs 1T parameters? Olympiad proofs with a 4B model

upvoted 2 articles 2 months ago

Bamba: Inference-Efficient Hybrid Mamba2 Model

Article • Dec 18, 2024

GGML and llama.cpp join HF to ensure the long-term progress of Local AI

Article • Feb 20 • 505

liked a model 4 months ago

LiquidAI/LFM2-2.6B-Exp

Text Generation • 3B • Updated Mar 30 • 6.49k • 339

liked 2 models 5 months ago

Qwen/Qwen3-VL-2B-Thinking

Image-Text-to-Text • 2B • Updated Oct 20, 2025 • 95.8k • 112

moonshotai/Kimi-Linear-48B-A3B-Instruct

Text Generation • 49B • Updated Dec 16, 2025 • 56.9k • 560

upvoted a collection 5 months ago

Ministral 3

Collection

Mistral Ministral 3: new multimodal models in Base, Instruct, and Reasoning variants, available in 3B, 8B, and 14B sizes. • 36 items • Updated 16 days ago • 35

liked a model 5 months ago

litert-community/Gemma3-1B-IT

Text Generation • Updated 17 days ago • 17.5k • 580

liked a model 6 months ago

maya-research/maya1

Text-to-Speech • Updated Nov 12, 2025 • 5.79k • 878

upvoted a paper 7 months ago

Latent Diffusion Model without Variational Autoencoder

Paper • 2510.15301 • Published Oct 17, 2025 • 50
