Backlog to try - a nkaushik Collection

Models
Datasets
Spaces
Buckets new
Docs
Enterprise
Pricing
- Website
- Community
- Solutions
Log In
Sign Up

nkaushik 's Collections

Visual Understanding Models

AudioRefinement

Backlog to try

updated Jun 7

fashn-ai/fashn-vton-1.5

Image-to-Image • 1.0B • Updated Feb 1 • 121
unsloth/Z-Image-GGUF

Text-to-Image • 6B • Updated Jan 28 • 10.7k • 186
Qwen/Qwen3-TTS-12Hz-1.7B-CustomVoice

Text-to-Speech • 2B • Updated Jan 29 • 2.5M • 1.84k
Qwen/Qwen3-TTS-12Hz-1.7B-Base

2B • Updated Jan 23 • 2.46M • 458
unsloth/FLUX.2-klein-9B-GGUF

Image-to-Image • 9B • Updated Jan 16 • 88.9k • 289
unsloth/FLUX.2-klein-base-4B-GGUF

Image-to-Image • 4B • Updated Jan 15 • 3.66k • 23
unsloth/FLUX.2-klein-base-9B-GGUF

Image-to-Image • 9B • Updated Jan 15 • 10.3k • 37
DevParker/VibeVoice7b-low-vram

Text-to-Speech • Updated Oct 23, 2025 • 72
ACE-Step/Ace-Step1.5

Text-to-Audio • Updated Feb 3 • 59.6k • 808
circlestone-labs/Anima

Updated 5 days ago • 828k • 1.97k
PaddlePaddle/PaddleOCR-VL-1.5

Image-Text-to-Text • 1.0B • Updated 19 days ago • 25.1k • 657
sarvamai/sarvam-1

Text Generation • 3B • Updated Nov 8, 2024 • 9.03k • 142
lightonai/LightOnOCR-2-1B

Image-Text-to-Text • 1B • Updated 21 days ago • 341k • 776
PaddlePaddle/PaddleOCR-VL

Image-Text-to-Text • 1.0B • Updated Jun 27 • 48.1k • 1.64k
LocoreMind/LocoOperator-4B

Text Generation • 4B • Updated Feb 24 • 215 • • 278
unsloth/Qwen3.5-9B-GGUF

Image-Text-to-Text • 9B • Updated Mar 2 • 988k • 794
UsefulSensors/moonshine-tiny

Automatic Speech Recognition • 27.1M • Updated Jan 30, 2025 • 200k • 48
k2-fsa/OmniVoice

Text-to-Speech • 0.6B • Updated 27 days ago • 848k • 1.22k

Collection guide
Browse collections

Company

TOS Privacy About Careers

Website

Models Datasets Spaces Pricing Docs