Models
Datasets
Spaces
Buckets new
Docs
Enterprise
Pricing
- Website
- Community
- Solutions
Log In
Sign Up

Milos Bijanic's picture

Milos Bijanic

biki96

Ameeeee's profile picture

frascuchon's profile picture

·

AI & ML interests

None yet

Organizations

None yet

biki96 's collections 13

image-text-to-video

OpenMOSS-Team/MOVA-360p

Image-to-Video • Updated Feb 15 • 96.6k • 215
Lightricks/LTX-2

Image-to-Video • Updated Mar 2 • 557k • • 1.75k
GAIR/daVinci-MagiHuman

Image-to-Video • Updated Mar 25 • 150 • 331

Alissonerdx/BFS-Best-Face-Swap-Video

Image-to-Video • Updated 5 days ago • 224 • 340

nvidia/personaplex-7b-v1

Audio-to-Audio • 8B • Updated Mar 2 • 362k • 2.57k

Qwen/Qwen-Image-Layered

Image-Text-to-Image • Updated Dec 19, 2025 • 51.6k • 1.11k
Tongyi-MAI/Z-Image

Text-to-Image • Updated Jan 28 • 21.8k • • 1.16k
Tongyi-MAI/Z-Image-Turbo

Text-to-Image • Updated Jan 30 • 894k • • 4.87k
alibaba-pai/Z-Image-Fun-Lora-Distill

Text-to-Image • Updated Mar 2 • 11.5k • 164

microsoft/TRELLIS.2-4B

Image-to-3D • Updated Dec 27, 2025 • 966k • 959
tencent/HY-WorldPlay

Image-to-Video • Updated Mar 6 • 469 • 518
zai-org/SCAIL-Preview

Updated Dec 16, 2025 • 111

stabilityai/stable-video-diffusion-img2vid

Image-to-Video • Updated Jul 10, 2024 • 33k • 1.04k

DreamGaussian4D: Generative 4D Gaussian Splatting

Paper • 2312.17142 • Published Dec 28, 2023 • 19
Presto! Distilling Steps and Layers for Accelerating Music Generation

Paper • 2410.05167 • Published Oct 7, 2024 • 18
OmniBooth: Learning Latent Control for Image Synthesis with Multi-modal Instruction

Paper • 2410.04932 • Published Oct 7, 2024 • 9
Efficient Diffusion Models: A Comprehensive Survey from Principles to Practices

Paper • 2410.11795 • Published Oct 15, 2024 • 18

deepgenteam/DeepGen-1.0

Text-to-Image • Updated Mar 2 • 12 • 180
Skywork/UniPic2-SD3.5M-Kontext-2B

Any-to-Any • Updated Sep 8, 2025 • 25 • 24
meituan-longcat/LongCat-Image

Text-to-Image • Updated Dec 16, 2025 • 20.8k • • 246
OpenMOSS-Team/MOVA-360p

Image-to-Video • Updated Feb 15 • 96.6k • 215

llm-semantic-router/multi-modal-embed-small

Sentence Similarity • 0.3B • Updated May 12 • 1.89k • • 21
NeuML/bert-hash-femto

240k • Updated 10 days ago • 312 • 20

ResembleAI/chatterbox-turbo

Text-to-Speech • Updated Dec 15, 2025 • • 656
Running on Zero

MCP

Featured

149

Soprano TTS

🗣

149

Now with upgraded v1.1 model!
HumeAI/tada-1b

Text-to-Speech • 2B • Updated Mar 17 • 7.99k • 238
ACE-Step/acestep-v15-xl-turbo

Text-to-Audio • 5B • Updated Apr 7 • 4.12k • 182

google/functiongemma-270m-it

Text Generation • 0.3B • Updated Jan 14 • 16.1k • 1.02k
google/medasr

Automatic Speech Recognition • 0.1B • Updated 30 days ago • 58k • 338
zai-org/GLM-4.7-Flash

Text Generation • 31B • Updated Jan 29 • 2.1M • • 1.76k
Nanbeige/Nanbeige4.1-3B

Text Generation • 4B • Updated Mar 25 • 3.49k • • 1.14k

deepseek-ai/DeepSeek-OCR

Image-Text-to-Text • 3B • Updated Nov 4, 2025 • 2.22M • 3.29k
allenai/olmOCR-2-7B-1025-FP8

Image-Text-to-Text • 8B • Updated Feb 19 • 784k • 242
allenai/olmOCR-2-7B-1025

Image-Text-to-Text • 8B • Updated Oct 22, 2025 • 28.6k • 151
PaddlePaddle/PaddleOCR-VL

Image-Text-to-Text • 1.0B • Updated 28 days ago • 7.39k • 1.63k

Running on CPU Upgrade

Agents

Featured

1.38k

Open ASR Leaderboard

🏆

1.38k

Explore and compare speech‑recognition model benchmarks
nvidia/canary-qwen-2.5b

Automatic Speech Recognition • 3B • Updated Apr 21 • 111k • 441
nvidia/parakeet-tdt-0.6b-v3

Automatic Speech Recognition • 0.6B • Updated May 20 • 169k • • 946
nvidia/parakeet-tdt-0.6b-v2

Automatic Speech Recognition • Updated Apr 13 • 402k • 1.51k

image-text-to-video

OpenMOSS-Team/MOVA-360p

Image-to-Video • Updated Feb 15 • 96.6k • 215
Lightricks/LTX-2

Image-to-Video • Updated Mar 2 • 557k • • 1.75k
GAIR/daVinci-MagiHuman

Image-to-Video • Updated Mar 25 • 150 • 331

deepgenteam/DeepGen-1.0

Text-to-Image • Updated Mar 2 • 12 • 180
Skywork/UniPic2-SD3.5M-Kontext-2B

Any-to-Any • Updated Sep 8, 2025 • 25 • 24
meituan-longcat/LongCat-Image

Text-to-Image • Updated Dec 16, 2025 • 20.8k • • 246
OpenMOSS-Team/MOVA-360p

Image-to-Video • Updated Feb 15 • 96.6k • 215

Alissonerdx/BFS-Best-Face-Swap-Video

Image-to-Video • Updated 5 days ago • 224 • 340

llm-semantic-router/multi-modal-embed-small

Sentence Similarity • 0.3B • Updated May 12 • 1.89k • • 21
NeuML/bert-hash-femto

240k • Updated 10 days ago • 312 • 20

nvidia/personaplex-7b-v1

Audio-to-Audio • 8B • Updated Mar 2 • 362k • 2.57k

ResembleAI/chatterbox-turbo

Text-to-Speech • Updated Dec 15, 2025 • • 656
Running on Zero

MCP

Featured

149

Soprano TTS

🗣

149

Now with upgraded v1.1 model!
HumeAI/tada-1b

Text-to-Speech • 2B • Updated Mar 17 • 7.99k • 238
ACE-Step/acestep-v15-xl-turbo

Text-to-Audio • 5B • Updated Apr 7 • 4.12k • 182

Qwen/Qwen-Image-Layered

Image-Text-to-Image • Updated Dec 19, 2025 • 51.6k • 1.11k
Tongyi-MAI/Z-Image

Text-to-Image • Updated Jan 28 • 21.8k • • 1.16k
Tongyi-MAI/Z-Image-Turbo

Text-to-Image • Updated Jan 30 • 894k • • 4.87k
alibaba-pai/Z-Image-Fun-Lora-Distill

Text-to-Image • Updated Mar 2 • 11.5k • 164

google/functiongemma-270m-it

Text Generation • 0.3B • Updated Jan 14 • 16.1k • 1.02k
google/medasr

Automatic Speech Recognition • 0.1B • Updated 30 days ago • 58k • 338
zai-org/GLM-4.7-Flash

Text Generation • 31B • Updated Jan 29 • 2.1M • • 1.76k
Nanbeige/Nanbeige4.1-3B

Text Generation • 4B • Updated Mar 25 • 3.49k • • 1.14k

microsoft/TRELLIS.2-4B

Image-to-3D • Updated Dec 27, 2025 • 966k • 959
tencent/HY-WorldPlay

Image-to-Video • Updated Mar 6 • 469 • 518
zai-org/SCAIL-Preview

Updated Dec 16, 2025 • 111

deepseek-ai/DeepSeek-OCR

Image-Text-to-Text • 3B • Updated Nov 4, 2025 • 2.22M • 3.29k
allenai/olmOCR-2-7B-1025-FP8

Image-Text-to-Text • 8B • Updated Feb 19 • 784k • 242
allenai/olmOCR-2-7B-1025

Image-Text-to-Text • 8B • Updated Oct 22, 2025 • 28.6k • 151
PaddlePaddle/PaddleOCR-VL

Image-Text-to-Text • 1.0B • Updated 28 days ago • 7.39k • 1.63k

stabilityai/stable-video-diffusion-img2vid

Image-to-Video • Updated Jul 10, 2024 • 33k • 1.04k

Running on CPU Upgrade

Agents

Featured

1.38k

Open ASR Leaderboard

🏆

1.38k

Explore and compare speech‑recognition model benchmarks
nvidia/canary-qwen-2.5b

Automatic Speech Recognition • 3B • Updated Apr 21 • 111k • 441
nvidia/parakeet-tdt-0.6b-v3

Automatic Speech Recognition • 0.6B • Updated May 20 • 169k • • 946
nvidia/parakeet-tdt-0.6b-v2

Automatic Speech Recognition • Updated Apr 13 • 402k • 1.51k

DreamGaussian4D: Generative 4D Gaussian Splatting

Paper • 2312.17142 • Published Dec 28, 2023 • 19
Presto! Distilling Steps and Layers for Accelerating Music Generation

Paper • 2410.05167 • Published Oct 7, 2024 • 18
OmniBooth: Learning Latent Control for Image Synthesis with Multi-modal Instruction

Paper • 2410.04932 • Published Oct 7, 2024 • 9
Efficient Diffusion Models: A Comprehensive Survey from Principles to Practices

Paper • 2410.11795 • Published Oct 15, 2024 • 18

Company

TOS Privacy About Careers

Website

Models Datasets Spaces Pricing Docs