Shail Shah
shail-2512
AI & ML interests
None yet
Organizations
LLMs
Coder
-
Qwen/Qwen2.5-Coder-32B-Instruct
Text Generation • 33B • Updated • 1.2M • • 2.02k -
Qwen/Qwen2.5-Coder-7B-Instruct
Text Generation • 8B • Updated • 2.29M • • 704 -
unsloth/Qwen2.5-Coder-32B-Instruct-128K-GGUF
33B • Updated • 5.66k • 76 -
deepseek-ai/DeepSeek-Coder-V2-Instruct
Text Generation • 236B • Updated • 116k • 686
Image Generation
3D
Speech Recognition
-
nvidia/canary-1b
Automatic Speech Recognition • Updated • 2.57k • 457 -
facebook/seamless-m4t-v2-large
Automatic Speech Recognition • 2B • Updated • 84.5k • 976 -
nyrahealth/CrisperWhisper
Automatic Speech Recognition • 2B • Updated • 85.8k • 329 -
openai/whisper-large-v3-turbo
Automatic Speech Recognition • 0.8B • Updated • 7.64M • • 3k
Reranking Models
ALMs (Audio Language Models)
TTS
Reasoning (LRMs)
VLMs
-
HuggingFaceTB/SmolVLM-Instruct
Image-Text-to-Text • 2B • Updated • 29k • 586 -
microsoft/OmniParser
Image-Text-to-Text • Updated • 274 • 1.71k -
vidore/colsmolvlm-v0.1
Visual Document Retrieval • Updated • 16 • 55 -
meta-llama/Llama-3.2-11B-Vision-Instruct
Image-Text-to-Text • 11B • Updated • 165k • 1.59k
Video Generation
Dataset to fine-tune Embeddings
Embedding Models
MultiModal (Any-to-Any)
ALMs (Audio Language Models)
LLMs
TTS
Coder
-
Qwen/Qwen2.5-Coder-32B-Instruct
Text Generation • 33B • Updated • 1.2M • • 2.02k -
Qwen/Qwen2.5-Coder-7B-Instruct
Text Generation • 8B • Updated • 2.29M • • 704 -
unsloth/Qwen2.5-Coder-32B-Instruct-128K-GGUF
33B • Updated • 5.66k • 76 -
deepseek-ai/DeepSeek-Coder-V2-Instruct
Text Generation • 236B • Updated • 116k • 686
Reasoning (LRMs)
Image Generation
VLMs
-
HuggingFaceTB/SmolVLM-Instruct
Image-Text-to-Text • 2B • Updated • 29k • 586 -
microsoft/OmniParser
Image-Text-to-Text • Updated • 274 • 1.71k -
vidore/colsmolvlm-v0.1
Visual Document Retrieval • Updated • 16 • 55 -
meta-llama/Llama-3.2-11B-Vision-Instruct
Image-Text-to-Text • 11B • Updated • 165k • 1.59k
3D
Video Generation
Speech Recognition
-
nvidia/canary-1b
Automatic Speech Recognition • Updated • 2.57k • 457 -
facebook/seamless-m4t-v2-large
Automatic Speech Recognition • 2B • Updated • 84.5k • 976 -
nyrahealth/CrisperWhisper
Automatic Speech Recognition • 2B • Updated • 85.8k • 329 -
openai/whisper-large-v3-turbo
Automatic Speech Recognition • 0.8B • Updated • 7.64M • • 3k
Dataset to fine-tune Embeddings
Reranking Models
Embedding Models