BÌNH MINH
Bk9x
AI & ML interests
None yet
Recent Activity
updated a collection about 1 month ago
Automatic Speech Recognition updated a collection about 1 month ago
TTS updated a collection about 2 months ago
Dataset_NLPOrganizations
Small LM
Embedding
SDXL
- RunningAgentsFeatured183
Flash Scribble SDXL
⚡183Generate images from sketches using SDXL Flash
- Runtime errorAgents1.68k
Flux.1-dev Upscaler
🔎1.68kUpscale low‑resolution images to higher resolution
- Running on ZeroAgents191
Diffusion Self Distillation
🦀191Tuning-free subject-driven generation
LLM
VLM + OCR
-
5CD-AI/Vintern-1B-v2
Image-Text-to-Text • 0.9B • Updated • 527 • 81 -
erax-ai/EraX-VL-7B-V1.0
Image-Text-to-Text • 8B • Updated • 116 • 43 - Running on ZeroAgentsFeatured276
granite-docling-258M demo
📝276Convert images of documents to structured data and answer queries
-
datalab-to/chandra
Image-Text-to-Text • 9B • Updated • 117k • 522
Dataset_NLP
Dataset_voice
Automatic Speech Recognition
-
openai/whisper-large-v3-turbo
Automatic Speech Recognition • 0.8B • Updated • 7.29M • • 3.01k -
nguyendv02/ViMD_Dataset
Viewer • Updated • 19k • 1.51k • 18 - RunningAgents46
Automatic Speech Recognition
🌍46Transcribe uploaded, recorded, or online audio to text
-
Qwen/Qwen3-ASR-1.7B
Automatic Speech Recognition • 2B • Updated • 2.04M • 808
TTS
model_NLP
Data_Pretrain_NLP
Dataset_NLP
Small LM
Dataset_voice
Embedding
Automatic Speech Recognition
-
openai/whisper-large-v3-turbo
Automatic Speech Recognition • 0.8B • Updated • 7.29M • • 3.01k -
nguyendv02/ViMD_Dataset
Viewer • Updated • 19k • 1.51k • 18 - RunningAgents46
Automatic Speech Recognition
🌍46Transcribe uploaded, recorded, or online audio to text
-
Qwen/Qwen3-ASR-1.7B
Automatic Speech Recognition • 2B • Updated • 2.04M • 808
SDXL
- RunningAgentsFeatured183
Flash Scribble SDXL
⚡183Generate images from sketches using SDXL Flash
- Runtime errorAgents1.68k
Flux.1-dev Upscaler
🔎1.68kUpscale low‑resolution images to higher resolution
- Running on ZeroAgents191
Diffusion Self Distillation
🦀191Tuning-free subject-driven generation
TTS
LLM
model_NLP
VLM + OCR
-
5CD-AI/Vintern-1B-v2
Image-Text-to-Text • 0.9B • Updated • 527 • 81 -
erax-ai/EraX-VL-7B-V1.0
Image-Text-to-Text • 8B • Updated • 116 • 43 - Running on ZeroAgentsFeatured276
granite-docling-258M demo
📝276Convert images of documents to structured data and answer queries
-
datalab-to/chandra
Image-Text-to-Text • 9B • Updated • 117k • 522