-
facebook/vjepa2-vitl-fpc64-256
Video Classification β’ 0.3B β’ Updated β’ 49.9k β’ 179 -
microsoft/xclip-base-patch32
Video Classification β’ 0.2B β’ Updated β’ 183k β’ 108 -
MCG-NJU/videomae-base
Video Classification β’ 94.2M β’ Updated β’ 89.6k β’ 49 -
OpenGVLab/VideoMAEv2-Base
Video Classification β’ 86.2M β’ Updated β’ 6.62k β’ 10
Alban NYANTUDRE
anyantudre
AI & ML interests
ML Engineer π¨πΎβπ»| Deep Learning (Vision, Language, Speech)
Recent Activity
liked
a model
about 22 hours ago
deepseek-ai/DeepSeek-OCR-2
updated
a collection
12 days ago
MoorΓ© - Burkina Faso π§π«
liked
a model
12 days ago
google/translategemma-4b-it
Organizations
MoorΓ© - Burkina Faso π§π«
Letβs help MoorΓ© language shine in the world of AI ππ§π«
-
anyantudre/MooreSpeechCorpora
Viewer β’ Updated β’ 5.54k β’ 1 β’ 2 -
Sleeping3
Moore Language Space
π3Demo Space for MoorΓ© language TTS, ASR and translation
-
anyantudre/moore-speech-contes
Viewer β’ Updated β’ 5.96k β’ 1 -
Running1
Moore translation Leaderboard
π1Text2text Machine Translation for Moore language
Spaces β€οΈ
My favorite Spaces.
-
Running on CPU Upgrade976
Open VLM Leaderboard
π976VLMEvalKit Evaluation Results Collection
-
Running on ZeroFeatured420
moondream1
π420Generate code from text prompts
-
Runtime error20
Ovis2 1B
π¦«20Small model can do big things.
-
Running on Zero4
VQA Autonomous Driving SmolVLM2
π4Visual Question Answering - Autonomous Driving - SmolVLM2
OCR
-
Running on ZeroFeatured267
granite-docling-258M demo
π267Convert images to structured text and answer questions
-
Runtime error36
Multimodal RAG with Granite Vision
π36RAG example using Granite [vision, embedding, instruct]
-
ibm-granite/granite-docling-258M
Image-Text-to-Text β’ 0.3B β’ Updated β’ 206k β’ 1.1k -
deepseek-ai/DeepSeek-OCR
Image-Text-to-Text β’ 3B β’ Updated β’ 3.12M β’ 3.12k
π€πΎ VLMs & LLMs
My favorite small LLMs and VLMs.
-
OpenGVLab/InternVL3-1B
Image-Text-to-Text β’ 0.9B β’ Updated β’ 146k β’ 78 -
vikhyatk/moondream2
Image-Text-to-Text β’ 2B β’ Updated β’ 4.01M β’ 1.37k -
microsoft/Florence-2-base
Image-Text-to-Text β’ 0.2B β’ Updated β’ 377k β’ 337 -
HuggingFaceTB/SmolVLM2-256M-Video-Instruct
Image-Text-to-Text β’ 0.3B β’ Updated β’ 108k β’ 95
Video-models
-
facebook/vjepa2-vitl-fpc64-256
Video Classification β’ 0.3B β’ Updated β’ 49.9k β’ 179 -
microsoft/xclip-base-patch32
Video Classification β’ 0.2B β’ Updated β’ 183k β’ 108 -
MCG-NJU/videomae-base
Video Classification β’ 94.2M β’ Updated β’ 89.6k β’ 49 -
OpenGVLab/VideoMAEv2-Base
Video Classification β’ 86.2M β’ Updated β’ 6.62k β’ 10
OCR
-
Running on ZeroFeatured267
granite-docling-258M demo
π267Convert images to structured text and answer questions
-
Runtime error36
Multimodal RAG with Granite Vision
π36RAG example using Granite [vision, embedding, instruct]
-
ibm-granite/granite-docling-258M
Image-Text-to-Text β’ 0.3B β’ Updated β’ 206k β’ 1.1k -
deepseek-ai/DeepSeek-OCR
Image-Text-to-Text β’ 3B β’ Updated β’ 3.12M β’ 3.12k
MoorΓ© - Burkina Faso π§π«
Letβs help MoorΓ© language shine in the world of AI ππ§π«
-
anyantudre/MooreSpeechCorpora
Viewer β’ Updated β’ 5.54k β’ 1 β’ 2 -
Sleeping3
Moore Language Space
π3Demo Space for MoorΓ© language TTS, ASR and translation
-
anyantudre/moore-speech-contes
Viewer β’ Updated β’ 5.96k β’ 1 -
Running1
Moore translation Leaderboard
π1Text2text Machine Translation for Moore language
π€πΎ VLMs & LLMs
My favorite small LLMs and VLMs.
-
OpenGVLab/InternVL3-1B
Image-Text-to-Text β’ 0.9B β’ Updated β’ 146k β’ 78 -
vikhyatk/moondream2
Image-Text-to-Text β’ 2B β’ Updated β’ 4.01M β’ 1.37k -
microsoft/Florence-2-base
Image-Text-to-Text β’ 0.2B β’ Updated β’ 377k β’ 337 -
HuggingFaceTB/SmolVLM2-256M-Video-Instruct
Image-Text-to-Text β’ 0.3B β’ Updated β’ 108k β’ 95
Spaces β€οΈ
My favorite Spaces.
-
Running on CPU Upgrade976
Open VLM Leaderboard
π976VLMEvalKit Evaluation Results Collection
-
Running on ZeroFeatured420
moondream1
π420Generate code from text prompts
-
Runtime error20
Ovis2 1B
π¦«20Small model can do big things.
-
Running on Zero4
VQA Autonomous Driving SmolVLM2
π4Visual Question Answering - Autonomous Driving - SmolVLM2