Marco
- stepfun-ai/GOT-OCR-2.0-hf
  Image-Text-to-Text • 0.6B • Updated • 173k • 234
- GOT OCR Transformers
  Running on Zero • Agents • 84
  📷 Demo of GOT-OCR 2.0's Transformers implementation
- allenai/olmOCR-7B-0225-preview
  Image-Text-to-Text • 8B • Updated • 21.6k • 708
- allenai/olmOCR-mix-0225
  Viewer • Updated • 259k • 747 • 171
- DeepSeek-R1 WebGPU
  Running • 557
  🧠 Next-generation reasoning model that runs locally in-browser
- Qwen2.5-1M Demo
  Paused • Agents • 101
  💻 Ask questions about your uploaded documents
- mistralai/Mistral-Small-24B-Base-2501
  24B • Updated • 5.93k • 262
- deepseek-ai/deepseek-vl2-small
  Image-Text-to-Text • 16B • Updated • 6.05k • 179
- Qwen3 Omni Demo
  Running • Agents • Featured • 266
  ⚡ Chat with AI using text, audio, images, or video
- Qwen3 Omni Captioner Demo
  Running • Agents • 64
  🐠 Generate a caption for any uploaded or recorded audio
- Qwen/Qwen3-Omni-30B-A3B-Thinking
  Any-to-Any • 32B • Updated • 336k • 308
- Qwen/Qwen3-Omni-30B-A3B-Instruct
  Any-to-Any • 35B • Updated • 2M • 944
- Consilium MCP Server
  Running • MCP • 134
  🏢 Multi-AI Expert Consensus Platform
- MCP Hackathon Deepfake Watchdog
  Runtime error • MCP • 2
  🛡 Upload your image and/or voice to scan for deepfake misuse o
- VulnBuster
  Runtime error • 36
  🛡 AI Security Agent: Multi-MCP Code Vulnerability Scanner
- AI Marketing Content Generator
  Running • MCP • 200
  🎨 An AI-powered tool made for content creators and marketers
- nvidia/parakeet-tdt-0.6b-v2
  Automatic Speech Recognition • Updated • 402k • 1.51k
- Parakeet-TDT-0.6b-V2
  Running on Zero • Agents • Featured • 474
  Transcribe audio files with timestamps and downloadable subtitles
- Blazing Fast Whisper
  Runtime error • Agents • 33
  👁 Blazing Fast Whisper Deployed on HF Inference Endpoints
- Open ASR Leaderboard
  Running on CPU Upgrade • Agents • Featured • 1.38k
  🏆 Explore and compare speech‑recognition model benchmarks
- RF-DETR
  Running on T4 • Agents • 148
  🔥 SOTA real-time object detection model
- YOLO ARENA
  Running on CPU Upgrade • Agents • 50
  🏟 compare performance of top object detectors
- D-Fine - SOTA Real-Time Object Detector
  Running on Zero • Agents • Featured • 92
  ⚡ Object Detection on Images and Video
- Gaze LLE
  Running on Zero • MCP • 31
  👀 Gaze Target Estimation
- flax-community/t5-recipe-generation
  Text Generation • 0.2B • Updated • 1.5k • 76
- numind/NuExtract-1.5
  Text Generation • 4B • Updated • 1.43k • 247
- Signature Detection
  Running • Agents • 11
  👁 Handwritten Signature Detection
- manycore-research/SpatialLM-Llama-1B
  Text Generation • 1B • Updated • 123 • 994
- LatentSync
  Running on Zero • MCP • Featured • 605
  👄 Audio Conditioned LipSync with Latent Diffusion Models
- BEN2
  Paused • Agents • 228
  🚀 Remove background from images and videos
- SmolVLM
  Build error • Agents • 81
  📊 Generate answers by combining text and images
- SmolVLM2 HighlightGenerator
  Build error • Agents • 59
  🐨 Generate video highlights from uploaded video
- NexaAI/Qwen2-Audio-7B-GGUF
  Audio-Text-to-Text • 8B • Updated • 2.67k • 171
- kyutai/hibiki-2b-pytorch-bf16
  Translation • Updated • 587 • 63
- Zyphra/Zonos-v0.1-hybrid
  Text-to-Speech • 2B • Updated • 1.8k • 1.11k
- Di♪♪Rhythm
  Running on Zero • Agents • Featured • 688
  🎶 Blazingly Fast and Embarrassingly Simple Song Generation
- onnx-community/Kokoro-82M-ONNX
  Text-to-Speech • Updated • 47.5k • 178
- Kokoro Text-to-Speech
  Running • 223
  🗣 High-quality speech synthesis powered by Kokoro TTS
- NexaAI/Qwen2-Audio-7B-GGUF
  Audio-Text-to-Text • 8B • Updated • 2.67k • 171
- jonatasgrosman/wav2vec2-large-xlsr-53-english
  Automatic Speech Recognition • 0.3B • Updated • 66.4k • 476