Benjamim Alves Nepomuceno Neto
-
Running on Zero • Featured • 903
MMAudio: generating synchronized audio from video/text
Generate audio from video or text prompts
-
Running on Zero • 323
TangoFlux
Text to Audio (Sound SFX) Generator
-
Running on Zero • 449
Stable Audio Open Zero
Generate audio from text prompts
-
Paused • Featured • 202
YuE
Generate music from lyrics and genre tags
-
Running • 370
PDF Chatbot
Ask questions about PDFs using a chatbot
-
Runtime error • Featured • 367
Video Transcription Smart Summary
Generate summaries from YouTube videos or uploaded videos
-
Build error • 90
Quantized Retrieval
Efficient quantized retrieval over Wikipedia
-
Running • Featured • 1.24k
FineWeb: decanting the web for the finest text data at scale
Generate high-quality text data for LLMs using FineWeb
-
Running • 38
Anime Image Classification
Analyze anime images for various attributes
-
Running on Zero • Featured • 169
PaintsUndo
Generate key frames and videos from a single image
-
Running on Zero • 158
Kolors IP-Adapter
Generate images using text and reference images
-
Running on Zero • Featured • 2.06k
PuLID-FLUX
Generate images from text prompts and ID images
-
Runtime error • Featured • 93
Panoptic Segment Anything
-
Runtime error • Featured • 396
Grounded Segment Anything
-
Running on Zero • 196
Inspyrenet Remove Background
Remove background from images
-
Runtime error • Featured • 515
Florence2 + SAM2
Segment and caption objects in images and videos
-
Running • Featured • 108
BigVGAN
Generate high-quality audio from input audio
-
Running • 24
Audio Emotion Recognition
Detect emotions from audio recordings
-
Paused • Featured • 61
SoundwaveDemo
Process audio and generate text output based on instructions
-
Running • Featured • 67
DiffVox
Enhance vocals with professional effects using sliders
-
MIT/ast-finetuned-audioset-10-10-0.4593
Audio Classification • 86.6M • Updated • 339k • 333
-
Running on Zero • 312
Llasa 3b Tts
Zero Shot voice cloning with llasa 3b (Unofficial Demo)
-
Paused • Featured • 202
YuE
Generate music from lyrics and genre tags
-
Running on Zero • Featured • 411
Zonos
Generate audio from text with customizable emotions and settings
-
Running • Featured • 1.95k
Wan2.1
Wan: Open and Advanced Large-Scale Video Generative Models
-
Running on Zero • Featured • 184
FramePack_rotate_landscape
Generate videos from images and prompts
-
Running on Zero • MCP • Featured • 1.58k
Wan2.1 Fast
Generate a video from an image with a prompt
-
Running on Zero • Featured • 72
NAG Wan2-1-fast
Demo of Normalized Attention Guidance for 4 steps Wan2.1
-
Running • 33
Mediapipe Face Mesh 3d
create 3d-gltf face-mesh from image with mediapipe
-
Running • 5
Mediapipe Head Pose Estimation
2 head pose estimation with mediapipe and trained-model
-
Running • 9
Mediapipe 68 Points Facial Mask
create facial masks from 68 points landmark
-
Running on Zero • Featured • 1.09k
InfiniteYou-FLUX
Flexible Photo Recrafting While Preserving Your Identity
-
Build error • 116
Dpt Depth Estimation + 3D Voxels
Create 3D models from images using depth estimation
-
Running on Zero • 3.13k
Hunyuan3D-2.0
Text-to-3D and Image-to-3D Generation
-
Running on Zero • Featured • 4.78k
TRELLIS
Scalable and Versatile 3D Generation from images
-
Running on Zero • Featured • 213
Video Depth Anything
Generate depth video from input video
-
Running • Featured • 175
Manimator
Transform research papers and mathematical concepts into stu
-
Running on Zero • Featured • 175
Gaze Demo
Gaze detection using Moondream
-
Running • 11
Metropolitan Museum
The Metropolitan Museum of Art Collection
-
Sleeping • Featured • 112
CountGD_Multi-Modal_Open-World_Counting
Count objects in images using text, visual examples, or both
-
Running on Zero • Featured • 555
Midi Music Generator
Generate MIDI music from prompts
-
Paused • Featured • 202
YuE
Generate music from lyrics and genre tags
-
Paused • 51
Open SUNO
Your Lyrics into Complete Songs with Vocals in Multilingual
-
Running on Zero • Featured • 665
Di♪♪Rhythm
Blazingly Fast and Embarrassingly Simple Song Generation
-
Running on Zero • Featured • 260
SD3 Long Captioner
Generate detailed captions for images
-
Runtime error • Featured • 111
ChartGemma
Generate insights from charts using text prompts
-
Running on Zero • 90
AuraFlow-v0.3 with Captioner
Generate images from prompts or images
-
openai/clip-vit-large-patch14
Zero-Shot Image Classification • 0.4B • Updated • 8.15M • 1.94k
-
Runtime error • Featured • 461
Omni-Zero
Restylize & repose person ID
-
Running on Zero • 1.21k
PhotoMaker V2
Generate customized realistic photos from face images
-
Runtime error • Featured • 642
FLUX.1 [Inpainting]
-
Running on L40S • Featured • 1.59k
Expression Editor
Quickly edit the expression of a face