bigbrotherr (Ampomah Theophilus)

liked a model 10 months ago

IndexTeam/IndexTTS-2

Text-to-Speech • Updated Jan 20 • 14.3k • 741

liked 2 models about 1 year ago

black-forest-labs/FLUX.1-Kontext-dev

Image-to-Image • Updated Jan 1 • 150k • 2.69k

prithivMLmods/Qwen2-VL-OCR-2B-Instruct

Image-Text-to-Text • 2B • Updated May 2, 2025 • 1.7k • 103

liked 4 Spaces about 1 year ago

Multimodal OCR

🍍

414

Nanonets / olmOCR / RolmOCR / Aya-Vision / Qwen2-VL-OCR

IndexTTS: An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System

🎙

216

Generate speech from text using a reference audio

TripoSG

🔮

883

Create a textured 3D model from a single image

DeepSite v4

🐳

16.6k

Generate any application by Vibe Coding it

liked a model over 1 year ago

microsoft/Phi-4-multimodal-instruct

Automatic Speech Recognition • 6B • Updated Dec 10, 2025 • 519k • 1.61k

liked 2 Spaces over 1 year ago

Kokoro TTS

❤

3.38k

Upgraded to v1.0!

BEN2

🚀

228

Remove background from images and videos

liked a model over 1 year ago

Qwen/Qwen2.5-VL-3B-Instruct

Image-Text-to-Text • 4B • Updated Apr 6, 2025 • 5.24M • 669

liked 5 Spaces over 1 year ago

Stable Point-Aware 3D

⚡

468

Generate 3D models from images

TRELLIS

🏢

4.78k

Scalable and Versatile 3D Generation from images

Qwen2.5 Coder Artifacts

🐢

1.73k

Generate and preview app code from a text description

F5-TTS

🗣

2.88k

F5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)

NIST FRVT TOP 1 Face Recognition, Face Liveness Detection, Face Analysis

🥇

600

Compare two faces to verify identity

liked a Space almost 2 years ago

MiniAiLive Face Recognition WebAPI Playground

🥇

183

Advanced 1:1 & 1:N Face Matching Technology, On-premise SDK

liked 2 models almost 2 years ago

stepfun-ai/GOT-OCR2_0

Image-Text-to-Text • 0.7B • Updated Feb 4, 2025 • 183k • 1.54k

mistral-community/pixtral-12b-240910

Image-Text-to-Text • Updated Oct 1, 2024 • 581 • 381

liked a Space almost 2 years ago

Background Removal

🌘

2.87k

Remove backgrounds from images instantly
