holyhigh666 (Haolei)

liked a Space 3 months ago

OmniVoice

🌍

1.07k

High-quality voice cloning TTS for 600+ languages

liked 4 Spaces about 1 year ago

Self Forcing Wan 2.1

🎥

326

Real-time video generation

Inference Playground

🔋

296

Try AI models instantly with a web playground

ICEdit

🖼

666

Universal Image Editing is worth a single LoRA

Multilingual LLM Tokenizers

⚡

24

you to experience how Multilingual tokenizers work.

liked 3 Spaces over 1 year ago

DeepSite v4

🐳

16.6k

Generate any application by Vibe Coding it

Spark TTS

🌖

229

A text-to-speech model powered by SparkAudio and Mobvoi.

Whisper Realtime Transcription

👂

13

Transcribe audio in realtime with Whisper

liked a model over 1 year ago

microsoft/Phi-4-multimodal-instruct

Automatic Speech Recognition • 6B • Updated Dec 10, 2025 • 529k • 1.61k

liked a Space over 1 year ago

SmolVLM2 XSPFGenerator (VLC prototype)

🎞

32

Generate video highlights and playlist

liked a model over 1 year ago

ahmed-masry/chartgemma

Image-Text-to-Text • 3B • Updated Jul 27, 2024 • 636 • 45

liked 4 Spaces over 1 year ago

VLM R1 Referral Expression

💬

72

Mark regions in images based on text descriptions

Ovis2 16B

🦫

126

See, read, and reason—better together.

BiRefNet Demo

👁

316

Remove background from photos with accurate segmentation

TTS Spaces Arena

🤗

486

Blind vote on HF TTS models!

liked a model over 1 year ago

m-a-p/YuE-s1-7B-anneal-en-cot

Text Generation • 6B • Updated Mar 12, 2025 • 4.06k • 455

liked 4 Spaces over 1 year ago

Open SUNO

👩

51

Your Lyrics into Complete Songs with Vocals in Multilingual

FacePoke

🙂

2.21k

Import a portrait, click to move the head!

Live Portrait

🤪

3.75k

Apply the motion of a video on a portrait

Chat With Janus-Pro-7B

🌍

2.02k

A unified multimodal understanding and generation model.

Haolei

AI & ML interests

Organizations

OmniVoice

Self Forcing Wan 2.1

Inference Playground

ICEdit

Multilingual LLM Tokenizers

DeepSite v4

Spark TTS

Whisper Realtime Transcription

microsoft/Phi-4-multimodal-instruct

SmolVLM2 XSPFGenerator (VLC prototype)

ahmed-masry/chartgemma

VLM R1 Referral Expression

Ovis2 16B

BiRefNet Demo

TTS Spaces Arena

m-a-p/YuE-s1-7B-anneal-en-cot

Open SUNO

FacePoke

Live Portrait

Chat With Janus-Pro-7B

Haolei

AI & ML interests

Organizations

holyhigh666's activity

OmniVoice

Self Forcing Wan 2.1

Inference Playground

ICEdit

Multilingual LLM Tokenizers

DeepSite v4

Spark TTS

Whisper Realtime Transcription

SmolVLM2 XSPFGenerator (VLC prototype)

VLM R1 Referral Expression

Ovis2 16B

BiRefNet Demo

TTS Spaces Arena

Open SUNO

FacePoke

Live Portrait

Chat With Janus-Pro-7B