7 13

Zhiyu

zylin1

AI & ML interests

None yet

Recent Activity

liked a model about 2 months ago

nvidia/audio-flamingo-next-captioner-hf

liked a model 3 months ago

google/gemma-3n-E4B-it

updated a dataset 4 months ago

zylin1/deear_tts

View all activity

Organizations

None yet

liked a model about 2 months ago

nvidia/audio-flamingo-next-captioner-hf

Audio-Text-to-Text • 8B • Updated May 13 • 1.59k • 18

liked a model 3 months ago

google/gemma-3n-E4B-it

Image-Text-to-Text • 8B • Updated Jul 14, 2025 • 21.5k • • 919

updated a dataset 4 months ago

zylin1/deear_tts

Viewer • Updated Feb 25 • 5.18k • 9

published 2 datasets 4 months ago

zylin1/deear_tts

Viewer • Updated Feb 25 • 5.18k • 9

zylin1/DeEAR_backup

Updated Oct 10, 2025 • 2

upvoted a paper 5 months ago

RubricHub: A Comprehensive and Highly Discriminative Rubric Dataset via Automated Coarse-to-Fine Generation

Paper • 2601.08430 • Published Jan 13 • 62

upvoted a paper 6 months ago

DentalGPT: Incentivizing Multimodal Complex Reasoning in Dentistry

Paper • 2512.11558 • Published Dec 12, 2025 • 45

liked a Space 8 months ago

Qwen3 Omni Captioner Demo

🐠

Generate a caption for any uploaded or recorded audio

liked a model 8 months ago

FreedomIntelligence/DeEAR_Base

0.3B • Updated Sep 25, 2025 • 14 • 3

updated a dataset 8 months ago

FreedomIntelligence/ExpressiveSpeech

Viewer • Updated Oct 24, 2025 • 10.8k • 375 • 10

updated a dataset 9 months ago

zylin1/DeEAR_backup

Updated Oct 10, 2025 • 2

updated a model 9 months ago

FreedomIntelligence/DeEAR_Base

0.3B • Updated Sep 25, 2025 • 14 • 3

published a model 9 months ago

FreedomIntelligence/DeEAR_Base

0.3B • Updated Sep 25, 2025 • 14 • 3

liked a dataset 9 months ago

FreedomIntelligence/ExpressiveSpeech

Viewer • Updated Oct 24, 2025 • 10.8k • 375 • 10

published a dataset 9 months ago

FreedomIntelligence/ExpressiveSpeech

Viewer • Updated Oct 24, 2025 • 10.8k • 375 • 10

upvoted a paper 10 months ago

TalkVid: A Large-Scale Diversified Dataset for Audio-Driven Talking Head Synthesis

Paper • 2508.13618 • Published Aug 19, 2025 • 19

liked a model 11 months ago

emotion2vec/emotion2vec_plus_large

Updated Jun 24, 2024 • 988 • 77

liked a model 12 months ago

audeering/wav2vec2-large-robust-12-ft-emotion-msp-dim

Audio Classification • 0.2B • Updated Sep 19, 2024 • 653k • 169

upvoted a paper about 1 year ago

ShareGPT-4o-Image: Aligning Multimodal Models with GPT-4o-Level Image Generation

Paper • 2506.18095 • Published Jun 22, 2025 • 67

liked a model about 1 year ago

stepfun-ai/Step-Audio-AQAA

137B • Updated Jun 12, 2025 • 23 • 49

Zhiyu

AI & ML interests

Recent Activity

Organizations

zylin1's activity

Qwen3 Omni Captioner Demo