1 8 3

Yifu Chen

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 days ago

Comprehensive Benchmarking of Long-Form Speech Generation in Diverse Scenarios

upvoted a paper 2 days ago

Towards Streaming Synchronized Spatial Audio Generation via Autoregressive Diffusion Transformer

upvoted a paper 2 days ago

SwanVoice: Expressive Long-Form Zero-Shot Speech Synthesis for Both Monologue and Dialogue

View all activity

Organizations

None yet

upvoted 3 papers 2 days ago

Comprehensive Benchmarking of Long-Form Speech Generation in Diverse Scenarios

Paper • 2605.28618 • Published 7 days ago • 27

Towards Streaming Synchronized Spatial Audio Generation via Autoregressive Diffusion Transformer

Paper • 2605.30940 • Published 5 days ago • 31

SwanVoice: Expressive Long-Form Zero-Shot Speech Synthesis for Both Monologue and Dialogue

Paper • 2605.30993 • Published 5 days ago • 50

submitted a paper to Daily Papers about 1 month ago

WavAlign: Enhancing Intelligence and Expressiveness in Spoken Dialogue Models via Adaptive Hybrid Post-Training

Paper • 2604.14932 • Published Apr 16 • 11

authored a paper about 1 month ago

WavChat: A Survey of Spoken Dialogue Models

Paper • 2411.13577 • Published Nov 15, 2024 • 2

upvoted a paper about 1 month ago

Dual-Axis Generative Reward Model Toward Semantic and Turn-taking Robustness in Interactive Spoken Dialogue Models

Paper • 2604.14920 • Published Apr 16 • 3

authored 3 papers about 1 month ago

WavAlign: Enhancing Intelligence and Expressiveness in Spoken Dialogue Models via Adaptive Hybrid Post-Training

Paper • 2604.14932 • Published Apr 16 • 11

Dual-Axis Generative Reward Model Toward Semantic and Turn-taking Robustness in Interactive Spoken Dialogue Models

Paper • 2604.14920 • Published Apr 16 • 3

WavRAG: Audio-Integrated Retrieval Augmented Generation for Spoken Dialogue Models

Paper • 2502.14727 • Published Feb 20, 2025 • 2

upvoted a paper about 1 month ago

VoxMind: An End-to-End Agentic Spoken Dialogue System

Paper • 2604.15710 • Published Apr 17 • 8

liked a dataset 4 months ago

WavBench/WavBench

Updated Mar 16 • 2.95k • 3

upvoted a paper 6 months ago

RULER-Bench: Probing Rule-based Reasoning Abilities of Next-level Video Generation Models for Vision Foundation Intelligence

Paper • 2512.02622 • Published Dec 2, 2025 • 10

published a model 12 months ago

1f/bvccf

Updated Jun 7, 2025

updated a model 12 months ago

1f/grsdfdf

Updated Jun 7, 2025

published a model 12 months ago

1f/grsdfdf

Updated Jun 7, 2025

published a dataset 12 months ago

1f/dsasd

Updated Jun 4, 2025 • 4

upvoted a paper about 1 year ago

WavReward: Spoken Dialogue Models With Generalist Reward Evaluators

Paper • 2505.09558 • Published May 14, 2025 • 10

updated a dataset about 1 year ago

1f/dwa

Preview • Updated May 15, 2025 • 8

published a dataset about 1 year ago

1f/dwa

Preview • Updated May 15, 2025 • 8

upvoted a paper over 1 year ago

WavTokenizer: an Efficient Acoustic Discrete Codec Tokenizer for Audio Language Modeling

Paper • 2408.16532 • Published Aug 29, 2024 • 50

Yifu Chen

AI & ML interests

Recent Activity

Organizations

1f's activity