Xinfa Zhu

xfzhu
https://orcid.org/0000-0001-9275-523X
  • zxf-icpc

AI & ML interests

Speech Generation

Organizations

Qwen

upvoted a collection 2 months ago

Qwen3-TTS

Collection
7 items • Updated Jan 22 • 335
upvoted a collection about 1 year ago

Llasa

Collection
TTS foundation model compatible with Llama framework (160k hours tokenized speech data released) • 11 items • Updated May 11, 2025 • 20
upvoted an article about 1 year ago
Article

From Llasa to Llasagna 🍕: Finetuning LLaSA to generates Italian speech and other languages

Feb 11, 2025 • 34
upvoted a paper about 1 year ago

Llasa: Scaling Train-Time and Inference-Time Compute for Llama-based Speech Synthesis

Paper • 2502.04128 • Published Feb 6, 2025 • 27
upvoted a paper over 1 year ago

Enhancing the Reasoning Ability of Multimodal Large Language Models via Mixed Preference Optimization

Paper • 2411.10442 • Published Nov 15, 2024 • 87