Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Qingkai Fang's picture
11 7 21

Qingkai Fang

poeroz
21world's profile picture starkprince's profile picture SteveSHEN's profile picture
·
https://fangqingkai.github.io/
  • poeroz

AI & ML interests

Large Language Models, Speech-Language Models, Speech Translation

Organizations

Natural Language Processing Group, Institute of Computing Technology, Chinese Academy of Science's profile picture

upvoted 2 papers 3 months ago

Continuous Autoregressive Language Models

Paper • 2510.27688 • Published Oct 31, 2025 • 73

DeepAnalyze: Agentic Large Language Models for Autonomous Data Science

Paper • 2510.16872 • Published Oct 19, 2025 • 109
upvoted a paper 9 months ago

Efficient Speech Language Modeling via Energy Distance in Continuous Latent Space

Paper • 2505.13181 • Published May 19, 2025 • 9
upvoted a paper about 1 year ago

LLaVA-Mini: Efficient Image and Video Large Multimodal Models with One Vision Token

Paper • 2501.03895 • Published Jan 7, 2025 • 52
upvoted 2 papers over 1 year ago

Infinity-MM: Scaling Multimodal Performance with Large-Scale and High-Quality Instruction Data

Paper • 2410.18558 • Published Oct 24, 2024 • 18

LLaMA-Omni: Seamless Speech Interaction with Large Language Models

Paper • 2409.06666 • Published Sep 10, 2024 • 60
upvoted a paper almost 2 years ago

Finetuned Multimodal Language Models Are High-Quality Image-Text Data Filters

Paper • 2403.02677 • Published Mar 5, 2024 • 18
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs