Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
CelesteChen 's Collections
audio-visual foundation model
visual thinker
agent
creative-writing
multimodal
RL infra
application
acceleration
confidence
deepsearch
models
code
diffusion
multilingual
reasoning
RAG
others
long-context
math
Align
LLM-general

audio-visual foundation model

updated 4 days ago
Upvote
-

  • LTX-2: Efficient Joint Audio-Visual Foundation Model

    Paper • 2601.03233 • Published 5 days ago • 87
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs