MusicGen Stereo Collection A collection of stereo music generation models as part of the v2 MusicGen release. • 4 items • Updated Apr 24, 2024 • 18
HDR Video Generation via Latent Alignment with Logarithmic Encoding Paper • 2604.11788 • Published 26 days ago • 10
ERNIE-Image Collection The serieas of image generation models, including text2img、img2img. • 2 items • Updated 24 days ago • 23
Audio Flamingo Next: Next-Generation Open Audio-Language Models for Speech, Sound, and Music Paper • 2604.10905 • Published 26 days ago • 28
OmniShow: Unifying Multimodal Conditions for Human-Object Interaction Video Generation Paper • 2604.11804 • Published 26 days ago • 71
Unsloth Dynamic 2.0 Quants Collection New 2.0 version of our Dynamic GGUF + Quants. Dynamic 2.0 achieves superior accuracy & SOTA quantization performance. • 89 items • Updated 10 days ago • 590
EXAONE 4.5 Collection LG's First Open-Weight Vision-Language Model for Industrial Intelligence • 5 items • Updated 16 days ago • 42
MOSS-Audio Collection An open-source audio understanding model supporting speech recognition, environmental sound analysis, music understanding, time-aware QA, and complex • 7 items • Updated 6 days ago • 55