Nemotron-Orbax with JAX Collection Distributed Checkpointing Conversion on Nemotron Series • 1 item • Updated 5 days ago
Nemotron 3 Nano Omni: Efficient and Open Multimodal Intelligence Paper • 2604.24954 • Published Apr 27 • 26
The Interspeech 2026 Audio Reasoning Challenge: Evaluating Reasoning Process Quality for Audio Reasoning Models and Agents Paper • 2602.14224 • Published Feb 15
Nemotron-Orbax with JAX Collection Distributed Checkpointing Conversion on Nemotron Series • 1 item • Updated 5 days ago
SAKE: Towards Editing Auditory Attribute Knowledge of Large Audio-Language Models Paper • 2510.16917 • Published Oct 19, 2025 • 20
Investigating Safety Vulnerabilities of Large Audio-Language Models Under Speaker Emotional Variations Paper • 2510.16893 • Published Oct 19, 2025 • 18
Extending Automatic Machine Translation Evaluation to Book-Length Documents Paper • 2509.17249 • Published Sep 21, 2025
Long Grounded Thoughts: Distilling Compositional Visual Reasoning Chains at Scale Paper • 2511.05705 • Published Nov 7, 2025 • 10
UALM: Unified Audio Language Model for Understanding, Generation and Reasoning Paper • 2510.12000 • Published Oct 13, 2025 • 1