Morioh-cho

university

https://ntu.edu.tw

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

dlion168 submitted a paper 7 days ago

Speaker Identity in Non-Verbal Vocalizations: Conditional Distillation and Mixture of Experts Approach

huckiyang authored a paper 21 days ago

Nemotron 3 Nano Omni: Efficient and Open Multimodal Intelligence

huckiyang authored a paper 21 days ago

The Interspeech 2026 Audio Reasoning Challenge: Evaluating Reasoning Process Quality for Audio Reasoning Models and Agents

View all activity

submitted a paper to Daily Papers 7 days ago

Speaker Identity in Non-Verbal Vocalizations: Conditional Distillation and Mixture of Experts Approach

Paper • 2606.21215 • Published 14 days ago

authored 2 papers 21 days ago

Nemotron 3 Nano Omni: Efficient and Open Multimodal Intelligence

Paper • 2604.24954 • Published Apr 27 • 26

The Interspeech 2026 Audio Reasoning Challenge: Evaluating Reasoning Process Quality for Audio Reasoning Models and Agents

Paper • 2602.14224 • Published Feb 15

authored 12 papers 3 months ago

SAKE: Towards Editing Auditory Attribute Knowledge of Large Audio-Language Models

Paper • 2510.16917 • Published Oct 19, 2025 • 20

Investigating Safety Vulnerabilities of Large Audio-Language Models Under Speaker Emotional Variations

Paper • 2510.16893 • Published Oct 19, 2025 • 18

Extending Automatic Machine Translation Evaluation to Book-Length Documents

Paper • 2509.17249 • Published Sep 21, 2025

Long Grounded Thoughts: Distilling Compositional Visual Reasoning Chains at Scale

Paper • 2511.05705 • Published Nov 7, 2025 • 10

UALM: Unified Audio Language Model for Understanding, Generation and Reasoning

Paper • 2510.12000 • Published Oct 13, 2025 • 1

ESPnet-SpeechLM: An Open Speech Language Model Toolkit

Paper • 2502.15218 • Published Feb 21, 2025

PRiSM: Benchmarking Phone Realization in Speech Models

Paper • 2601.14046 • Published Jan 20 • 7

TimeOmni-1: Incentivizing Complex Reasoning with Time Series in Large Language Models

Paper • 2509.24803 • Published Sep 29, 2025

An Investigation of Incorporating Mamba for Speech Enhancement

Paper • 2405.06573 • Published May 10, 2024

How Auditory Knowledge in LLM Backbones Shapes Audio Language Models: A Holistic Evaluation

Paper • 2603.19195 • Published Mar 19 • 4

Bagpiper: Solving Open-Ended Audio Tasks via Rich Captions

Paper • 2602.05220 • Published Feb 5

Audio Flamingo Next: Next-Generation Open Audio-Language Models for Speech, Sound, and Music

Paper • 2604.10905 • Published Apr 13 • 29

updated a model 3 months ago

Morioh/toilet

Updated Mar 5, 2025

authored a paper 3 months ago

How Auditory Knowledge in LLM Backbones Shapes Audio Language Models: A Holistic Evaluation

Paper • 2603.19195 • Published Mar 19 • 4

submitted a paper to Daily Papers 3 months ago

How Auditory Knowledge in LLM Backbones Shapes Audio Language Models: A Holistic Evaluation

Paper • 2603.19195 • Published Mar 19 • 4

submitted a paper to Daily Papers 3 months ago

Nudging Hidden States: Training-Free Model Steering for Chain-of-Thought Reasoning in Large Audio-Language Models

Paper • 2603.14636 • Published Mar 15 • 4

authored a paper 3 months ago

A Preliminary Exploration with GPT-4o Voice Mode

Paper • 2502.09940 • Published Feb 14, 2025