Junkai Wu's picture

4

Junkai Wu

wjk0925

·

https://wjk0925.github.io/

wjk0925

AI & ML interests

None yet

Recent Activity

authored a paper about 10 hours ago

Learning Representations for New Sound Classes With Continual Self-Supervised Learning

authored a paper about 10 hours ago

Just ASR + LLM? A Study on Speech Large Language Models' Ability to Identify and Understand Speaker in Spoken Dialogue

authored a paper about 10 hours ago

SightSound-R1: Cross-Modal Reasoning Distillation from Vision to Audio Language Models

View all activity

Organizations

authored 4 papers about 10 hours ago

Learning Representations for New Sound Classes With Continual Self-Supervised Learning

Paper • 2205.07390 • Published May 15, 2022

Just ASR + LLM? A Study on Speech Large Language Models' Ability to Identify and Understand Speaker in Spoken Dialogue

Paper • 2409.04927 • Published Sep 7, 2024

SightSound-R1: Cross-Modal Reasoning Distillation from Vision to Audio Language Models

Paper • 2509.15661 • Published Sep 19, 2025 • 2

AVMeme Exam: A Multimodal Multilingual Multicultural Benchmark for LLMs' Contextual and Cultural Knowledge and Thinking

Paper • 2601.17645 • Published 3 days ago

upvoted a paper 6 months ago

DMOSpeech 2: Reinforcement Learning for Duration Prediction in Metric-Optimized Speech Synthesis

Paper • 2507.14988 • Published Jul 20, 2025 • 8

upvoted a paper 8 months ago

Personalized Safety in LLMs: A Benchmark and A Planning-Based Agent Approach

Paper • 2505.18882 • Published May 24, 2025 • 14

upvoted a paper 11 months ago

AAD-LLM: Neural Attention-Driven Auditory Scene Understanding

Paper • 2502.16794 • Published Feb 24, 2025 • 5

upvoted a paper over 1 year ago

Speech Slytherin: Examining the Performance and Efficiency of Mamba for Speech Separation, Recognition, and Synthesis

Paper • 2407.09732 • Published Jul 13, 2024 • 10