Neural Acoustic Processing Lab

university

https://naplab.ee.columbia.edu/

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

LinyangHe authored a paper 3 days ago

XCOMPS: A Multilingual Benchmark of Conceptual Minimal Pairs

LinyangHe authored a paper 3 days ago

AVMeme Exam: A Multimodal Multilingual Multicultural Benchmark for LLMs' Contextual and Cultural Knowledge and Thinking

LinyangHe authored a paper 3 days ago

SightSound-R1: Cross-Modal Reasoning Distillation from Vision to Audio Language Models

View all activity

authored 6 papers 3 days ago

XCOMPS: A Multilingual Benchmark of Conceptual Minimal Pairs

Paper • 2502.19737 • Published Feb 27, 2025

AVMeme Exam: A Multimodal Multilingual Multicultural Benchmark for LLMs' Contextual and Cultural Knowledge and Thinking

Paper • 2601.17645 • Published Jan 25 • 23

SightSound-R1: Cross-Modal Reasoning Distillation from Vision to Audio Language Models

Paper • 2509.15661 • Published Sep 19, 2025 • 2

Layer-wise Minimal Pair Probing Reveals Contextual Grammatical-Conceptual Hierarchy in Speech Representations

Paper • 2509.15655 • Published Sep 19, 2025 • 2

BabyBabelLM: A Multilingual Benchmark of Developmentally Plausible Training Data

Paper • 2510.10159 • Published Oct 11, 2025 • 3

A Very Big Video Reasoning Suite

Paper • 2602.20159 • Published Feb 23 • 523

xi-j

updated a dataset 3 months ago

naplab/AVMeme-Exam

Updated Jan 28 • 23 • 24

xi-j

in naplab/AVMeme-Exam 3 months ago

RuntimeError: Dataset scripts are no longer supported, but found AVMeme-Exam.py

#2 opened 3 months ago by

authored a paper 3 months ago

AVMeme Exam: A Multimodal Multilingual Multicultural Benchmark for LLMs' Contextual and Cultural Knowledge and Thinking

Paper • 2601.17645 • Published Jan 25 • 23

xi-j

submitted a paper to Daily Papers 3 months ago

AVMeme Exam: A Multimodal Multilingual Multicultural Benchmark for LLMs' Contextual and Cultural Knowledge and Thinking

Paper • 2601.17645 • Published Jan 25 • 23

authored 4 papers 3 months ago

Learning Representations for New Sound Classes With Continual Self-Supervised Learning

Paper • 2205.07390 • Published May 15, 2022

Just ASR + LLM? A Study on Speech Large Language Models' Ability to Identify and Understand Speaker in Spoken Dialogue

Paper • 2409.04927 • Published Sep 7, 2024

SightSound-R1: Cross-Modal Reasoning Distillation from Vision to Audio Language Models

Paper • 2509.15661 • Published Sep 19, 2025 • 2

AVMeme Exam: A Multimodal Multilingual Multicultural Benchmark for LLMs' Contextual and Cultural Knowledge and Thinking

Paper • 2601.17645 • Published Jan 25 • 23

updated a dataset 4 months ago

naplab/AVMeme-Exam

Updated Jan 28 • 23 • 24

authored 2 papers 6 months ago

SightSound-R1: Cross-Modal Reasoning Distillation from Vision to Audio Language Models

Paper • 2509.15661 • Published Sep 19, 2025 • 2

Layer-wise Minimal Pair Probing Reveals Contextual Grammatical-Conceptual Hierarchy in Speech Representations

Paper • 2509.15655 • Published Sep 19, 2025 • 2

published a dataset 7 months ago

naplab/AVMeme-Exam

Updated Jan 28 • 23 • 24

xi-j

authored 2 papers about 1 year ago

Speech Slytherin: Examining the Performance and Efficiency of Mamba for Speech Separation, Recognition, and Synthesis

Paper • 2407.09732 • Published Jul 13, 2024 • 10

Style-Talker: Finetuning Audio Language Model and Style-Based Text-to-Speech Model for Fast Spoken Dialogue Generation

Paper • 2408.11849 • Published Aug 13, 2024