Julia K

juliak115

AI & ML interests

None yet

Recent Activity

upvoted a paper about 11 hours ago

ChildVox: A Speech, Audio, and Large Audio-Language Model Benchmark in Understanding and Characterizing Sound across Childhood

upvoted a paper about 11 hours ago

AgentDoG 1.5: A Lightweight and Scalable Alignment Framework for AI Agent Safety and Security

upvoted a paper 3 months ago

DreamVideo-Omni: Omni-Motion Controlled Multi-Subject Video Customization with Latent Identity Reinforcement Learning

Organizations

None yet

upvoted 2 papers about 11 hours ago

ChildVox: A Speech, Audio, and Large Audio-Language Model Benchmark in Understanding and Characterizing Sound across Childhood

Paper • 2605.29257 • Published 1 day ago • 3

AgentDoG 1.5: A Lightweight and Scalable Alignment Framework for AI Agent Safety and Security

Paper • 2605.29801 • Published 1 day ago • 79

upvoted 2 papers 3 months ago

DreamVideo-Omni: Omni-Motion Controlled Multi-Subject Video Customization with Latent Identity Reinforcement Learning

Paper • 2603.12257 • Published Mar 12 • 31

Accent Vector: Controllable Accent Manipulation for Multilingual TTS Without Accented Data

Paper • 2603.07534 • Published Mar 8 • 5

upvoted 11 papers 4 months ago

End-to-End Joint ASR and Speaker Role Diarization with Child-Adult Interactions

Paper • 2601.17640 • Published Jan 25 • 6

daVinci-Dev: Agent-native Mid-training for Software Engineering

Paper • 2601.18418 • Published Jan 26 • 126

Quantifying Speaker Embedding Phonological Rule Interactions in Accented Speech Synthesis

Paper • 2601.14417 • Published Jan 20 • 5

HeartMuLa: A Family of Open Sourced Music Foundation Models

Paper • 2601.10547 • Published Jan 15 • 49

UniCorn: Towards Self-Improving Unified Multimodal Models through Self-Generated Supervision

Paper • 2601.03193 • Published Jan 6 • 51

Motion Attribution for Video Generation

Paper • 2601.08828 • Published Jan 13 • 72

mHC: Manifold-Constrained Hyper-Connections

Paper • 2512.24880 • Published Dec 31, 2025 • 328

Watching, Reasoning, and Searching: A Video Deep Research Benchmark on Open Web for Agentic Video Reasoning

Paper • 2601.06943 • Published Jan 11 • 214

upvoted 2 papers 10 months ago

Qwen-Image Technical Report

Paper • 2508.02324 • Published Aug 4, 2025 • 276

Voxlect: A Speech Foundation Model Benchmark for Modeling Dialects and Regional Languages Around the Globe

Paper • 2508.01691 • Published Aug 3, 2025 • 10

liked 3 models about 1 year ago

speechbrain/spkrec-xvect-voxceleb

Audio Classification • Updated Feb 25, 2024 • 35.1k • 66

tiantiaf/whisper-large-v3-narrow-accent

Audio Classification • 2B • Updated Aug 10, 2025 • 600 • 5

tiantiaf/whisper-large-v3-msp-podcast-emotion

Audio Classification • 2B • Updated Aug 10, 2025 • 21.2k • 5
