On Vacation 🏝️

Amelie Royer

ameroyer

https://ameroyer.github.io

AI & ML interests

Computer Vision, Domain Adaptation, Conditional Architectures

Recent Activity

new activity about 2 months ago

kyutai/CASA-Helium1-VL-2B:Huggingface space?

updated a Space about 2 months ago

kyutai/casa-samples

upvoted a paper about 1 month ago

One View Is Enough! Monocular Training for In-the-Wild Novel View Generation

Paper • 2603.23488 • Published Mar 24 • 5

upvoted 6 papers 4 months ago

Molmo2: Open Weights and Data for Vision-Language Models with Video Understanding and Grounding

Paper • 2601.10611 • Published Jan 15 • 34

LTX-2: Efficient Joint Audio-Visual Foundation Model

Paper • 2601.03233 • Published Jan 6 • 178

Moshi: a speech-text foundation model for real-time dialogue

Paper • 2410.00037 • Published Sep 17, 2024 • 16

Vision-Speech Models: Teaching Speech Models to Converse about Images

Paper • 2503.15633 • Published Mar 19, 2025 • 2

ARC-Encoder: learning compressed text representations for large language models

Paper • 2510.20535 • Published Oct 23, 2025 • 8

CASA: Cross-Attention via Self-Attention for Efficient Vision-Language Fusion

Paper • 2512.19535 • Published Dec 22, 2025 • 12