AGREE

Official model weights for AGREE, introduced in the paper "Few-shot Acoustic Synthesis with Multimodal Flow Matching" by Amandine Brunetto (CVPR 2026).

AGREE is a joint embedding model for acoustics and room geometry. It learns a shared representation between room impulse responses (RIRs) and panoramic depth maps captured at the receiver position.

The model can be used for:

  • Evaluating geometry consistency of generated RIRs (via retrieval metrics and Fréchet distance, as done in the paper)
  • Downstream multimodal learning tasks involving acoustics and geometry
  • Audio-visual representation learning

This repository contains the pretrained weights. To run AGREE, please use the official codebase.


Available checkpoints

file description
AGREE_AR.ckpt Model trained on the Acoustic Rooms (AR) training set. Intended for downstream tasks.
AGREE_fullAR.ckpt Model trained on the full AR dataset. Used in the paper for evaluation of RIR generation.
AGREE_fullHAA.ckpt Model fine-tuned on the full HAA dataset, used for evaluation of RIR generation.

Download

Weights can be downloaded with:

huggingface-cli download AmandineBtto/AGREE --local-dir weights/AGREE
Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support