siyiwind
/

DyMo

Model card Files Files and versions

xet

Community

Add metadata and improve model card structure

by nielsr HF Staff - opened 13 days ago

base: refs/heads/main

←

from: refs/pr/1

Discussion Files changed

+40

-19

Files changed (1) hide show

README.md +40 -19

README.md CHANGED Viewed

@@ -1,19 +1,40 @@
-<div align="center">
-<h1><a href="https://openreview.net/forum?id=PWhDUWRVhM&noteId=PWhDUWRVhM">Inference-Time Dynamic Modality Selection for Incomplete Multimodal Classification (ICLR 2026)</a></h1>
-**[Siyi Du](https://scholar.google.com/citations?user=zsOt8MYAAAAJ&hl=en), [Xinzhe Luo](https://scholar.google.com/citations?user=l-oyIaAAAAAJ&hl=en&oi=ao), [Declan P. O'Regan](https://scholar.google.com/citations?user=85u-LbAAAAAJ&hl=en&oi=ao), and [Chen Qin](https://scholar.google.com/citations?view_op=list_works&hl=en&hl=en&user=mTWrOqHOqjoC&pagesize=80&sortby=pubdate)**
-[![](https://img.shields.io/badge/license-Apache--2.0-blue)](#License)
-[![](https://img.shields.io/badge/arXiv-2503.06277-b31b1b.svg)](https://arxiv.org/abs/2601.22853)
-</div>
-![DyMo](./Images/overview.jpg)
-<p align="center">(a-b) Evidence of the discarding-imputation dilemma: (a-1) vs. (a-2) recovery-free methods (e.g., ModDrop) learn less discriminative features because they ignore highly task-relevant missing modalities {M,T}; (b) recovery-based methods (e.g., MoPoE) generate unreliable reconstructions, e.g., low-fidelity (orange) or misaligned (yellow). (c) Our DyMo, which addresses the dilemma by dynamically fusing task-relevant recovered modalities, improving accuracy by 1.61% on PolyMNIST, 1.68% on MST, and 3.88% on CelebA (Tab 1).</p>
-This repository provides **pretrained model checkpoints** for [Inference-Time Dynamic Modality Selection for Incomplete Multimodal Classification](https://openreview.net/forum?id=PWhDUWRVhM&noteId=PWhDUWRVhM).
-For the **training code, evaluation scripts, and usage instructions**, please refer to the official GitHub repository:
-👉 [https://github.com/siyi-wind/DyMo](https://github.com/siyi-wind/DyMo).

+---
+license: apache-2.0
+pipeline_tag: image-classification
+---
+<div align="center">
+<h1><a href="https://huggingface.co/papers/2601.22853">Inference-Time Dynamic Modality Selection for Incomplete Multimodal Classification (ICLR 2026)</a></h1>
+**[Siyi Du](https://scholar.google.com/citations?user=zsOt8MYAAAAJ&hl=en), [Xinzhe Luo](https://scholar.google.com/citations?user=l-oyIaAAAAAJ&hl=en&oi=ao), [Declan P. O'Regan](https://scholar.google.com/citations?user=85u-LbAAAAAJ&hl=en&oi=ao), and [Chen Qin](https://scholar.google.com/citations?view_op=list_works&hl=en&hl=en&user=mTWrOqHOqjoC&pagesize=80&sortby=pubdate)**
+[![](https://img.shields.io/badge/license-Apache--2.0-blue)](#License)
+[![](https://img.shields.io/badge/arXiv-2601.22853-b31b1b.svg)](https://arxiv.org/abs/2601.22853)
+</div>
+![DyMo](./Images/overview.jpg)
+<p align="center">(a-b) Evidence of the discarding-imputation dilemma: (a-1) vs. (a-2) recovery-free methods (e.g., ModDrop) learn less discriminative features because they ignore highly task-relevant missing modalities {M,T}; (b) recovery-based methods (e.g., MoPoE) generate unreliable reconstructions, e.g., low-fidelity (orange) or misaligned (yellow). (c) Our DyMo, which addresses the dilemma by dynamically fusing task-relevant recovered modalities, improving accuracy by 1.61% on PolyMNIST, 1.68% on MST, and 3.88% on CelebA (Tab 1).</p>
+This repository provides **pretrained model checkpoints** for the paper [Inference-Time Dynamic Modality Selection for Incomplete Multimodal Classification](https://huggingface.co/papers/2601.22853).
+For the **training code, evaluation scripts, and usage instructions**, please refer to the official GitHub repository:
+👉 [https://github.com/siyi-wind/DyMo](https://github.com/siyi-wind/DyMo).
+## Abstract
+Multimodal deep learning (MDL) has achieved remarkable success across various domains, yet its practical deployment is often hindered by incomplete multimodal data. Existing incomplete MDL methods either discard missing modalities, risking the loss of valuable task-relevant information, or recover them, potentially introducing irrelevant noise, leading to the discarding-imputation dilemma. To address this dilemma, we propose DyMo, a new inference-time dynamic modality selection framework that adaptively identifies and integrates reliable recovered modalities, fully exploring task-relevant information beyond the conventional discard-or-impute paradigm.
+## Citation
+If you find this work useful, please cite:
+```bibtex
+@inproceedings{du2026dymo,
+  title={Inference-Time Dynamic Modality Selection for Incomplete Multimodal Classification},
+  author={Du, Siyi and Luo, Xinzhe and O'Regan, Declan P. and Qin, Chen},
+  booktitle={International Conference on Learning Representations (ICLR) 2026},
+  year={2026}
+}
+```