Learning Unseen Modality Interaction
| Authors | |
|---|---|
| Publication date | 2023 |
| Host editors | |
| Book title | 37th Conference on Neural Information Processing Systems (NeurIPS 2023) |
| Book subtitle | 10-16 December 2023, New Orleans, Louisiana, USA |
| ISBN (electronic) | |
| Series | Advances in Neural Information Processing Systems |
| Event | 37th Conference on Neural Information Processing Systems (NeurIPS 2023) |
| Number of pages | 11 |
| Publisher | Neural Information Processing Systems Foundation |
| Organisations | |
| Abstract |
Multimodal learning assumes all modality combinations of interest are available during training to learn cross-modal correspondences. In this paper, we challenge this modality-complete assumption for multimodal learning and instead strive for generalization to unseen modality combinations during inference. We pose the problem of unseen modality interaction and introduce a first solution. It exploits a module that projects the multidimensional features of different modalities into a common space with rich information preserved. This allows the information to be accumulated with a simple summation operation across available modalities. To reduce overfitting to less discriminative modality combinations during training, we further improve the model learning with pseudo-supervision indicating the reliability of a modality’s prediction. We demonstrate that our approach is effective for diverse tasks and modalities by evaluating it for multimodal video classification, robot state regression, and multimedia retrieval. Project website: https://xiaobai1217.github.io/Unseen-Modality-Interaction/.
|
| Document type | Conference contribution |
| Note | With supplementary file |
| Language | English |
| Published at | https://doi.org/10.48550/arXiv.2306.12795 |
| Published at | https://papers.nips.cc/paper_files/paper/2023/hash/abb4847bbd60f38b1b7649d26c7a0067-Abstract-Conference.html |
| Other links | https://xiaobai1217.github.io/Unseen-Modality-Interaction/ https://doi.org/10.52202/075280 |
| Downloads | NeurIPS-2023-learning-unseen-modality-interaction-Paper-Conference (Accepted author manuscript) |
| Supplementary materials | |
| Permalink to this page | |
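
The abstract describes projecting each modality's features into a common space so that whichever modalities are available can be combined by simple summation. The following is a minimal Python sketch of that idea only, not the authors' implementation. It rests on several assumptions: random linear projections stand in for the learned projection module, and the modality names, feature sizes, `COMMON_DIM`, and the `project` and `fuse` helpers are all hypothetical.

```python
import random

random.seed(0)

COMMON_DIM = 4  # hypothetical size of the shared feature space

# Hypothetical modalities and feature sizes, chosen only for illustration.
FEATURE_DIMS = {"video": 6, "audio": 5, "depth": 3}


def random_matrix(rows, cols):
    """Random weights standing in for a learned projection (an assumption)."""
    return [[random.gauss(0.0, 1.0) for _ in range(cols)] for _ in range(rows)]


# One projection per modality, each mapping into the same common space.
PROJECTIONS = {
    name: random_matrix(dim, COMMON_DIM) for name, dim in FEATURE_DIMS.items()
}


def project(name, vector):
    """Map one modality's feature vector into the common space."""
    matrix = PROJECTIONS[name]
    if len(vector) != len(matrix):
        raise ValueError(f"expected {len(matrix)} features for {name!r}")
    return [
        sum(value * row[j] for value, row in zip(vector, matrix))
        for j in range(COMMON_DIM)
    ]


def fuse(features):
    """Sum the projected features of whichever modalities are present."""
    if not features:
        raise ValueError("at least one modality is required")
    fused = [0.0] * COMMON_DIM
    for name, vector in features.items():
        projected = project(name, vector)
        fused = [a + b for a, b in zip(fused, projected)]
    return fused


video = [random.gauss(0.0, 1.0) for _ in range(FEATURE_DIMS["video"])]
audio = [random.gauss(0.0, 1.0) for _ in range(FEATURE_DIMS["audio"])]
depth = [random.gauss(0.0, 1.0) for _ in range(FEATURE_DIMS["depth"])]

# A combination that could be observed during training.
seen = fuse({"video": video, "audio": audio})
# A combination that never co-occurred during training.
unseen = fuse({"video": video, "depth": depth})
```

Because fusion here is a plain sum, any subset of modalities produces a vector of the same size in the same space, so a single downstream predictor could in principle accept a combination it never saw jointly. The pseudo-supervision on prediction reliability mentioned in the abstract is not modelled in this sketch.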
