Bayesian Prompt Learning for Image-Language Model Generalization

Open Access
Authors
Publication date 2023
Book title 2023 IEEE/CVF International Conference on Computer Vision
Book subtitle ICCV 2023 : Paris, France, 2-6 October 2023 : proceedings
ISBN
  • 9798350307191
ISBN (electronic)
  • 9798350307184
Event 2023 IEEE/CVF International Conference on Computer Vision (ICCV)
Pages (from-to) 15191-15200
Publisher Los Alamitos, California: IEEE Computer Society
Organisations
  • Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract
Foundational image-language models have generated considerable interest due to their efficient adaptation to downstream tasks by prompt learning. Prompt learning treats part of the language model input as trainable while freezing the rest, and optimizes an Empirical Risk Mini-mization objective. However, Empirical Risk Minimization is known to suffer from distributional shifts which hurt gen-eralizability to prompts unseen during training. By leveraging the regularization ability of Bayesian methods, we frame prompt learning from the Bayesian perspective and formulate it as a variational inference problem. Our approach regularizes the prompt space, reduces overfitting to the seen prompts and improves the prompt generalization on unseen prompts. Our framework is implemented by modeling the input prompt space in a probabilistic manner, as an a priori distribution which makes our proposal compatible with prompt learning approaches that are unconditional or conditional on the image. We demonstrate empirically on 15 benchmarks that Bayesian prompt learning provides an appropriate coverage of the prompt space, prevents learning spurious features, and exploits transferable invariant features. This results in better generalization of unseen prompts, even across different datasets and domains.Code available at: https://github.com/saic-fi/Bayesian-Prompt-Learning
Document type Conference contribution
Note With supplementary file
Language English
Published at https://doi.org/10.48550/arXiv.2210.02390 https://doi.org/10.1109/ICCV51070.2023.01398
Published at https://openaccess.thecvf.com/content/ICCV2023/html/Derakhshani_Bayesian_Prompt_Learning_for_Image-Language_Model_Generalization_ICCV_2023_paper.html
Other links https://www.proceedings.com/72328.html
Downloads
Supplementary materials
Permalink to this page
Back