Causal video summarizer for video exploration

Authors
Publication date 2022
Book title ICME 2022 : conference proceedings
Book subtitle IEEE International Conference on Multimedia and Expo 2022
ISBN
  • 9781665485647
ISBN (electronic)
  • 9781665485630
Event 2022 IEEE International Conference on Multimedia and Expo
Pages (from-to) 349-354
Number of pages 6
Publisher Piscataway, NJ: IEEE
Organisations
  • Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract
Recently, video summarization has been proposed as a method to help video exploration. However, traditional video summarization models only generate a fixed video summary which is usually independent of user-specific needs and hence limits the effectiveness of video exploration. Multi-modal video summarization is one of the approaches utilized to address this issue. Multi-modal video summarization has a video input and a text-based query input. Hence, effective modeling of the interaction between a video input and text-based query is essential to multi-modal video summarization. In this work, a new causality-based method named Causal Video Summarizer (CVS) is proposed to effectively capture the interactive information between the video and query to tackle the task of multi-modal video summarization. The proposed method consists of a probabilistic encoder and a probabilistic decoder. Based on the evaluation of the existing multi-modal video summarization dataset, experimental results show that the proposed approach is effective with the increase of +5.4% in accuracy and +4.92% increase of F1-score, compared with the state-of-the-art method.
Document type Conference contribution
Language English
Published at https://doi.org/10.1109/ICME52920.2022.9859948
Other links https://www.proceedings.com/65366.html
Permalink to this page
Back