The PhotoBook Dataset: Building Common Ground through Visually Grounded Dialogue
| Authors | |
|---|---|
| Publication date | 2019 |
| Host editors | |
| Book title | The 57th Annual Meeting of the Association for Computational Linguistics |
| Book subtitle | ACL 2019 : proceedings of the conference : July 28-August 2, 2019, Florence, Italy |
| ISBN (electronic) | |
| Event | The 57th Annual Meeting of the Association for Computational Linguistics - ACL 2019 |
| Pages (from-to) | 1895-1910 |
| Publisher | Stroudsburg, PA: The Association for Computational Linguistics |
| Organisations | |
| Abstract | This paper introduces the PhotoBook dataset, a large-scale collection of visually-grounded, task-oriented dialogues in English designed to investigate shared dialogue history accumulating during conversation. Taking inspiration from seminal work on dialogue analysis, we propose a data-collection task formulated as a collaborative game prompting two online participants to refer to images utilising both their visual context as well as previously established referring expressions. We provide a detailed description of the task setup and a thorough analysis of the 2,500 dialogues collected. To further illustrate the novel features of the dataset, we propose a baseline model for reference resolution which uses a simple method to take into account shared information accumulated in a reference chain. Our results show that this information is particularly important to resolve later descriptions and underline the need to develop more sophisticated models of common ground in dialogue interaction. |
| Document type | Conference contribution |
| Note | Later version also available. |
| Language | English |
| Related dataset | The PhotoBook Task and Dataset |
| Published at | https://doi.org/10.18653/v1/P19-1184 |
| Other links | https://vimeo.com/384520306 |
| Downloads | P19-1184v1 (Final published version); P19-1184v2 (Other version) |
| Permalink to this page | |