Learning to request guidance in emergent communication

Open Access
Authors
  • Benjamin Kolb
  • Leon Lang
  • Henning Bartsch
  • Arwin Gansekoele
  • Raymond Koopmanschap
  • Leonardo Romor
  • David Speck
  • Mathijs Mul
  • Elia Bruni
Publication date 2019
Host editors
  • A. Mogadala
  • D. Klakow
  • S. Pezzelle
  • M.-F. Moens
Book title Beyond Vision and LANguage: inTEgrating Real-world kNowledge (LANTERN)
Book subtitle EMNLP-IJCNLP 2019 : proceedings of the workshop : November 3, 2019, Hong Kong, China
ISBN (electronic)
  • 9781950737758
Event 1st Workshop on Beyond Vision and LANguage: inTEgrating Real-World kNowledge, LANTERN@EMNLP-IJCNLP 2019
Pages (from-to) 41-50
Number of pages 10
Publisher Stroudsburg, PA: The Association for Computational Linguistics
Organisations
  • Interfacultary Research - Institute for Logic, Language and Computation (ILLC)
  • Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract

Previous research into agent communication has shown that a pre-trained guide can speed up the learning process of an imitation learning agent. The guide achieves this by providing the agent with discrete messages in an emerged language about how to solve the task. We extend this one-directional communication by a one-bit communication channel from the learner back to the guide: It is able to ask the guide for help, and we limit the guidance by penalizing the learner for these requests. During training, the agent learns to control this gate based on its current observation. We find that the amount of requested guidance decreases over time and guidance is requested in situations of high uncertainty. We investigate the agent's performance in cases of open and closed gates and discuss potential motives for the observed gating behavior.

Document type Conference contribution
Language English
Published at https://doi.org/10.18653/v1/D19-6407
Other links https://www.scopus.com/pages/publications/85119376881
Downloads
D19-6407 (Final published version)
Permalink to this page
Back