<i>M</i><sup>2</sup>-MedDialog: A Dataset and Benchmarks for Multi-domain Multi-service Medical Dialogues

G. Yan; J. Pei; P. Ren; Z. Chen; Z. Ren; H. Liang

doi:https://doi.org/10.48550/arXiv.2109.00430

M²-MedDialog: A Dataset and Benchmarks for Multi-domain Multi-service Medical Dialogues

Authors	G. Yan J. Pei P. Ren Z. Chen Z. Ren H. Liang
Publication date	08-09-2021
Edition	v3
Number of pages	9
Publisher	ArXiv
Organisations	Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract	Medical dialogue systems (MDSs) aim to assist doctors and patients with a range of professional medical services, i.e., diagnosis, consultation, and treatment. However, one-stop MDS is still unexplored because: (1) no dataset has so large-scale dialogues contains both multiple medical services and fine-grained medical labels (i.e., intents, slots, values); (2) no model has addressed a MDS based on multiple-service conversations in a unified framework. In this work, we first build a Multiple-domain Multiple-service medical dialogue (M²-MedDialog)dataset, which contains 1,557 conversations between doctors and patients, covering 276 types of diseases, 2,468 medical entities, and 3 specialties of medical services. To the best of our knowledge, it is the only medical dialogue dataset that includes both multiple medical services and fine-grained medical labels. Then, we formulate a one-stop MDS as a sequence-to-sequence generation problem. We unify a MDS with causal language modeling and conditional causal language modeling, respectively. Specifically, we employ several pretrained models (i.e., BERT-WWM, BERT-MED, GPT2, and MT5) and their variants to get benchmarks on M²-MedDialog dataset. We also propose pseudo labeling and natural perturbation methods to expand M2-MedDialog dataset and enhance the state-of-the-art pretrained models. We demonstrate the results achieved by the benchmarks so far through extensive experiments on M²-MedDialog. We release the dataset, the code, as well as the evaluation scripts to facilitate future research in this important research direction.
Document type	Preprint
Note	Versions v1 and v2 (2021) and v4 (2022) also available on ArXiv.
Language	English
Published at	https://doi.org/10.48550/arXiv.2109.00430 (Final published version)
Downloads	M2-MedDialog. v3 (Submitted manuscript)
Permalink to this page

Back

UvA-DARE

Digital Academic Repository

M²-MedDialog: A Dataset and Benchmarks for Multi-domain Multi-service Medical Dialogues

UvA-DARE

Digital Academic Repository

M2-MedDialog: A Dataset and Benchmarks for Multi-domain Multi-service Medical Dialogues

M²-MedDialog: A Dataset and Benchmarks for Multi-domain Multi-service Medical Dialogues