Automatic thresholding by sampling documents and estimating recall ILPs@UVA at Tar task 2.2

Open Access
Authors
Publication date 2019
Host editors
  • L. Cappellato
  • N. Ferro
  • D.E. Losada
  • H. Müller
Book title Working Notes of CLEF 2019 - Conference and Labs of the Evaluation Forum
Book subtitle Lugano, Switzerland, September 9-12, 2019
Series CEUR Workshop Proceedings
Event 10th International Conference of the CLEF Association, CLEF 2019
Article number 187
Number of pages 9
Publisher Aachen: CEUR-WS
Organisations
  • Faculty of Science (FNWI)
  • Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract

In this paper, we describe the participation of the Information and Language Processing System (ILPS) group at CLEF eHealth 2019 Task 2.2: Technologically Assisted Reviews in Empirical Medicine. This task is targeted to produce an efficient ordering of the documents and to identify a subset of the documents which contains as many of the relevant abstracts for the least effort. Participants are provided with systematic review topics with each including a review title, a boolean query constructed by Cochrane experts, and a set of PubMed Document Identifiers (PID's) returned by running the boolean query in MEDLINE. We handle the problem under the Continuous Active Learning framework by jointly training a ranking model to rank documents, and conducting a “greedy” sampling to estimate the real number of relevant documents in the collection. We finally submitted four runs.

Document type Conference contribution
Language English
Published at http://ceur-ws.org/Vol-2380/paper_187.pdf
Other links http://ceur-ws.org/Vol-2380/ https://www.scopus.com/pages/publications/85070498275
Downloads
paper_187 (Final published version)
Permalink to this page
Back