Automatic single-document key fact extraction from newswire articles

Open Access
Authors
Publication date 2009
Host editors
  • A. Lascarides
  • C. Gardent
  • J. Nivre
Book title Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics
Book subtitle EACL 2009: 30 March-3 April 2009, Megaron Athens International Conference Centre, Athens, Greece
ISBN (electronic)
  • 9781932432169
Event 12th Conference of the European Chapter of the Association for Computational Linguistics (EACL-09), Athens, Greece
Pages (from-to) 415-423
Publisher Stroudsburg, PA: Association for Computational Linguistics (ACL)
Organisations
  • Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract
This paper addresses the problem of extracting the most important facts from a news article. Our approach uses syntactic, semantic, and general statistical features to identify the most important sentences in a document. The importance of the individual features is estimated using generalized iterative scaling methods trained on an annotated newswire corpus. The performance of our approach is evaluated against 300 unseen news articles and shows that use of these features results in statistically significant improvements over a provenly robust baseline, as measured using metrics such as precision, recall and ROUGE.
Document type Conference contribution
Language English
Published at https://doi.org/10.3115/1609067.1609113
Published at https://aclanthology.org/E09-1048
Downloads
320577.pdf (Final published version)
Permalink to this page
Back