- Text mining scientific papers: a survey on FCA-based information retrieval research
- Lecture Notes in Computer Science
- Pages (from-to)
- Document type
- Faculty of Economics and Business (FEB)
- Amsterdam Business School Research Institute (ABS-RI)
Formal Concept Analysis (FCA) is an unsupervised clustering technique and many scientific papers are devoted to applying FCA in Information Retrieval (IR) research. We collected 103 papers published between 2003-2009 which mention FCA and information retrieval in the abstract, title or keywords. Using a prototype of our FCA-based toolset CORDIET, we converted the pdf-files containing the papers to plain text, indexed them with Lucene using a thesaurus containing terms related to FCA research and then created the concept lattice shown in this paper. We visualized, analyzed and explored the literature with concept lattices and discovered multiple interesting research streams in IR of which we give an extensive overview. The core contributions of this paper are the innovative application of FCA to the text mining of scientific papers and the survey of the FCA-based IR research.
- go to publisher's site
- Proceedings title: Advances in data mining: applications and theoretical aspects: 12th Industrial Conference, ICDM 2012, Berlin,
Germany, July 13-20 2012: proceedings
Place of publication: Heidelberg
Editors: P. Perner
If you believe that digital publication of certain material infringes any of your rights or (privacy) interests, please let the Library know, stating your reasons. In case of a legitimate complaint, the Library will make the material inaccessible and/or remove it from the website. Please Ask the Library, or send a letter to: Library of the University of Amsterdam, Secretariat, Singel 425, 1012 WP Amsterdam, The Netherlands. You will be contacted as soon as possible.