Title: Text mining scientific papers: a survey on FCA-based information retrieval research
Authors: Poelmans, Jonas
Ignatov, I.
Viaene, Stijn
Dedene, Guido
Kuznetsov, S.
Issue Date: 2012
Publisher: Springer
Host Document: Lecture Notes in Artificial Intelligence vol:7377 pages:273-287
Conference: Industrial Conference on Data Mining edition:12 location:Berlin (Germany) date:13-20 July 2013
Abstract: Formal Concept Analysis (FCA) is an unsupervised clustering technique and many scientific papers are devoted to applying FCA in Information Retrieval (IR) research. We collected 103 papers published between 2003-2009 which mention FCA and information retrieval in the abstract, title or keywords. Using a prototype of our FCA-based toolset CORDIET, we converted the pdf-files containing the papers to plain text, indexed them with Lucene using a thesaurus containing terms related to FCA research and then created the concept lattice shown in this paper. We visualized, analyzed and explored the literature with concept lattices and discovered multiple interesting research streams in IR of which we give an extensive overview. The core contributions of this paper are the innovative application of FCA to the text mining of scientific papers and the survey of the FCA-based IR research.
ISBN: 978-3-642-31487-2
978-3-642-31488-9 online
ISSN: 0302-9743
1611-3349 online
VABB publication type: VABB-5
Publication status: published
KU Leuven publication type: IC
Appears in Collections:Research Center for Management Informatics (LIRIS), Leuven
Department of Decision Sciences and Information Management, Leuven - miscellaneous
Algemene Diensten - UC Leuven

Files in This Item:
File Description Status SizeFormat
TextMiningScientific.pdf Published 604KbAdobe PDFView/Open Request a copy

These files are only available to some KU Leuven Association staff members


All items in Lirias are protected by copyright, with all rights reserved.