Title: Unsupervised learning of an IS-A taxonomy from a limited domain-specific corpus
Authors: Alfarone, Daniele
Davis, Jesse
Issue Date: May-2014
Publisher: Department of Computer Science, KU Leuven
Series Title: CW Reports vol:CW664
Abstract: This report addresses the problem of learning a taxonomy from a given domain-specific text corpus. We propose a novel unsupervised algorithm for this problem. Its key contributions include a clustering-based inference approach that increases recall over surface patterns and a graph-based algorithm for detecting incorrect edges that improves precision. Our system induces the taxonomy simply by analyzing the provided corpus. Thus, the learned taxonomy is focused on the concepts that are relevant for the specific corpus. An empirical evaluation on five corpora demonstrates the utility of the system.
Publication status: published
KU Leuven publication type: IR
Appears in Collections:Informatics Section

Files in This Item:
File Description Status SizeFormat
CW664.pdfDocument Published 850KbAdobe PDFView/Open


All items in Lirias are protected by copyright, with all rights reserved.