Title: Don't be afraid of simpler patterns
Authors: Bringmann, Bj√∂rn ×
Zimmermann, Albrecht
De Raedt, Luc
Nijssen, Siegfried #
Issue Date: Sep-2006
Publisher: Springer
Series Title: Lecture Notes in Computer Science vol:4213 pages:55-66
Conference: 10th European Conference on Principle and Practice of Knowledge Discovery in Databases location:Berlin, GERMANY date:18-22 Sept 2006
Abstract: This paper investigates the trade-off between the expressiveness of the pattern language and the performance of the pattern miner in structured data mining. This trade-off is investigated in the context of correlated pattern mining, which is concerned with finding the k-best patterns according to a convex criterion, for the pattern languages of itemsets, multi-itemsets, sequences, trees and graphs. The criteria used in our investigation are the typical ones in data mining: computational cost and predictive accuracy and the domain is that of mining molecular graph databases. More specifically, we provide empirical answers to the following questions: how does the expressive power of the language affect the computational cost? and what is the trade-off between expressiveness of the pattern language and the predictive accuracy of the learned model? While answering the first question, we also introduce a novel stepwise approach to correlated pattern mining in which the results of mining a simpler pattern language are employed as a starting point for mining in a more complex one. This stepwise approach typically leads to significant speed-ups (up to a factor 1000) for mining graphs.
Description: acceptance rate = 8.8%
ISSN: 0302-9743
Publication status: published
KU Leuven publication type: IT
Appears in Collections:Informatics Section
× corresponding author
# (joint) last author

Files in This Item:
File Status SizeFormat
42438.pdf Published 204KbAdobe PDFView/Open


All items in Lirias are protected by copyright, with all rights reserved.

© Web of science