Title: Representations for multi-document event clustering
Authors: De Smet, Wim
Moens, Marie-Francine # ×
Issue Date: 2013
Publisher: Kluwer Academic Publishers
Series Title: Data Mining and Knowledge Discovery vol:26 issue:3 pages:533-558
Abstract: We study several techniques for representing, fusing and comparing content representations of news documents. As underlying models we consider the vector space model (both in a term setting and in a latent semantic analysis setting) and probabilistic topic models based on latent Dirichlet allocation. Content terms can be classified as topical terms or named entities, yielding several models for content fusion and comparison. All used methods are completely unsupervised. We find that simple methods can still outperform the current state-of-the-art techniques.
ISSN: 1384-5810
Publication status: published
KU Leuven publication type: IT
Appears in Collections:Informatics Section
× corresponding author
# (joint) last author

Files in This Item:
File Description Status SizeFormat
DeSmetMoensDMKD2012.pdfMain article Published 465KbAdobe PDFView/Open


All items in Lirias are protected by copyright, with all rights reserved.

© Web of science