Proceedings of the 9th international conference on advanced data mining and applications pages:517-528
Lecture Notes in Computer Science 8346
9th international conference on advanced data mining and applications location:Hangzhou, China date:14-16 December 2013
Labeling discussion forum messages with the cognitive levels of Bloom’s taxonomy is time-consuming and expensive, both because of the large amount of information to be labeled and because an expert in the educational field is needed to apply the taxonomy to the messages. In this paper we present a framework for automatically labeling messages from discussion forums with the categories of Bloom’s taxonomy. Several models were created using three kinds of machine learning approaches: linear, rule-based and combined classifiers. The models are evaluated using accuracy, the F1-measure and the area under the ROC curve. Additionally, the statistical significance of the results is assessed with McNemar's test in order to validate them. The results show that combining a linear classifier with a rule-based classifier yields promising results for this difficult task.
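As an illustration of the significance check mentioned in the abstract, the following is a minimal sketch of McNemar's test for two classifiers evaluated on the same messages. It is not taken from the paper: the function name `mcnemar_test`, the example labels, and the use of the continuity-corrected chi-square form are all assumptions made for illustration.

```python
import math


def mcnemar_test(y_true, pred_a, pred_b):
    """Continuity-corrected McNemar's test for two paired classifiers.

    Hypothetical helper (not from the paper). Returns (statistic, p_value).
    """
    # b: messages classifier A labels correctly and classifier B does not.
    b = sum(1 for t, a, p in zip(y_true, pred_a, pred_b) if a == t and p != t)
    # c: messages classifier B labels correctly and classifier A does not.
    c = sum(1 for t, a, p in zip(y_true, pred_a, pred_b) if a != t and p == t)

    if b + c == 0:
        # The classifiers never disagree in correctness: no evidence of a difference.
        return 0.0, 1.0

    # Chi-square statistic with one degree of freedom and continuity correction.
    stat = (abs(b - c) - 1) ** 2 / (b + c)
    # Survival function of the chi-square distribution with 1 degree of freedom.
    p_value = math.erfc(math.sqrt(stat / 2))
    return stat, p_value


# Illustrative labels only (assumed, not data from the paper).
y_true = ["analysis"] * 10
linear = ["analysis"] * 10                      # always correct
rules = ["analysis"] * 2 + ["knowledge"] * 8    # wrong on 8 messages
stat, p_value = mcnemar_test(y_true, linear, rules)
print(f"statistic={stat:.3f}, p-value={p_value:.4f}")
```

Only the messages on which exactly one of the two classifiers is correct enter the statistic, which is why the test suits comparisons of models evaluated on the same labeled forum messages. With few disagreements, an exact binomial version of the test is usually preferred over this chi-square approximation.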