DOI:10.2298/CSIS100420034V
Ontology-based multi-label classification of economic articles
- Jožef Stefan International Postgraduate School
Jamova 39, 1000 Ljubljana, Slovenia
sergeja.sabo@mps.si - University of Ljubljana, Faculty of Computer and Information Science
Tržaška cesta 25, 1000 Ljubljana, Slovenia
zoran.bosnic@fri.uni-lj.si
Abstract
The paper presents an approach to the task of automatic document categorization in the field of economics. Since the documents can be annotated with multiple keywords (labels), we approach this task by applying and evaluating multi-label classification methods of supervised machine learning. We describe forming a test corpus of 1015 economic documents that we automatically classify using a tool which integrates ontology construction with text mining methods. In our experimental work, we evaluate three groups of multi-label classification approaches: transformation to single-class problems, specialized multi-label models, and hierarchical/ranking models. The classification accuracies of all tested classification models indicate that there is a potential for using all of the evaluated methods to solve this task. The results show the benefits of using complex groups of approaches which benefit from exploiting dependence between the labels. A good alternative to these approaches is also single-class naive Bayes classifiers coupled with the binary relevance transformation approach.
Key words
ontology, multi-label classification, machine learning, text categorization, economics, document classification
Digital Object Identifier (DOI)
https://doi.org/10.2298/CSIS100420034V
Publication information
Volume 8, Issue 1 (January 2011)
Year of Publication: 2011
ISSN: 2406-1018 (Online)
Publisher: ComSIS Consortium
Full text
Available in PDF
Portable Document Format
How to cite
Vogrinčič, S., Bosnić, Z.: Ontology-based multi-label classification of economic articles. Computer Science and Information Systems, Vol. 8, No. 1, 101-119. (2011), https://doi.org/10.2298/CSIS100420034V