DOI:10.2298/CSIS100906003W
Integrating Instance-level and Attribute-level Knowledge into Document Clustering
- School of Computer Engineering, Qingdao Technological University
266033 Qingdao, China
{wangjinlong,shunyaowu}@gmail.com - School of Information Technology, Deakin University
3125, Victoria, Australia
gang.li@deakin.edu.au - Medical College of Qingdao University
266021 Qingdao, China - State Key Laboratory of CAD&CG, Zhejiang University
310027 Hangzhou, China - SANYHE International Holding Co., Ltd,
110027 Shenyang, China
Abstract
In this paper, we present a document clustering framework incorporating instance-level knowledge in the form of pairwise constraints and attribute-level knowledge in the form of keyphrases. Firstly, we initialize weights based on metric learning with pairwise constraints, then simultaneously learn two kinds of knowledge by combining the distance-based and the constraint-based approaches, finally evaluate and select clustering result based on the degree of users’ satisfaction. The experimental results demonstrate the effectiveness and potential of the proposedmethod.
Key words
document clustering, pairwise constraints, keyphrases
Digital Object Identifier (DOI)
https://doi.org/10.2298/CSIS100906003W
Publication information
Volume 8, Issue 3 (June 2011)
Year of Publication: 2011
ISSN: 2406-1018 (Online)
Publisher: ComSIS Consortium
Full text
Available in PDF
Portable Document Format
How to cite
Wang, J., Wu, S., Li, G., Wei, Z.: Integrating Instance-level and Attribute-level Knowledge into Document Clustering. Computer Science and Information Systems, Vol. 8, No. 3, 635-651. (2011), https://doi.org/10.2298/CSIS100906003W