Loading [MathJax]/extensions/MathMenu.js
Adaptive Centroid-Based Clustering Algorithm for Text Document Data | IEEE Conference Publication | IEEE Xplore

Adaptive Centroid-Based Clustering Algorithm for Text Document Data


Abstract:

Document clustering is a significantly popularresearch, which aims to partition a corpus into many subgroupsof homogeneous documents. Traditional clustering approachescat...Show More

Abstract:

Document clustering is a significantly popularresearch, which aims to partition a corpus into many subgroupsof homogeneous documents. Traditional clustering approachescatholically lack of considerations of word weights with clusters. To address this problem, we propose an Adaptive CentroidbasedClustering (ACC) algorithm. As a successful supervisedcentroid-based classifier, Class-Feature-Centroid (CFC) algorithmtakes relationships among words into account. ACCattempts to employ this discriminative CFC vector to drive theclustering procedure. Since clustering is unsupervised, ACCbegins with hundreds of small clusters for acceptable CFCvectors, and then iteratively regroups clusters of documentsuntil convergence. As ACC is self-organized, it can determinethe number of clusters adaptively. The experimental resultsvalidate that ACC achieves competitive performance with thestate-of-art clustering approaches.
Date of Conference: 13-15 July 2014
Date Added to IEEE Xplore: 07 October 2014
ISBN Information:

ISSN Information:

Conference Location: Beijing, China

References

References is not available for this document.