Adapting K-Means Algorithm for Discovering Clusters in Subspaces

UTSePress Research/Manakin Repository

Search UTSePress Research


Advanced Search

Browse

My Account

Show simple item record

dc.contributor.author Zhao Yanchang en_US
dc.contributor.author Zhang Chengqi en_US
dc.contributor.author Zhang Shichao en_US
dc.contributor.author Zhao Lianwei en_US
dc.contributor.editor Zhou, X; Li, J; Shen, H; Kitsuregaura, M; Zhang, Y en_US
dc.date.accessioned 2009-11-09T05:36:24Z
dc.date.available 2009-11-09T05:36:24Z
dc.date.issued 2006 en_US
dc.identifier 2006005164 en_US
dc.identifier.citation Zhao Yanchang et al. 2006, 'Adapting K-Means Algorithm for Discovering Clusters in Subspaces', Springer, Berlin, Germany, pp. 53-62. en_US
dc.identifier.issn 978-3-540-31142-3 en_US
dc.identifier.other E1 en_US
dc.identifier.uri http://hdl.handle.net/10453/2755
dc.description.abstract Subspace clustering is a challenging task in the field of data mining. Traditional distance measures fail to differentiate the furthest point from the nearest point in very high dimensional data space. To tackle the problem, we design minimal subspace distance which measures the similarity between two points in the subspace where they are nearest to each other. It can discover subspace clusters implicitly when measuring the similarities between points. We use the new similarity measure to improve traditional k-means algorithm for discovering clusters in subspaces. By clustering with low-dimensional minimal subspace distance first, the clusters in low-dimensional subspaces are detected. Then by gradually increasing the dimension of minimal subspace distance, the clusters get refined in higher dimensional subspaces. Our experiments on both synthetic data and real data show the effectiveness of the proposed similarity measure and algorithm. en_US
dc.publisher Springer-Verlag en_US
dc.relation.isbasedon http://dx.doi.org/10.1007/11610113_6 en_US
dc.subject Data mining. en
dc.subject Subspace clustering. en
dc.subject Cluster analysis. en
dc.title Adapting K-Means Algorithm for Discovering Clusters in Subspaces en_US
dc.parent Frontiers of WWW Research and Development - APWeb 2006 en_US
dc.journal.volume 3841 en_US
dc.journal.number en_US
dc.publocation Berlin, Germany en_US
dc.identifier.startpage 53 en_US
dc.identifier.endpage 62 en_US
dc.cauo.name Information Technology en_US
dc.conference 8th Asia-Pacific Web Conference Frontiers of WWW Research and Development - APWeb 2006 en_US
dc.conference.location Habin, China en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record