机构:[1]School of Information Science and Technology, Sun Yat-sen University, Guangzhou, 510275, China[2]The 2nd Affiliated Hospital, Guangzhou University of Chinese Medicine, Guangzhou, 510120, China
Clustering analysis is an important technique used in many fields. But traditional clustering algorithms generally deal with numeric data. While clustering categorical data have always attracted researchers' attentions because of their prevalence in real life. This paper analyses limitations of the categorical clustering algorithms proposed. Based on two observations, a new similarity measure is proposed for categorical data which considers the unbalance of attributes. As the data are getting much larger and more dynamic, incremental is an important quality of good clustering algorithms. The clustering algorithm present is an incremental with linear computing complexity. The experiment results indicate that it outperforms other categorical clustering algorithms referred in the paper.
基金:
National Natural Science Foundation of ChinaNational Natural Science Foundation of China [60773198, 60703111]; Natural Science Foundation of Guangdong ProvinceNational Natural Science Foundation of Guangdong Province [06104916, 8151027501000021, 7300272]; New Century Excellent Talents in University of ChinaProgram for New Century Excellent Talents in University (NCET) [NCET-06-0727]; Research Foundation of Science and Technology Plan Project in Guangdong Province [2007B031403003]; National Key Technology R&D Program in the 11th Five year Plan of ChinaNational Key Technology R&D Program [2006BAI13B02]
语种:
外文
WOS:
第一作者:
第一作者机构:[1]School of Information Science and Technology, Sun Yat-sen University, Guangzhou, 510275, China
通讯作者:
推荐引用方式(GB/T 7714):
Jize Chen,Zhimin Yang,Jian Yin,et al.An Incremental Clustering with Attribute Unbalance Considered for Categorical Data[J].COMPUTATIONAL INTELLIGENCE AND INTELLIGENT SYSTEMS.2009,51:433-+.doi:10.1007/978-3-642-04962-0_50.
APA:
Jize Chen,Zhimin Yang,Jian Yin,Xiaobo Yang&Li Huang.(2009).An Incremental Clustering with Attribute Unbalance Considered for Categorical Data.COMPUTATIONAL INTELLIGENCE AND INTELLIGENT SYSTEMS,51,
MLA:
Jize Chen,et al."An Incremental Clustering with Attribute Unbalance Considered for Categorical Data".COMPUTATIONAL INTELLIGENCE AND INTELLIGENT SYSTEMS 51.(2009):433-+