Protein interactions are of great biological interest because they orchestrate nearly all cellular processes and can further our understandings in biological processes and diseases. Protein interaction data like many real world datasets are imbalanced in nature. Most protein pairs belong to the non-interaction class and few belong to the interaction class. Most existing protein interaction prediction methods assume equal distribution of the positive and negative interaction data. In this study, we first analyze effects of various portions of negative samples on the performance of domain-based protein interaction prediction methods using Artificial Neural Network (ANN), Bayesian Network (BN), and SVM. Then we introduce cost-sensitive learning to address the class imbalance problem. Experimental results demonstrated that the addition of cost-sensitive learning to each classifier: ANN, BN, and SVM, indeed yields an increase in accuracy.
基金:
National Natural Science Foundation of ChinaNational Natural Science Foundation of China [NSFC,70801020, NSFC, 60773198, NSFC, 60573097, NSFC, 70572053]; Natural Science Foundation of Guangdong ProvinceNational Natural Science Foundation of Guangdong Province [7300272]; Research Foundation of Science, Technology Plan Project in Guangdong Province [2007B031403003]; Sun Yat-sen University "211 Project" Construction Projects of Phase III Key Discipline [NECT-06-0737, GDUFS(399-X3207018), GDUFS(GWQ0718)]
语种:
外文
WOS:
第一作者:
第一作者机构:[1]Sun Yat Sen Univ, Sch Informat Sci & Technol, Guangzhou 510275, Guangdong, Peoples R China
推荐引用方式(GB/T 7714):
Guo Weizhao,Hu Yong,Liu Mei,et al.Exploring Cost-Sensitive Learning in Domain Based Protein-Protein Interaction Prediction[J].SIXTH INTERNATIONAL SYMPOSIUM ON NEURAL NETWORKS (ISNN 2009).2009,56:175-+.
APA:
Guo, Weizhao,Hu, Yong,Liu, Mei,Yin, Jian,Xie, Kang&Yang, Xiaobo.(2009).Exploring Cost-Sensitive Learning in Domain Based Protein-Protein Interaction Prediction.SIXTH INTERNATIONAL SYMPOSIUM ON NEURAL NETWORKS (ISNN 2009),56,
MLA:
Guo, Weizhao,et al."Exploring Cost-Sensitive Learning in Domain Based Protein-Protein Interaction Prediction".SIXTH INTERNATIONAL SYMPOSIUM ON NEURAL NETWORKS (ISNN 2009) 56.(2009):175-+