机构:[1]School of Computing, University of Southern Mississippi, Hattiesburg, MS39406, USA[2]Cooperative Innovation Center of Internet Healthcare, School ofInformation & Engineering, Zhengzhou University, Zhengzhou 450000, China[3]Department of Big Medical Data, Health Construction Administration Center,The Second Affiliated Hospital of Guangzhou University of Chinese Medicine,Guangzhou, China广东省中医院大德路总院[4]Division of Bioinformatics and Biostatistics, NationalCenter for Toxicological Research, US Food and Drug Administration (FDA),Jefferson, AR 72079, USA[5]Environmental Lab, US Army Engineer Researchand Development Center, Vicksburg, MS 39180, USA
Background: Multi-label classification of data remains to be a challenging problem. Because of the complexity of the data, it is sometimes difficult to infer information about classes that are not mutually exclusive. For medical data, patients could have symptoms of multiple different diseases at the same time and it is important to develop tools that help to identify problems early. Intelligent health risk prediction models built with deep learning architectures offer a powerful tool for physicians to identify patterns in patient data that indicate risks associated with certain types of chronic diseases. Results: Physical examination records of 110,300 anonymous patients were used to predict diabetes, hypertension, fatty liver, a combination of these three chronic diseases, and the absence of disease (8 classes in total). The dataset was split into training (90%) and testing (10%) sub-datasets. Ten-fold cross validation was used to evaluate prediction accuracy with metrics such as precision, recall, and F-score. Deep Learning (DL) architectures were compared with standard and state-of-the-art multi-label classification methods. Preliminary results suggest that Deep Neural Networks (DNN), a DL architecture, when applied to multi-label classification of chronic diseases, produced accuracy that was comparable to that of common methods such as Support Vector Machines. We have implemented DNNs to handle both problem transformation and algorithm adaption type multi-label methods and compare both to see which is preferable. Conclusions: Deep Learning architectures have the potential of inferring more information about the patterns of physical examination data than common classification methods. The advanced techniques of Deep Learning can be used to identify the significance of different features from physical examination data as well as to learn the contributions of each feature that impact a patient's risk for chronic diseases. However, accurate prediction of chronic disease risks remains a challenging problem that warrants further studies.
基金:
USA DOD MD5i-USM-1704-001
grant and by the Frontier and Key Technology Innovation Special Grant of
Guangdong Province, China (No. 2014B010118005).
语种:
外文
被引次数:
WOS:
PubmedID:
中科院(CAS)分区:
出版当年[2016]版:
大类|3 区生物
小类|2 区数学与计算生物学3 区生化研究方法3 区生物工程与应用微生物
最新[2025]版:
大类|4 区生物学
小类|3 区生物工程与应用微生物4 区生化研究方法4 区数学与计算生物学
JCR分区:
出版当年[2015]版:
Q1MATHEMATICAL & COMPUTATIONAL BIOLOGYQ2BIOTECHNOLOGY & APPLIED MICROBIOLOGYQ3BIOCHEMICAL RESEARCH METHODS
最新[2023]版:
Q1MATHEMATICAL & COMPUTATIONAL BIOLOGYQ2BIOCHEMICAL RESEARCH METHODSQ2BIOTECHNOLOGY & APPLIED MICROBIOLOGY