Background: Gender information frequently appears in the eligibility criteria of clinical trial text and is essential for recruiting the participant population. However, current eligibility criteria text is often incomplete or ambiguous when describing the transgender population, which makes recruiting transgender participants for clinical trial studies difficult or causes it to fail.

Methods: A new gender model is proposed to provide a comprehensive specification of transgender requirements. In addition, an automated approach is developed to extract and summarize gender requirements from unstructured text in accordance with the gender model. The approach consists of two modules: 1) a feature extraction module, which identifies and extracts gender features using heuristic rules and automatically generated patterns, and 2) a feature summarization module, which summarizes gender requirements by relation inference.

Results: Based on 100,134 clinical trials from ClinicalTrials.gov, our approach was compared with 20 commonly applied machine learning methods. It achieved a macro-averaged precision of 0.885, a macro-averaged recall of 0.871, and a macro-averaged F1-measure of 0.878. The results show that our approach outperformed all baseline methods in terms of both commonly used metrics and macro-averaged metrics.

Conclusions: This study presented a new gender model that aims to specify transgender requirements more precisely. We also proposed an approach for extracting and summarizing gender information from unstructured clinical text to improve the recruitment of transgender-related clinical trial populations. The experimental results demonstrated that the approach was effective in extracting and summarizing transgender criteria.
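For readers unfamiliar with the macro-averaged metrics reported above, the following is a minimal sketch of how they can be computed. It is not the authors' evaluation code: the function name `macro_scores` and the class labels in the usage note are hypothetical, and the F1 definition used here (the harmonic mean of macro-averaged precision and macro-averaged recall) is an assumption, chosen because it is numerically consistent with the reported figures (0.885 and 0.871 give about 0.878).

```python
from collections import Counter


def macro_scores(gold, pred):
    """Return (macro precision, macro recall, macro F1) for two label lists.

    Assumption: macro F1 is the harmonic mean of macro precision and
    macro recall; other definitions average per-class F1 instead.
    """
    labels = sorted(set(gold) | set(pred))
    tp, fp, fn = Counter(), Counter(), Counter()
    for g, p in zip(gold, pred):
        if g == p:
            tp[g] += 1          # correct prediction for class g
        else:
            fp[p] += 1          # predicted class p wrongly
            fn[g] += 1          # missed an instance of class g

    def ratio(num, den):
        # A class with an empty denominator contributes 0.0.
        return num / den if den else 0.0

    precisions = [ratio(tp[c], tp[c] + fp[c]) for c in labels]
    recalls = [ratio(tp[c], tp[c] + fn[c]) for c in labels]

    # Macro averaging: every class counts equally, regardless of its size.
    macro_p = sum(precisions) / len(labels)
    macro_r = sum(recalls) / len(labels)
    macro_f1 = ratio(2 * macro_p * macro_r, macro_p + macro_r)
    return macro_p, macro_r, macro_f1
```

Calling `macro_scores(["male", "female", "transgender", "female"], ["male", "female", "female", "female"])` on these made-up labels shows how a single missed minority-class instance lowers the macro averages, since each class is weighted equally.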
First author affiliation: [1] Guangdong Univ Foreign Studies, Sch Informat Sci & Technol, Guangzhou, Guangdong, Peoples R China
Corresponding author:
Corresponding author affiliations: [*1] Guangdong Univ Foreign Studies, Sch Business, Guangzhou, Guangdong, Peoples R China; [2] Guangdong Univ Foreign Studies, Sch Business, Guangzhou, Guangdong, Peoples R China; [*2] Guangzhou Univ Chinese Med, Affiliated Hosp 2, Guangzhou, Guangdong, Peoples R China; [3] Guangzhou Univ Chinese Med, Affiliated Hosp 2, Guangzhou, Guangdong, Peoples R China; [*3] South China Normal Univ, Sch Comp Sci, Guangzhou, Guangdong, Peoples R China; [4] South China Normal Univ, Sch Comp Sci, Guangzhou, Guangdong, Peoples R China
Recommended citation (GB/T 7714):
Chen Boyu, Jin Hao, Yang Zhiwen, et al. An approach for transgender population information extraction and summarization from clinical trial text[J]. BMC MEDICAL INFORMATICS AND DECISION MAKING. 2019, 19: