Journal of University of Science and Technology of China ›› 2016, Vol. 46 ›› Issue (10): 867-873.DOI: 10.3969/j.issn.0253-2778.2016.10.011

Previous Articles     Next Articles

Insomnia discriminant analysis based on real-world clinical data

ZHU Wei   

  1. 1.College of Electronic and Information Engineering, Tongji University, Shanghai 201804, China; 2.Shanghai Menorah Information Technology Co., Ltd, Shanghai 201801, China; 3.Institute of Basic Research in Clinical Medicine, China Academy of Chinese Medical Sciences, Beijing 100700, China; 4.National Data Center of Traditional Chinese Medicine, China Academy of Chinese Medical Sciences, Beijing 100700, China
  • Received:2016-03-01 Revised:2016-09-16 Online:2016-10-31 Published:2016-10-31

Abstract: A new data preprocessing method based on the real-world medical database was proposed, which can change unstructured data into structured data. Supervised algorithms and semi-supervised algorithms were utilized to verify the effectiveness of the clinical features which were obtained through our data preprocessing method. From the experimental results on the real world dataset, it is found that both supervised classification and semi-supervised algorithms can get a better result based on the clinical symptom features trained from our data preprocessing method. And it is found that the label propagation algorithm not only achieves a great stability on the real Chinese medicine database when compared with classical classification algorithm, but also obtains good results when the ratio is low.

Key words: structurization, semi-supervised learning, label propagation algorithm, TCM, disease identification, insomnia