融合生成对抗网络和朴素贝叶斯皮肤病诊断方法
详细信息    查看全文 | 推荐本文 |
  • 英文篇名:Skin Diseases Diagnosis Method Based on Generative Adversarial Networks and Naive Bayes
  • 作者:商显震 ; 韩萌 ; 孙毓忠 ; 孙宇宁 ; 陈旭 ; 胡满 ; 梅御东
  • 英文作者:SHANG Xianzhen;HAN Meng;SUN Yuzhong;SUN Yuning;CHEN Xu;HU Manman;MEI Yudong;School of Computer Science and Engineering,North Minzu University;State Key Laboratory of Computer Architecture,Institute of Computing Technology,Chinese Academy of Sciences;National Pilot School of Software,Yunnan University;
  • 关键词:皮肤病诊断 ; 朴素贝叶斯(NB) ; 词频-逆文档频率(TF-IDF)算法 ; 生成对抗网络(GAN) ; 类不平衡数据
  • 英文关键词:skin diseases diagnosis;;naive Bayes(NB);;term frequency-inverse document frequency(TF-IDF) algorithm;;generative adversarial networks(GAN);;imbalanced data
  • 中文刊名:KXTS
  • 英文刊名:Journal of Frontiers of Computer Science and Technology
  • 机构:北方民族大学计算机科学与工程学院;中国科学院计算技术研究所计算机体系结构国家重点实验室;云南大学软件学院;
  • 出版日期:2018-07-16 16:49
  • 出版单位:计算机科学与探索
  • 年:2019
  • 期:v.13;No.129
  • 基金:国家自然科学基金Nos.61563001,U1611261;; 国家重点研发计划No.2016YFB1000505;; 计算机应用技术宁夏回族自治区重点学科项目No.PY1703~~
  • 语种:中文;
  • 页:KXTS201906012
  • 页数:11
  • CN:06
  • ISSN:11-5602/TP
  • 分类号:110-120
摘要
不同皮肤病间发病率的差异导致了皮肤病数据类不平衡现象,对使用机器学习方法构建高效、准确的皮肤病诊断模型带来了巨大挑战。提出一种融合生成对抗网络(generative adversarial networks,GAN)和朴素贝叶斯的皮肤病二分类诊断方法:在皮肤病数据集上训练朴素贝叶斯二分类器作为诊断器,创新性地使用GAN为前者生成补充训练样本,使其训练集正负类样本达到平衡。针对皮肤病诊断多分类问题,提出一种融合生成对抗网络和朴素贝叶斯的多分类诊断方法:使用GAN和朴素贝叶斯训练皮肤病单病种二分类器,并结合了词频-逆文档频率算法(term frequency-inverse document frequency,TF-IDF),将多个二分类器组合成一个多分类器作为诊断器。与六种诊断方法进行了对比实验,提出的两种皮肤病诊断方法准确率和召回率均有提升。
        Different incidences of skin diseases lead to the imbalance of skin diseases data, so that there is a great challenge to train an effective and accurate diagnosis model by machine learning. In this paper, a binary classification diagnosis method is proposed, which is a fusion of GAN(generative adversarial networks) and NB(naive Bayes)algorithm. In this method, a naive Bayes binary classifier is trained for each disease diagnosis, and a GAN model is innovatively trained to generate more samples for training the naive Bayes binary classifier, bringing positive samples and negative samples to balance. As for multiple classification of skin diseases diagnosis, a multiple classification diagnosis method is proposed, which is a fusion of GAN and NB algorithm. In this method, a binary classifier is trained by NB algorithm and GAN, with TF-IDF algorithm added in the classifier, and the multiple classifier consists of many binary classifiers. The experiments compare the proposed diagnosis methods with 6 methods. The precision and recall of the two skin diseases diagnosis methods proposed in this paper are improved.
引文
[1]Zhang X J.Dermatovenereology[M].8th ed.Beijing:People's Medical Publishing House,2013.
    [2]Naser S S A,Akkila A N.A proposed expert system for skin diseases diagnosis[J].Journal of Applied Sciences Research,2008,4(12):1682-1693.
    [3]Sun H F,Sun X L,Qi T E,et al.Diagnosing erythematosquamous disease based on hybrid particle swarm optimisation SVM[J].Computer Applications and Software,2015,32(6):192-197.
    [4]Urooj S,Singh S.A novel computer assisted approach for diagnosis of skin disease[C]//Proceedings of the International Conference on Computing for Sustainable Global Development,India,Mar 11-13,2015.Piscataway:IEEE,2015:1585-1590.
    [5]Ghanad N K,Ahmadi S.Combination of PSO algorithm and naive Bayesian classification for Parkinson disease diagnosis[J].Advances in Computer Science:An International Journal,2015,4(4):119-125.
    [6]Orphanou K,Dagliati A,Sacchi L,et al.Combining naive Bayes classifiers with temporal association rules for coronary heart disease diagnosis[C]//Proceedings of the IEEE International Conference on Healthcare Informatics,Chicago,Oct 4-7,2016.Washington:IEEE Computer Society,2016:81-92.
    [7]Jiang W Q,Shen Y F,Ding Y F,et al.A naive Bayes algorithm for tissue origin diagnosis(TOD-Bayes)of synchronous multifocal tumors in the hepatobiliary and pancreatic system[J].International Journal of Cancer,2017,142(2):357-368.
    [8]Sayin R,Keskin S,Hamamci M.Evaluation of several classification methods in carpal tunnel syndrome[J].Journal of the Pakistan Medical Association,2017,67(11):1654-1657.
    [9]Bakrawy L M E.Grey wolf optimization and naive Bayes classifier incorporation for heart disease diagnosis[J].Australian Journal of Basic&Applied Sciences,2017,11(7):64-70.
    [10]Lipton Z C,Kale D C,Elkan C,et al.Learning to diagnose with LSTM recurrent neural networks[J].ar Xiv:1511.03677,2015.
    [11]Lin D,Vasilakos A V,Tang Y,et al.Neural networks for computer-aided diagnosis in medicine:a review[J].Neurocomputing,2016,216:700-708.
    [12]Beyan C,Fisher R.Classifying imbalanced data sets using similarity based hierarchical decomposition[J].Pattern Recognition,2015,48(5):1653-1672.
    [13]Choeikiwong T,Vateekul P.Software defect prediction in imbalanced data sets using unbiased support vector machine[M]//Kim K J.Information Science and Applications.Berlin,Heidelberg:Springer-Verlag,2015.
    [14]Goodfellow I J,Pouget-Abadie J,Mirza M,et al.Generative adversarial networks[J].Advances in Neural Information Processing Systems,2014,3:2672-2680.
    [15]Alcalá-Fdez J,Fernández A,Luengo J,et al.KEEL datamining software tool:data set repository,integration of algorithms and experimental analysis framework[J].Journal of Multiple-Valued Logic&Soft Computing,2011,17:255-287.
    [16]Wu Z H.Dermatology therapeutics[M].Beijing:China Science Publishing&Media Ltd,2013.
    [1]张学军.皮肤性病学[M].8版.北京:人民卫生出版社,2013.
    [3]孙海峰,孙秀玲,齐恩铁,等.基于混合粒子群优化SVM算法的红斑鳞状皮肤病诊断[J].计算机应用与软件,2015,32(6):192-197.
    [16]吴志华.皮肤科治疗学[M].北京:科学出版社,2013.