Chinese Journal on Internet of Things ›› 2023, Vol. 7 ›› Issue (2): 76-87.doi: 10.11959/j.issn.2096-3750.2023.00337
• Theory and Technology • Previous Articles Next Articles
Wenchang LIU1, Yun WEI1, Haoxuan YUAN2, Yue GAO2
Revised:
2023-03-07
Online:
2023-06-30
Published:
2023-06-01
Supported by:
CLC Number:
Wenchang LIU, Yun WEI, Haoxuan YUAN, Yue GAO. Research on medical small sample data classification based on SMOTE and gcForest[J]. Chinese Journal on Internet of Things, 2023, 7(2): 76-87.
"
数据集 | 指标 | LR | SVM | gcForest | TBDForest | cgicForest |
Diabetes | ACC | 0.776 0 | 0.760 4 | 0.781 2 | 0.786 5 | 0.802 1 |
PRE | 0.759 9 | 0.729 2 | 0.756 9 | 0.756 9 | 0.779 7 | |
REC | 0.699 6 | 0.696 5 | 0.720 3 | 0.745 3 | 0.752 6 | |
F1 | 0.714 7 | 0.707 0 | 0.732 4 | 0.750 4 | 0.763 0 | |
AUC | 0.849 9 | 0.805 1 | 0.853 4 | 0.838 5 | 0.857 1 | |
ECG | ACC | 0.708 0 | 0.699 1 | 0.761 1 | 0.769 9 | 0.805 3 |
PRE | 0.708 5 | 0.698 6 | 0.761 8 | 0.771 9 | 0.815 9 | |
REC | 0.701 1 | 0.692 9 | 0.756 0 | 0.764 2 | 0.797 0 | |
F1 | 0.702 0 | 0.693 7 | 0.757 3 | 0.765 8 | 0.799 6 | |
AUC | 0.767 0 | 0.798 2 | 0.810 8 | 0.814 6 | 0.822 2 | |
Cirrhosis | ACC | 0.781 0 | 0.742 9 | 0.752 4 | 0.752 4 | 0.763 3 |
PRE | 0.781 5 | 0.715 7 | 0.736 9 | 0.732 2 | 0.753 9 | |
REC | 0.713 8 | 0.691 4 | 0.685 4 | 0.692 0 | 0.692 0 | |
F1 | 0.728 9 | 0.699 2 | 0.696 7 | 0.702 5 | 0.702 5 | |
AUC | 0.807 2 | 0.755 2 | 0.759 3 | 0.756 8 | 0.788 4 |
"
数据集 | 指标 | LR | SVM | gcForest | TBDForest | cgicForest |
Diabetes | ACC | 0.804 0 | 0.868 0 | 0.868 0 | 0.873 6 | 0.884 0 |
PRE | 0.804 1 | 0.869 0 | 0.870 7 | 0.873 7 | 0.885 8 | |
REC | 0.804 1 | 0.868 2 | 0.868 3 | 0.870 3 | 0.884 3 | |
F1 | 0.804 0 | 0.867 9 | 0.867 8 | 0.872 9 | 0.883 9 | |
AUC | 0.876 2 | 0.927 0 | 0.933 9 | 0.934 2 | 0.938 7 | |
ECG | ACC | 0.780 5 | 0.772 4 | 0.821 1 | 0.826 7 | 0.837 4 |
PRE | 0.793 4 | 0.779 3 | 0.832 5 | 0.835 6 | 0.844 9 | |
REC | 0.791 0 | 0.780 2 | 0.831 3 | 0.835 8 | 0.846 0 | |
F1 | 0.780 4 | 0.772 3 | 0.821 1 | 0.824 9 | 0.837 4 | |
AUC | 0.840 6 | 0.869 8 | 0.902 9 | 0.904 1 | 0.9 080 | |
Cirrhosis | ACC | 0.715 3 | 0.766 4 | 0.832 1 | 0.835 3 | 0.846 7 |
PRE | 0.712 9 | 0.772 7 | 0.831 8 | 0.835 8 | 0.846 4 | |
REC | 0.718 8 | 0.772 0 | 0.834 0 | 0.835 9 | 0.848 7 | |
F1 | 0.715 3 | 0.766 4 | 0.831 8 | 0.833 9 | 0.846 4 | |
AUC | 0.796 2 | 0.834 4 | 0.913 0 | 0.914 7 | 0.918 1 |
[1] | 周志华 . 机器学习[M]. 北京: 清华大学出版社, 2016. |
ZHOU Z H . Machine learning[M]. Beijing: Tsinghua University Publishing House, 2016. | |
[2] | CHEN M , SHI X B , ZHANG Y ,et al. Deep feature learning for medical image analysis with convolutional autoencoder neural network[J]. IEEE Transactions on Big Data, 2021,7(4): 750-758. |
[3] | HU G S , PENG X J , YANG Y X ,et al. Frankenstein:learning deep face representations using small data[J]. IEEE Transactions on Image Processing:a Publication of the IEEE Signal Processing Society, 2018,27(1): 293-303. |
[4] | 李春生, 曹琦, 于澍 . 针对小规模数据集的多模型融合算法研究[J]. 计算机技术与发展, 2020,30(2): 63-66. |
LI C S , CAO Q , YU S . Research on multi-model fusion algorithm for small scale data sets[J]. Computer Technology and Development, 2020,30(2): 63-66. | |
[5] | 薛参观, 燕雪峰 . 基于改进深度森林算法的软件缺陷预测[J]. 计算机科学, 2018,45(8): 160-165. |
XUE C G , YAN X F . Software defect prediction based on improved deep forest algorithm[J]. Computer Science, 2018,45(8): 160-165. | |
[6] | ZHOU Z H , FENG J . Deep forest:towards an alternative to deep neural networks[EB]. 2017. |
[7] | 何宏, 陈叔达 . 面部表情的深度卷积级联森林识别[J]. 小型微型计算机系统, 2021,42(4): 805-809. |
HE H , CHEN S D . Deep convolutional cascade forest for facial expression recognition[J]. Journal of Chinese Computer Systems, 2021,42(4): 805-809. | |
[8] | 颜建军, 刘章鹏, 刘国萍 ,等. 基于深度森林算法的慢性胃炎中医证候分类[J]. 华东理工大学学报(自然科学版), 2019,45(4): 593-599. |
YAN J J , LIU Z P , LIU G P ,et al. Syndrome classification of chronic gastritis based on multi-grained cascade forest[J]. Journal of East China University of Science and Technology, 2019,45(4): 593-599. | |
[9] | CHEN Z H , LI L P , HE Z ,et al. An improved deep forest model for predicting self-interacting proteins from protein sequence using wavelet transformation[J]. Frontiers in Genetics, 2019(10): 90. |
[10] | UTKIN L , KONSTANTINOV A , MELDO A ,et al. A deep forest improvement by using weighted schemes[C]// Proceedings of 2019 24th Conference of Open Innovations Association (FRUCT). Piscataway:IEEE Press, 2019: 451-456. |
[11] | GUO Y , LIU S H , LI Z H ,et al. BCDForest:a boosting cascade deep forest model towards the classification of cancer subtypes based on gene expression data[J]. BMC Bioinformatics, 2018,19(Suppl 5): 118. |
[12] | HUANG G , LIU Z , VAN DER MAATEN L ,et al. Densely connected convolutional networks[C]// Proceedings of 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Piscataway:IEEE Press, 2017: 2261-2269. |
[13] | WANG H Y , TANG Y , JIA Z Y ,et al. Dense adaptive cascade forest:a self-adaptive deep ensemble for classification problems[J]. Soft Computing, 2020,24(4): 2955-2968. |
[14] | UTKIN L V , RYABININ M A . A Siamese deep forest[J]. Knowledge-Based Systems, 2018,139: 13-22. |
[15] | FAN Y M , QI L , TIE Y . The cascade improved model based deep forest for small-scale datasets classification[C]// Proceedings of 2019 8th International Symposium on Next Generation Electronics (ISNE). Piscataway:IEEE Press, 2019: 1-3. |
[16] | LIU H , ZHANG N , JIN S G ,et al. Small sample color fundus image quality assessment based on gcforest[J]. Multimedia Tools and Applications, 2021,80(11): 17441-17459. |
[17] | 刘超, 吴申, 郑一超 ,等. 基于深度森林和DNA甲基化的癌症分类研究[J]. 计算机工程与应用, 2020,56(13): 189-193. |
LIU C , WU S , ZHENG Y C ,et al. Classification of cancer based on deep forest and DNA methylation[J]. Computer Engineering and Applications, 2020,56(13): 189-193. | |
[18] | XU Z Z . A cluster-based oversampling algorithm combining SMOTE and k-means for imbalanced medical data[J]. Information Sciences, 2021(572): 574-589. |
[19] | PEREIRA R M , COSTA Y M G , SILLA C N Jr . MLTL:a multi-label approach for the Tomek Link under sampling algorithm[J]. Neurocomputing, 2020 (383): 95-105. |
[20] | YUAN Z W , ZHAO P . An improved ensemble learning for imbalanced data classification[C]// Proceedings of 2019 IEEE 8th Joint International Information Technology and Artificial Intelligence Conference (ITAIC). Piscataway:IEEE Press, 2019: 408-411. |
[21] | REN X Y , YUAN Z Y , HUANG J M . Research on fake reviews detection based on feature construction and Easy Ensemble-RF[C]// Proceedings of 2021 2nd International Conference on Artificial Intelligence and Computer Engineering (ICAICE). Piscataway:IEEE Press, 2022: 478-482. |
[22] | XU X L , CHEN W , SUN Y F . Over-sampling algorithm for imbalanced data classification[J]. Journal of Systems Engineering and Electronics, 2019,30(6): 1182-1191. |
[23] | BREIMAN L . Random forests[J]. Machine Learning, 2001,45(1): 5-32. |
[24] | GEURTS P , ERNST D , WEHENKEL L . Extremely randomized trees[J]. Machine Learning, 2006,63(1): 3-42. |
[25] | BREIMAN L . Stacked regressions[J]. Machine Learning, 1996,24(1): 49-64. |
[26] | 吴辰文, 梁靖涵, 王伟 ,等. 一种顺序响应的随机森林:变量预测和选择[J]. 小型微型计算机系统, 2017,38(8): 1762-1766. |
WU C W , LIANG J H , WANG W ,et al. Random forest algorithm for sequential response:prediction and selection of variables[J]. Journal of Chinese Computer Systems, 2017,38(8): 1762-1766. | |
[27] | 乔健, 诸佳慧, 严康桓 . 基于随机森林CART特征选择改进算法的电信客户流失预测模型[J]. 电信工程技术与标准化, 2022,35(3): 78-82. |
QIAO J , ZHU J H , YAN K H . Telecom customer churn prediction model based on improved random forest cart feature selection algorithm[J]. Telecom Engineering Technics and Standardization, 2022,35(3): 78-82. | |
[28] | TAHMASSEBI A , GANDOMI A H , SCHULTE M H J ,et al. Optimized naive-Bayes and decision tree approaches for fMRI smoking cessation classification[J]. Complexity, 2018: 1-24. |
[29] | HSSINA B , MERBOUHA A , EZZIKOURI H ,et al. A comparative study of decision tree ID3 and C4.5[J]. International Journal of Advanced Computer Science and Applications, 2014,4(2): 126-133. |
[30] | 李孝伟, 陈福才, 李邵梅 . 基于分类规则的C4.5决策树改进算法[J]. 计算机工程与设计, 2013,34(12): 4321-4325,4330. |
LI X W , CHEN F C , LI S M.Improved C4 . 5decision tree algorithm based on classification rules[J]. Computer Engineering and Design, 2013,34(12): 4321-4325,4330. |
[1] | Zhu WANG, Hualei ZHANG, Qianhong HU, Zhiwen YU. An environment adaptive gesture recognition system based on visible light [J]. Chinese Journal on Internet of Things, 2023, 7(2): 15-25. |
[2] | Yuting ZENG, Suzhi BI, Lili ZHENG, Xiaohui LIN, Hui WANG. Scenario-adaptive wireless fall detection system based on few-shot learning [J]. Chinese Journal on Internet of Things, 2023, 7(2): 118-132. |
[3] | Kai JIANG, Yue CAO, Huan ZHOU, Xuefeng REN, Yongdong ZHU, Hai LIN. Edge intelligence empowered internet of vehicles: concept, framework, issues, implementation, and prospect [J]. Chinese Journal on Internet of Things, 2023, 7(1): 37-48. |
[4] | Peiyan YUAN, Saike SHAO, Ran WEI, Junna ZHANG, Xiaoyan ZHAO. Research on the cooperative offloading strategy of sensory data based on delay and energy constraints [J]. Chinese Journal on Internet of Things, 2023, 7(1): 109-117. |
[5] | Lin SU, Xiaochao DANG, Zhanjun HAO, Chunrui RU, Xu SHANG. Dynamic adaptive offloading method based on WPT-MEC [J]. Chinese Journal on Internet of Things, 2022, 6(4): 128-138. |
[6] | Jiahui GUO, Zhuoyue CHEN, Wei GAO, Xijun WANG, Xinghua SUN, Lin GAO. Clients selection method based on knapsack model in federated learning [J]. Chinese Journal on Internet of Things, 2022, 6(4): 158-168. |
[7] | Yangqun LI, Dengyin ZHANG. Research and application of Web of things resource management framework [J]. Chinese Journal on Internet of Things, 2022, 6(2): 50-64. |
[8] | Dan LUO, Ruzhi XU, Zhitao GUAN. Differential privacy budget optimization based on deep learning in IoT [J]. Chinese Journal on Internet of Things, 2022, 6(2): 65-76. |
[9] | Fenfang LI, Xiaochao DANG, Zhanjun HAO. Weighted mixed regression localization method based on three-dimensional Voronoi diagram division [J]. Chinese Journal on Internet of Things, 2022, 6(2): 106-116. |
[10] | Juan FANG, Zhiyuan YE, Mengyuan ZHANG, Jiamei SHI, Ziyi TENG. Research on elite hierarchical task offloading strategy based on reinforcement learning in edge-cloud collaboration scenario [J]. Chinese Journal on Internet of Things, 2022, 6(1): 91-100. |
[11] | Yi ZHOU, Shuting HU, Wei LI, Nan CHENG, Ning LU, Xuemin(Sherman) SHEN. Graph neural network driven traffic prediction technology:review and challenge [J]. Chinese Journal on Internet of Things, 2021, 5(4): 1-16. |
[12] | Jiahua MA, Xinghua SUN, Wenchao XIA, Xijun WANG, Hongzhou TAN, Hongbo ZHU. Node selection based on label quantity information in federated learning [J]. Chinese Journal on Internet of Things, 2021, 5(4): 46-53. |
[13] | Minmin MAO, Jiaqi JU, Yuling OUYANG, Yan JIN. Design and implementation of NB-IoT based environmental temperature and humidity monitoring system [J]. Chinese Journal on Internet of Things, 2021, 5(4): 99-106. |
[14] | Ganlin ZHAO, Chang YU, Jianfu ZHANG, Jianxin YANG, Pingfa FENG, Qun SHEN. Inspection method for cable assembly quality based on AR virtual-real image attention mechanism [J]. Chinese Journal on Internet of Things, 2021, 5(3): 27-38. |
[15] | Jinying CAI, Feng XIANG, Ying ZUO, Lei ZHONG, Ping ZHOU. Research on a lightweight framework of industrial Internet-oriented manufacturing service collaboration level agreement [J]. Chinese Journal on Internet of Things, 2021, 5(3): 49-55. |
Viewed | ||||||
Full text |
|
|||||
Abstract |
|
|||||
|