智能科学与技术学报 ›› 2023, Vol. 5 ›› Issue (1): 58-68.doi: 10.11959/j.issn.2096-6652.202307

• 学术论文 • 上一篇    下一篇

基于显著性信息的Fit CutMix数据增强算法在医学影像上的应用

罗欣欢1,2, 王奕璇1,2, 李炜1,2, 陈曦1,2   

  1. 1 华中科技大学人工智能与自动化学院,湖北 武汉 430074
    2 图像信息处理与智能控制教育部重点实验室,湖北 武汉 430074
  • 修回日期:2023-03-03 出版日期:2023-03-15 发布日期:2023-03-01
  • 作者简介:罗欣欢(1998- ),女,华中科技大学人工智能与自动化学院硕士生,主要研究方向为基于数据增强的阿尔茨海默病辅助诊断
    王奕璇(1998- ),男,华中科技大学人工智能与自动化学院硕士生,主要研究方向为基于深度学习的阿尔茨海默病早期诊断与转换预测
    李炜(1975- ),女,华中科技大学人工智能与自动化学院教授,主要研究方向为神经信号分析、神经可塑性、脑网络分析和神经解码
    陈曦(1975- ),男,华中科技大学人工智能与自动化学院教授,主要研究方向为信号处理与分析、系统建模与仿真、复杂网络等
  • 基金资助:
    国家自然科学基金资助项目(61473131)

Application of Fit CutMix data augmentation algorithm based on saliency information in medical images

Xinhuan LUO1,2, Yixuan WANG1,2, Wei LI1,2, Xi CHEN1,2   

  1. 1 School of Artificial Intelligence and Automation, Huazhong University of Science and Technology, Wuhan 430074, China
    2 Key Laboratory of Image Processing and Intelligent Control, Ministry of Education, Huazhong University of Science and Technology, Wuhan 430074, China
  • Revised:2023-03-03 Online:2023-03-15 Published:2023-03-01
  • Supported by:
    The National Natural Science Foundation of China(61473131)

摘要:

深度卷积神经网络是图像分类领域的主流算法之一,但是其训练需要大量标注数据,在阿尔茨海默病医学影像等小数据集上易出现过拟合现象。数据增强算法可以用于扩充数据量,其中 CutMix 数据增强算法近来被广泛应用,但是现有方法生成的增强图像往往忽略原始图像显著区域,且增强图像的标签设计考虑的因素较为单一。针对这些问题,提出Fit CutMix数据增强算法。该算法一是利用基于显著性极值迁移的区域替换策略生成增强样本,集中源样本与目标样本中显著性高的区域;二是综合源样本与目标样本的面积和显著性信息赋予增强样本标签,为卷积神经网络提供有效的监督信息。实验结果表明,将Fit CutMix数据增强算法用于ResNet50对阿尔茨海默病进行诊断时,准确率达96.6%,比直接使用ResNet50提高了约7%,且比应用现有数据增强算法至少提高3%,可见Fit CutMix数据增强算法可以有效提高深度卷积神经网络对医学影像识别的准确率。

关键词: 数据增强, 卷积神经网络, 阿尔茨海默病, 显著性检测

Abstract:

Deep convolutional neural network is one of the mainstream algorithms in the field of image classification, but its training requires a large number of labeled data, which leads to over fitting on small datasets such as Alzheimer's medical images.Data augmentation can increase the amount of training data, and CutMix data augmentation algorithm has been widely used recently.However, the augmented images generated by the CutMix series methods often ignore the significant area of the original image, and the design of the label of the augmented image takes only single factor into consideration.In order to solve these problems, the Fit CutMix data augmentation algorithm was proposed.Firstly, the region replacement strategy based on the transfer of saliency extreme value was used to generate augmented samples, so as to concentrate the regions with high saliency value in the source samples and target samples.Secondly, the area and saliency information of the source samples and the target samples were combined to assign the augmented sample label, which provided effective supervision information for the convolutional neural network.The experimental results showed that when Fit CutMix was used in ResNet50 to diagnose Alzheimer's disease, the accuracy was 96.6%, which was about 7% higher than that of directly using ResNet50, and at least 3% higher than that of applying existing methods.Therefore, the Fit CutMix data augmentation algorithm can effectively improve the recognition accuracy of deep convolutional neural network for medical images.

Key words: data augmentation, convolutional neural network, Alzheimer's disease, saliency detection

中图分类号: 

No Suggested Reading articles found!