电信科学 ›› 2023, Vol. 39 ›› Issue (10): 101-111.doi: 10.11959/j.issn.1000-0801.2023188

• 研究与开发 • 上一篇    

基于优化卷积神经网络的车辆特征识别算法研究

陈暄1, 吴吉义2,3   

  1. 1 浙江工业职业技术学院,浙江 绍兴 312000
    2 浙江省人工智能学会,浙江 杭州 310027
    3 浙江大学智能教育研究中心,浙江 杭州 310027
  • 修回日期:2023-10-10 出版日期:2023-10-01 发布日期:2023-10-01
  • 作者简介:陈暄(1979- ),男,浙江工业职业技术学院副教授,主要研究方向为云计算、人工智能
    吴吉义(1980- ),男,博士,浙江大学高级工程师,主要研究方向为服务计算、人工智能
  • 基金资助:
    国家自然科学基金资助项目(61702151);国家自然科学基金资助项目(61702320);国家自然科学基金资助项目(61772334);国家重点研发计划项目(2018YFB1003800);浙江省哲学社会科学规划课题(23NDJC369YB)

Research on vehicle feature recognition algorithm based on optimized convolutional neural network

Xuan CHEN1, Jiyi WU2,3   

  1. 1 Zhejiang Industry Polytechnic College, Shaoxing 312000, China
    2 Zhejiang Federation of Artificial Intelligence, Hangzhou 310027, China
    3 Intelligent Education Research Center, Zhejiang University, Hangzhou 310027, China
  • Revised:2023-10-10 Online:2023-10-01 Published:2023-10-01
  • Supported by:
    The National Natural Science Foundation of China(61702151);The National Natural Science Foundation of China(61702320);The National Natural Science Foundation of China(61772334);The National Key Research and Development Program of China(2018YFB1003800);Zhejiang Provincial Philosophy and Social Sciences Planning Project(23NDJC369YB)

摘要:

针对道路场景图像中不同距离目标车辆特征存在识别效果弱、精度低的问题,提出一种基于优化卷积神经网络的车辆特征识别算法。首先,采用基于PAN模型的多尺度输入获取不同距离的目标车辆特征;其次,在卷积神经网络结构中加入多池化、BN层和Leaky ReLU激活函数改进网络模型的性能,通过引入混合注意力机制,集中关注车辆图像中的重要特征和区域,从而增强了网络模型的泛化能力;最后,通过构建多层次卷积神经网络结构完成对车辆的特征效果识别。仿真实验结果表明,在单一场景的BIT-Vehicle数据库中,本文算法相比CNN、R-CNN、ABC-CNN、Faster R-CNN、AlexNet、VGG16和YOLOV8在单一目标和多目标识别率方面分别提高了 16.75%、10.9%、4%、3.7%、2.46%、1.3%、1%和 17.8%、10.5%、2.5%、3.8%、2.7%、1.1%、1.3%,在复杂场景的UA-DETRAC数据库中,本文算法相比其他算法在不同距离目标车辆识别中获得了更加精确的效果。

关键词: 车辆识别, 卷积神经网络, 多尺度输入

Abstract:

To address the issue of weak identification and low accuracy in recognizing features of target vehicles at different distances in road scene images, a vehicle feature recognition algorithm based on optimized convolutional neural network (CNN) was proposed.Firstly, a multi-scale input based on the PAN model was employed to capture target vehicle features at varying distances.Subsequently, improvements were made to the network model by incorporating multi-pool, batch normalization (BN) layers, and Leaky ReLU activation functions within the CNN architecture.Furthermore, the generalization ability of the network model was enhanced by introducing a hybrid attention mechanism that focuses on important features and regions in the vehicle image.Lastly, a multi-level CNN structure was constructed to achieve feature recognition for vehicles.Simulation experiment results conducted on the BIT-Vehicle database within a single scene show the proposed algorithm’s significant enhancements in single-object and multi-object recognition rates compared to CNN, R-CNN, ABC-CNN, Faster R-CNN, AlexNet, VGG16, and YOLOV8.Specifically, improvements of 16.75%, 10.9%, 4%, 3.7%, 2.46%, 1.3%, and 1% in single-object recognition, as well as 17.8%, 10.5%, 2.5%, 3.8%, 2.7%, 1.1%, and 1.3% in multi-object recognition, have been demonstrated by the proposed algorithm, respectively.Over the more complex UA-DETRAC datasets, more precise results have been also achieved by the proposed algorithm in recognizing target vehicles at various distances compared to other algorithms.

Key words: vehicle recognition, convolutional neural network, multi-scale input

中图分类号: 

No Suggested Reading articles found!