电信科学 ›› 2022, Vol. 38 ›› Issue (7): 75-87.doi: 10.11959/j.issn.1000-0801.2022139

• 研究与开发 • 上一篇    下一篇

特征增强和双线性特征向量融合的移动端工业货箱文本检测

胡海洋1,2, 厉泽品1,2, 李忠金1,2   

  1. 1 杭州电子科技大学计算机学院,浙江 杭州 310018
    2 浙江省脑机协同智能重点实验室,浙江 杭州 310018
  • 修回日期:2022-06-10 出版日期:2022-07-20 发布日期:2022-07-01
  • 作者简介:胡海洋(1977- ),男,杭州电子科技大学教授,主要研究方向为机器视觉、智能制造
    厉泽品(1997- ),男,杭州电子科技大学硕士生,主要研究方向为计算机视觉、文本检测识别
    李忠金(1988- ),男,杭州电子科技大学讲师,主要研究方向为计算机视觉、移动边缘计算
  • 基金资助:
    国家自然科学基金资助项目(61572162);国家自然科学基金资助项目(61802095);浙江省重点研发计划项目(2018C01012);浙江省自然科学基金资助项目(LQ17F020003)

Feature enhancement and bilinear feature vector fusion for text detection of mobile industrial containers

Haiyang HU1,2, Zepin LI1,2, Zhongjin LI1,2   

  1. 1 School of Computer Science and Technology, Hangzhou Dianzi University, Hangzhou 310018, China
    2 Key Laboratory of Brain Machine Collaborative Intelligence of Zhejiang Province, Hangzhou 310018, China
  • Revised:2022-06-10 Online:2022-07-20 Published:2022-07-01
  • Supported by:
    The National Natural Science Foundation of China(61572162);The National Natural Science Foundation of China(61802095);The Zhejiang Provincial Key Science and Technology Project(2018C01012);The Zhejiang Provincial National Science Foundation of China(LQ17F020003)

摘要:

在实际工业环境下,光线昏暗、文本不规整、设备有限等因素,使得文本检测成为一项具有挑战性的任务。针对此问题,设计了一种基于双线性操作的特征向量融合模块,并联合特征增强与半卷积组成轻量级文本检测网络RGFFD(ResNet18+GhostModule+特征金字塔增强模块(feature pyramid enhancement module, FPEM)+ 特征融合模块(feature fusion module,FFM)+可微分二值化(differenttiable binarization,DB))。其中,Ghost模块内嵌特征增强模块,提升特征提取能力,双线性特征向量融合模块融合多尺度信息,添加自适应阈值分割算法提高DB模块分割能力。在实际工厂环境下,采用嵌入式设备UP2 board对货箱编号进行文本检测,RGFFD检测速度达到6.5 f/s。同时在公共数据集ICDAR2015、Total-text上检测速度分别达到39.6 f/s和49.6 f/s,在自定义数据集上准确率达到88.9%,检测速度为30.7 f/s。

关键词: 文本检测, 半卷积, 特征向量融合, 特征增强, 特征融合

Abstract:

In the real factory environment, due to factors such as dim light, irregular text, and limited equipment, text detection becomes a challenging task.Aiming at this problem, a feature vector fusion module based on bilinear operation was designed and combined with feature enhancement and semi-convolution to form a lightweight text detection network RGFFD (ResNet18 + Ghost Module + FPEM(feature pyramid enhancement module)) + FFM(feature fusion module) + DB (differentiable binarization)).Among them, the Ghost module was embedded with a feature enhancement module to improve the feature extraction capability, the bilinear feature vector fusion module fused multi-scale information, and an adaptive threshold segmentation algorithm was added to improve the segmentation capability of the DB module.In the real industrial environment, the RGFFD detection speed reached 6.5 f/s, when using the embedded device UP2 board for text detection of container numbers.At the same time, the detection speed on the public datasets ICDAR2015 and Total-text reached 39.6 f/s and 49.6 f/s, respectively.The accuracy rate on the custom dataset reached 88.9%, and the detection speed was 30.7 f/s.

Key words: text detection, semi-convolution, feature vector fusion, feature enhancement, feature fusion

中图分类号: 

No Suggested Reading articles found!