Telecommunications Science ›› 2022, Vol. 38 ›› Issue (7): 75-87.doi: 10.11959/j.issn.1000-0801.2022139

• Research and Development • Previous Articles     Next Articles

Feature enhancement and bilinear feature vector fusion for text detection of mobile industrial containers

Haiyang HU1,2, Zepin LI1,2, Zhongjin LI1,2   

  1. 1 School of Computer Science and Technology, Hangzhou Dianzi University, Hangzhou 310018, China
    2 Key Laboratory of Brain Machine Collaborative Intelligence of Zhejiang Province, Hangzhou 310018, China
  • Revised:2022-06-10 Online:2022-07-20 Published:2022-07-01
  • Supported by:
    The National Natural Science Foundation of China(61572162);The National Natural Science Foundation of China(61802095);The Zhejiang Provincial Key Science and Technology Project(2018C01012);The Zhejiang Provincial National Science Foundation of China(LQ17F020003)

Abstract:

In the real factory environment, due to factors such as dim light, irregular text, and limited equipment, text detection becomes a challenging task.Aiming at this problem, a feature vector fusion module based on bilinear operation was designed and combined with feature enhancement and semi-convolution to form a lightweight text detection network RGFFD (ResNet18 + Ghost Module + FPEM(feature pyramid enhancement module)) + FFM(feature fusion module) + DB (differentiable binarization)).Among them, the Ghost module was embedded with a feature enhancement module to improve the feature extraction capability, the bilinear feature vector fusion module fused multi-scale information, and an adaptive threshold segmentation algorithm was added to improve the segmentation capability of the DB module.In the real industrial environment, the RGFFD detection speed reached 6.5 f/s, when using the embedded device UP2 board for text detection of container numbers.At the same time, the detection speed on the public datasets ICDAR2015 and Total-text reached 39.6 f/s and 49.6 f/s, respectively.The accuracy rate on the custom dataset reached 88.9%, and the detection speed was 30.7 f/s.

Key words: text detection, semi-convolution, feature vector fusion, feature enhancement, feature fusion

CLC Number: 

No Suggested Reading articles found!