Telecommunications Science, 2022, Vol. 38, Issue (1): 61-72. doi: 10.11959/j.issn.1000-0801.2022014
• Research and Development •
Yue CHEN, Yu GUO, Yuanyan XIE, Zhenqiang MI
Revised: 2021-11-19
Online: 2022-01-20
Published: 2022-01-01
Yue CHEN, Yu GUO, Yuanyan XIE, Zhenqiang MI. Offline visual aid system for the blind based on image captioning[J]. Telecommunications Science, 2022, 38(1): 61-72.