Journal on Communications

Special Issue: 6G

   

6G-Oriented cross-modal signal reconstruction technology

LI Ang1,2, CHEN Jianxin1,2, WEI Xin1,2, ZHOU Liang1,2   

  1. 1. College of Telecommunications & Information Engineering, Nanjing University of Posts and Telecommunications, Nanjing 210003, China 2. Key Laboratory of Broadband Wireless Communication and Sensor Network Technology (Ministry of Education), Nanjing University of Posts and Tele-communications, Nanjing 210003, China

Abstract: In the 6G era, to balance the immersive experience needs of multimedia users for audio, video, and haptics with low-latency, high-reliability, and large-capacity communication, a cross-modal signal reconstruction framework and video-to-haptic reconstruction model was proposed. First, robots were controlled to touch various materials. In this way, a large-scale dataset VisTouch that includes audio, video, and haptic signals was constructed. This dataset can lay the foun-dation for subsequent researches on various cross-modal problems. In addition, based on the semantic relations of mul-ti-modal signals, a universe and robust end-to-end cross-modal signal reconstruction framework was designed. Further-more, take the reconstruction from video to haptic signals as an example. A video-assisted haptic reconstruction model was established, including a 3D CNN based video extraction sub-network, a fully convolutional network based GAN generation sub-network and a CNN based GAN discrimination sub-network. Finally, the reliability of the cross-modal signal reconstruction framework and the accuracy of the proposed video-to-haptic model were verified through experi-mental results.

No Suggested Reading articles found!