Journal on Communications, 2023, Vol. 44, Issue 5: 28-41. doi: 10.11959/j.issn.1000-436x.2023105

• Topics: Multi-/Cross-Modal Semantic Communications

Survey of research on multimodal semantic communication

Zhijin QIN1, Tantan ZHAO2, Fan LI2, Xiaoming TAO1   

  1. Department of Electronic Engineering, Tsinghua University, Beijing 100084, China
  2. School of Information and Communication Engineering, Xi’an Jiaotong University, Xi’an 710049, China
  • Revised: 2023-05-06; Online: 2023-05-25; Published: 2023-05-01
  • Supported by:
    The National Natural Science Foundation of China (61925105); Tsinghua University-China Mobile Communications Group Co., Ltd. Joint Institute

Abstract:

With the convergence of artificial intelligence and communications, technologies for processing multimodal data such as text, image, audio, and video are developing rapidly. The semantic dimensions shared across modalities are being explored in depth, and the characteristics of multimodal semantic information, such as high abstraction, intelligence, and conciseness, are being fully exploited, which brings new ideas and methods to semantic communication. First, the fundamental theories and classifications of semantic communication are introduced, and the research status of single-modal semantic communication is reviewed for text, image, audio, and video in turn. Then, the research status of multimodal semantic communication is reviewed, and multimodal data fusion technology and secure semantic communication are introduced. Finally, the challenges facing multimodal semantic communication are summarized.

Key words: semantic communication, multimodal data fusion, multimodal semantic communication

