通信学报 ›› 2023, Vol. 44 ›› Issue (5): 28-41.doi: 10.11959/j.issn.1000-436x.2023105

• 专题:多/跨模态语义通信 • 上一篇    下一篇

多模态语义通信研究综述

秦志金1, 赵菼菼2, 李凡2, 陶晓明1   

  1. 1 清华大学电子工程系,北京 100084
    2 西安交通大学信息与通信工程学院,陕西 西安 710049
  • 修回日期:2023-05-06 出版日期:2023-05-25 发布日期:2023-05-01
  • 作者简介:秦志金(1989- ),女,山西太原人,博士,清华大学副教授、博士生导师,主要研究方向为语义通信等
    赵菼菼(1991- ),女,甘肃陇南人,西安交通大学博士生,主要研究方向为无线安全传输、移动边缘计算、深度强化学习、联邦学习等
    李凡(1981- ),男,陕西宝鸡人,博士,西安交通大学教授、博士生导师,主要研究方向为基于深度学习的图像视频编码、基于机器学习的图像视频质量评价、图像视频的深度理解和处理等
    陶晓明(1981- ),女,河北石家庄人,博士,清华大学教授、博士生导师,主要研究方向为无线多媒体通信理论及关键技术应用等
  • 基金资助:
    国家自然科学基金资助项目(61925105);清华大学-中国移动联合研究院基金资助项目

Survey of research on multimodal semantic communication

Zhijin QIN1, Tantan ZHAO2, Fan LI2, Xiaoming TAO1   

  1. 1 Department of Electronic Engineering, Tsinghua University, Beijing 100084, China
    2 School of Information and Communication Engineering, Xi’an Jiaotong University, Xi’an 710049, China
  • Revised:2023-05-06 Online:2023-05-25 Published:2023-05-01
  • Supported by:
    The National Natural Science Foundation of China(61925105);Tsinghua University-China Mobile Com-munications Group Co., Ltd.Joint Institute

摘要:

随着人工智能与通信的交叉融合,文本、图像、音频、视频等多模态数据处理技术蓬勃发展,模态语义的共享维度被深度挖掘,多模态语义信息的高度抽象、智能简约等特性被充分利用,为语义通信带来了全新的思路和手段。首先,介绍了语义通信的基础理论和分类,分别针对文本、图像、音频、视频综述了单模态语义通信的研究现状;然后,综述了多模态语义通信的研究现状,介绍了多模态数据融合技术和安全语义通信的研究;最后,总结了多模态语义通信面临的挑战。

关键词: 语义通信, 多模态数据融合, 多模态语义通信

Abstract:

With the cross-integration of artificial intelligence and communications, technologies for processing multimodal data such as text, image, audio, and video are booming, the shared dimension of modal semantics is deeply excavated, and the characteristics of multimodal semantic information such as high abstraction, intelligence and simplicity are being fully utilized, which brings new ideas and means to semantic communications.First, the fundamental theories and classifications of semantic communication were introduced, and the research status of single-modal semantic communication was reviewed for text, image, audio, and video respectively.Then, the research status of multimodal semantic communication was reviewed, and multimodal data fusion technology and secure semantic communication were introduced.Finally, the challenges faced by multimodal semantic communication were summarized.

Key words: semantic communication, multimodal data fusion, multimodal semantic communication

中图分类号: 

No Suggested Reading articles found!