Journal on Communications ›› 2024, Vol. 45 ›› Issue (2): 213-224. doi: 10.11959/j.issn.1000-436x.2024033

• Correspondences

Digital watermarking method based on context word prediction and window compression coding

Lingyun XIANG1,2, Minghao HUANG1, Chenling ZHANG1, Chunfang YANG3   

  1 School of Computer and Communication Engineering, Changsha University of Science and Technology, Changsha 410114, China
  2 Hunan Provincial Key Laboratory of Intelligent Processing of Big Data on Transportation, Changsha University of Science and Technology, Changsha 410114, China
  3 Henan Key Laboratory of Cyberspace Situation Awareness, Information Engineering University, Zhengzhou 450001, China
  • Revised: 2023-11-19 Online: 2024-02-01 Published: 2024-02-01
  • Supported by:
    The National Natural Science Foundation of China (61972057); The National Natural Science Foundation of China (61872448); The Natural Science Foundation of Hunan Province (2022JJ30623)

Abstract:

To address the limited number of substitutable words and the low watermark extraction efficiency of existing natural language digital watermarking methods, a novel method based on context word prediction and window compression coding was proposed. Firstly, the contextual semantic features of each word in the original text were learned automatically by a neural network language model, and a candidate word set was then predicted for each word, which expanded the number of substitutable words available for carrying watermark information. Meanwhile, because substituting candidate words at different positions affects the semantics to different degrees, the watermark information was embedded into windows that each contained several words, and the choice of candidate words for embedding was optimized according to the similarity between the sentences before and after substitution. Finally, a semantic-independent window compression coding method was proposed, which encoded each window as the designated watermark information according to the character information of the words it contained, so that watermark extraction no longer depended on the original context at the substitution positions. The experimental results show that the proposed method greatly improves watermark extraction efficiency while maintaining high embedding capacity and text quality.
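
The window-based idea in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation, and every name in it is hypothetical. Three simplifications are assumed: the candidate word sets are supplied by hand rather than predicted by a neural network language model; the window code is a toy one (the parity of the total character count of the window) standing in for the paper's window compression coding; and "fewest substitutions" stands in for the sentence-similarity criterion. What the sketch does show is the property stressed in the abstract: extraction recomputes each window's code from the characters of the watermarked words alone, with no access to the original context.

```python
from itertools import product


def window_bit(window):
    """Toy window code (assumption): parity of the total character count."""
    return sum(len(word) for word in window) % 2


def embed(words, bits, candidates, size=3):
    """Embed one bit per window of `size` words via word substitution.

    `candidates` maps a word to its allowed substitutes; in the paper these
    come from a language model, here they are given by hand.
    """
    if len(bits) > len(words) // size:
        raise ValueError("not enough windows for the watermark")
    out = list(words)
    for k, bit in enumerate(bits):
        start = k * size
        window = words[start:start + size]
        # Each position may keep its word or take one of its candidates.
        options = [[w] + [c for c in candidates.get(w, []) if c != w]
                   for w in window]
        best = None
        for combo in product(*options):
            if window_bit(combo) != bit:
                continue
            # Stand-in for sentence similarity: prefer fewer substitutions.
            changes = sum(a != b for a, b in zip(combo, window))
            if best is None or changes < best[0]:
                best = (changes, list(combo))
        if best is None:
            raise ValueError(f"window {k} cannot encode bit {bit}")
        out[start:start + size] = best[1]
    return out


def extract(words, n_bits, size=3):
    """Recover the bits from the watermarked text only."""
    return [window_bit(words[k * size:(k + 1) * size]) for k in range(n_bits)]


text = ["the", "quick", "fox", "runs", "over", "hills"]
cands = {"quick": ["fast", "rapid"], "runs": ["sprints", "goes"],
         "hills": ["mounds"]}
marked = embed(text, [0, 1], cands)
print(marked)              # ['the', 'fast', 'fox', 'runs', 'over', 'hills']
print(extract(marked, 2))  # [0, 1]
```

In this example the first window changes "quick" to "fast" to flip its parity, while the second window already encodes the required bit and is left untouched. A parity code carries one bit per window and is easy to detect; it is used here only because it is easy to verify by hand.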

Key words: digital watermarking, word substitution, word prediction, watermarking coding
