应用科学学报 ›› 2020, Vol. 38 ›› Issue (6): 976-985.doi: 10.3969/j.issn.0255-8297.2020.06.014

• 信号与信息处理 • 上一篇    

基于神经机器翻译的文本隐写方法

尉爽生1, 杨忠良1, 江旻宇2, 黄永峰1   

  1. 1. 清华大学 电子工程系, 北京 100084;
    2. 莱斯大学 计算机科学系, 美国 得克萨斯州 77005
  • 收稿日期:2019-11-29 发布日期:2020-12-08
  • 通信作者: 黄永峰,教授,博导,研究方向为互联网及多媒体通信安全.E-mail:yfhuang@mail.tsinghua.edu.cn E-mail:yfhuang@mail.tsinghua.edu.cn
  • 基金资助:
    国家重点研发计划基金(No.SQ2018YGX210002);国家自然科学基金(No.U1536201,No.U1636113,No.U1705261)资助

Text Steganography Based on Neural Machine Translation

YU Shuangsheng1, YANG Zhongliang1, JIANG Minyu2, HUANG Yongfeng1   

  1. 1. Department of Electronic Engineering, Tsinghua University, Beijing 100084, China;
    2. Department of Computer Science, Rice University, Texas 77005, United States
  • Received:2019-11-29 Published:2020-12-08

摘要: 深度学习促进了自然语言处理技术的发展,基于文本生成的信息隐藏方法表现出了巨大的潜力.为此,提出了一种基于神经机器翻译的文本信息隐藏方法,在翻译文本生成过程中进行信息嵌入.神经机器翻译模型使用集束搜索(Beam Search)解码器,在翻译过程中通过Beam Search得到目标语言序列各位置上的候选单词集合,并将候选单词依据概率排序进行编码;然后在解码输出目标语言文本的过程中,根据秘密信息的二进制比特流选择对应编码的候选单词,实现以单词为单位的信息嵌入.实验结果表明,与已有的基于机器翻译的文本信息隐藏方法相比,该方法在隐藏容量方面明显提升,并且具有良好的抗隐写检测性和安全性.

关键词: 文本隐写, 信息隐藏, 神经网络, 机器翻译

Abstract: Deep learning has promoted the development of natural language processing technology, and the information hiding methods based on text generation show great potential in this area. This paper proposes a text information hiding method based on neural machine translation, which embeds information in the process of generating translated text. The neural machine translation model uses a Beam Search decoder, which is used to obtain the candidate words in the sentence sequence in the translation process, and to encode the candidate words according to the probability ranking. Then, in the process of decoding and outputting target language texts, corresponding encoded candidate words are selected according to the binary bitstream of secret information, so as to realize information embedding at word level. Experimental results show that compared with the existing text information hiding methods based on machine translation, this method significantly improves the rate of information embedding, and shows good capability and security in anti-steganography performance.

Key words: text steganography, information hiding, neural network, machine translation

中图分类号: