Abstract
With the development of natural language processing, linguistic steganography has become a research hotspot in the field of information security. However, most existing linguistic steganographic methods suffer from low embedding capacity. Therefore, this paper proposes a character-level linguistic steganographic method (CLLS) that embeds the secret information into characters instead of words by employing a long short-term memory (LSTM) based language model. First, the proposed method uses the LSTM model and a large-scale corpus to construct and train a character-level text generation model; the model with the best evaluation result is kept as the prediction model for generating stego text. Then, the secret information serves as control information to select the right character from the predictions of the trained character-level text generation model. The secret information is thus hidden in the generated text, because predicted characters with different prediction probabilities can be encoded into different secret bit values. For the same secret information, the generated stego text varies with the starting string of the text generation model, so we design a selection strategy that generates a number of candidate stego texts with different starting strings and selects the highest-quality one as the final stego text. The experimental results demonstrate that, compared with other similar methods, the proposed method has the fastest running speed and the highest embedding capacity. Moreover, extensive experiments verify the effect of the number of candidate stego texts on the quality of the final stego text: the quality of the final stego text increases with the number of candidate stego texts, although the growth rate gradually slows.
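As a rough, hedged illustration of the coding idea described above, the Python sketch below embeds secret bits with a fixed-length code: at each step the next k bits index into the 2^k most probable next characters, and extraction re-ranks the same predictions to recover the bits. The predict_fn interface (prefix -> {character: probability}), the 2-bit block size, and the toy distribution are assumptions made for illustration, not the paper's exact implementation.

```python
# Minimal sketch of fixed-length character coding, assuming a character-level
# language model exposed as predict_fn(prefix) -> {char: probability}. The
# interface, the 2-bit block size and the toy model below are illustrative
# assumptions, not the paper's implementation.

def embed_bits(predict_fn, start, secret_bits, bits_per_char=2, max_len=64):
    """Generate stego text: each block of `bits_per_char` secret bits indexes
    into the 2**bits_per_char most probable next characters."""
    pool = 2 ** bits_per_char
    text, i = start, 0
    while i + bits_per_char <= len(secret_bits) and len(text) < max_len:
        probs = predict_fn(text)
        # Rank candidate characters by descending probability; ties must be
        # broken the same way on the embedding and extraction sides.
        ranked = sorted(probs, key=probs.get, reverse=True)[:pool]
        text += ranked[int(secret_bits[i:i + bits_per_char], 2)]
        i += bits_per_char
    return text


def extract_bits(predict_fn, stego, start, bits_per_char=2):
    """Recover the secret bits by re-ranking the model's predictions at each
    generated position."""
    pool = 2 ** bits_per_char
    bits, prefix = "", start
    for ch in stego[len(start):]:
        probs = predict_fn(prefix)
        ranked = sorted(probs, key=probs.get, reverse=True)[:pool]
        bits += format(ranked.index(ch), "0{}b".format(bits_per_char))
        prefix += ch
    return bits


if __name__ == "__main__":
    # Toy stand-in for the trained LSTM: a fixed distribution over 4 characters.
    toy = lambda prefix: {"e": 0.4, "t": 0.3, "a": 0.2, "o": 0.1}
    stego = embed_bits(toy, "th", "0110101101")
    assert extract_bits(toy, stego, "th") == "0110101101"
```

In practice, predict_fn would wrap the softmax output of the trained character-level LSTM over its character vocabulary; the toy lambda above only stands in so the sketch runs on its own.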
Highlights
Steganography is the art of hiding secret information within another public and innocuous medium, e.g., image [1,2], audio [3], video [4] or text [5,6], in an inconspicuous manner.
Linguistic steganography based on text modification takes advantage of equivalent linguistic transformations to slightly modify the text content to hide the secret message while preserving the meaning of the original text.
In order to improve the running speed of linguistic steganography based on deep learning and to increase the length of secret information that can be embedded in each word, we propose a linguistic steganography method based on character-level text generation.
Summary
Steganography is the art of hiding secret information within another public and innocuous medium, e.g., image [1,2], audio [3], video [4] or text [5,6], in an inconspicuous manner. Both former types of linguistic steganography have the problem of low embedding capacity, but text generation-based linguistic steganography solves this problem well. This type of method does not require an original text in advance. Yang et al. [35] proposed a similar linguistic steganography method based on a recurrent neural network. They designed two kinds of coding methods, fixed-length and variable-length, to encode each word according to the probability distribution of words for embedding information. In order to improve the running speed of linguistic steganography based on deep learning and to increase the length of secret information that can be embedded in each word, we propose a linguistic steganography method based on character-level text generation.
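The abstract also describes generating several candidate stego texts from different starting strings and keeping the highest-quality one. The sketch below scores candidates by their perplexity under the same character-level model; perplexity is only an illustrative quality measure here, since the exact selection criterion is not spelled out in this summary. predict_fn is the same hypothetical prefix -> {character: probability} interface as in the earlier sketch, and generate_fn(start) stands in for an embedding routine such as embed_bits above.

```python
import math

# Hedged sketch of the candidate-selection strategy: generate one candidate
# stego text per starting string and keep the one the character-level model
# scores best. Perplexity is an illustrative assumption, not necessarily the
# paper's actual quality criterion.

def perplexity(predict_fn, text, start):
    """Per-character perplexity of the generated suffix under the model."""
    log_sum, n = 0.0, 0
    prefix = start
    for ch in text[len(start):]:
        probs = predict_fn(prefix)
        log_sum -= math.log(probs.get(ch, 1e-12))
        prefix += ch
        n += 1
    return math.exp(log_sum / max(n, 1))


def best_stego(predict_fn, generate_fn, starts):
    """Embed the same secret bits from each starting string and return the
    candidate stego text with the lowest perplexity."""
    best_score, best_text = float("inf"), None
    for start in starts:
        stego = generate_fn(start)
        score = perplexity(predict_fn, stego, start)
        if score < best_score:
            best_score, best_text = score, stego
    return best_text
```

Under this reading, enlarging the list of starting strings can only improve (or keep equal) the best candidate's score, which is consistent with the abstract's observation that quality rises with the number of candidates while the improvement rate tapers off.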