A Word-Embedding-Based Steganalysis Method for Linguistic Steganography via Synonym Substitution

Lingyun Xiang, Xiaobo Shen, Jingmin Yu, Chunfang Yang, Daojian Zeng

doi:10.1109/access.2018.2878273

Open Access

https://doi.org/10.1109/access.2018.2878273

Abstract

The development of steganography technology threatens the security of private information in smart campuses. To prevent privacy disclosure, a linguistic steganalysis method based on word embedding is proposed to detect private information hidden in synonyms in texts. With the continuous Skip-gram language model, each synonym and the words in its context are represented as word embeddings, which encode the semantic meanings of words into low-dimensional dense vectors. The context fitness, which characterizes the suitability of a synonym by its semantic correlations with context words, is estimated from the corresponding word embeddings and weighted by the TF-IDF values of the context words. By analyzing the differences in context fitness values among synonyms in the same synonym set, and the differences between those in cover and stego texts, three features are extracted and fed into a support vector machine classifier for the steganalysis task. The experimental results show that the proposed steganalysis method improves the average F-value by at least 4.8% over two baselines. In addition, the detection performance can be further improved by learning better word embeddings.
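The context-fitness idea can be illustrated with a minimal sketch. The abstract does not give the exact formula, so the form used here (a TF-IDF-weighted mean of cosine similarities between a synonym and its context words) is an assumption, as are the function names, the toy two-dimensional embeddings, and the TF-IDF weights. It is not the authors' implementation.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if either is zero)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def context_fitness(word, context, embeddings, tfidf):
    """Assumed form of context fitness: TF-IDF-weighted mean cosine similarity
    between `word` and each context word. Words without an embedding are skipped."""
    if word not in embeddings:
        return 0.0
    total = 0.0
    weight_sum = 0.0
    for c in context:
        if c not in embeddings:
            continue
        w = tfidf.get(c, 0.0)
        total += w * cosine(embeddings[word], embeddings[c])
        weight_sum += w
    return total / weight_sum if weight_sum > 0 else 0.0


# Hypothetical toy data: real embeddings would come from a trained Skip-gram model.
embeddings = {
    "big": [1.0, 0.0],
    "large": [0.0, 1.0],
    "house": [1.0, 0.0],
    "the": [0.0, 1.0],
}
tfidf = {"house": 3.0, "the": 1.0}  # hypothetical TF-IDF weights of context words
context = ["the", "house"]

# Score each member of a synonym set against the same context.
scores = {s: context_fitness(s, context, embeddings, tfidf) for s in ["big", "large"]}
print(scores)
```

With these toy values, "big" scores 0.75 and "large" scores 0.25, because the highly weighted context word "house" points in the same direction as "big". Differences of this kind among members of one synonym set are the sort of signal the abstract describes turning into features for the classifier.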

Journal: IEEE Access
Publication Date: Jan 1, 2018
Citations: 50
License type: cc-by-nc-nd

Similar Papers

Word Embeddings for Natural Language Processing
01 Jan 2015

A rule-based/BPSO approach to produce low-dimensional semantic basis vectors set
Atefe Pakzad ... Morteza Analoui
Turkish Journal of Electrical Engineering and Computer Sciences | VOL. 30
01 Nov 2022

Reducing explicit word vectors dimensions using BPSO-based labeling algorithm and voting method
International Journal of Nonlinear Analysis and Applications | VOL. 12
01 Jul 2021

Skip-Gram-KR: Korean Word Embedding for Semantic Clustering
Sun-Young Ihm ... Ji-Hye Lee
IEEE Access | VOL. 7
01 Jan 2019
