A Big Data Text Coverless Information Hiding Based on Topic Distribution and TF-IDF

Jiaohua Qin,Yun Tan,Xuyu Xiang,Zhibin He,Zhuo Zhou

doi:10.4018/ijdcf.20210701.oa4

Abstract

Coverless information hiding has become a hot topic in recent years. The existing steganalysis tools are invalidated due to coverless steganography without any modification to the carrier. However, for the text coverless has relatively low hiding capacity, this paper proposed a big data text coverless information hiding method based on LDA (latent Dirichlet allocation) topic distribution and keyword TF-IDF (term frequency-inverse document frequency). Firstly, the sender and receiver build codebook, including word segmentation, word frequency and TF-IDF features, LDA topic model clustering. The sender then shreds the secret information, converts it into keyword ID through the keywords-index table, and searches the text containing the secret information keywords. Secondly, the searched text is taken as the index tag according to the topic distribution and TF-IDF features. At the same time, random numbers are introduced to control the keyword order of secret information.

Highlights

For the text coverless has relatively low hiding capacity, this paper proposed a big data text coverless information hiding method based on LDA topic distribution and keyword TF-IDF
This paper proposes a method of coverless text information hiding based on topic distribution and TF-IDF features mixed index
This paper proposes a coverless information hiding method based on LDA topic distribution and TFIDF feature mixed index of big data text

Summary

Introduction

This method used word rank map and word frequency of words as distance calculation to retrieve ordinary text containing secret information from text database to realize coverless information hiding This method has a low hiding capacity, and a Chinese character can only be hidden in a natural text. Chen et al (2015) proposed coverless information hiding technology based on mathematical expressions (Sun,2002, p.707) of Chinese characters in 2015 (2015, p.133) This method first extracted the secret information vector from the secret information, and retrieved a text containing the secret information vector based on the big data text, so as to achieve the purpose of hiding the secret information without any modification to the text. The hiding capacity has been improved, but it is still relatively small which is difficult to meet the actual demand

Methods

Results

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: International Journal of Digital Crime and Forensics	Publication Date: Jul 1, 2021
Citations: 9	License type: CC BY 3.0

R Discovery Prime

R Discovery Prime

A Big Data Text Coverless Information Hiding Based on Topic Distribution and TF-IDF

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: International Journal of Digital Crime and Forensics

Lead the way for us

Similar Papers

CatchPhish: detection of phishing websites by inspecting URLs
Routhu Srinivasa Rao ... Tatti Vaishnavi
Journal of Ambient Intelligence and Humanized Computing | VOL. 11
Routhu Srinivasa Rao, et. al.Routhu Srinivasa Rao ... Tatti Vaishnavi
10 May 2019
Journal of Ambient Intelligence and Humanized Computing | VOL. 11

Structuring of Unstructured Data from Heterogeneous Sources
B L Shilpa ... B R Shambhavi
Indian Journal Of Science And Technology | VOL. 15
B L Shilpa, et. al.B L Shilpa ... B R Shambhavi
05 Nov 2022
Indian Journal Of Science And Technology | VOL. 15

SentiCon: A Concept Based Feature Set For Sentiment Analysis
Satanik Mitra ... Mamata Jenamani
-
Satanik Mitra, et. al.Satanik Mitra ... Mamata Jenamani
01 Dec 2018
01 Dec 2018

US Based COVID-19 Tweets Sentiment Analysis Using TextBlob and Supervised Machine Learning Algorithms
Rashid Khan ... Khadija Kanwal
-
Rashid Khan, et. al.Rashid Khan ... Khadija Kanwal
05 Apr 2021
05 Apr 2021

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A Big Data Text Coverless Information Hiding Based on Topic Distribution and TF-IDF

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: International Journal of Digital Crime and Forensics