Abstract

This paper explores a method for Korean text watermarking based on morphemic and syntactic analysis. The scheme applies two transformations: a predicate nominal is separated into its nominal and its predicate, and a syntactic adverbial is displaced. Korean, as an agglutinative language, provides a good basis for morpheme-based natural language watermarking because a word consists of several morphemes. A Korean word usually consists of one content morpheme and one function morpheme. A predicate nominal is an exception: it has two content morphemes (a nominal and a predicate) and one function morpheme, so it can be divided into a nominal and a predicate. We also perform syntax-based watermarking, using the fact that most languages permit a syntactic adverbial to be displaced within its clause. Combining these morphemic and syntactic characteristics, we propose a language watermarking method based on syntactic displacement and morphological division. To make the system more secure, we also include a sentence weight value and encode the weight value with a watermark bit. The method does not change the meaning of most marked sentences, and it preserves their naturalness. Experimental results show that the rate of unnatural sentences in the marked text is reasonable, that the watermarking capacity is better than that of previous systems, and that the coverage of marked sentences is also reasonable. They also show that the marked text retains the original style and carries the same information without semantic distortion.
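The embedding idea summarized above can be illustrated with a minimal sketch. Everything specific in it is an assumption made for illustration, not the paper's implementation: the abstract does not say how the sentence weight is computed or how it is combined with the watermark bit, so the sketch uses a keyed hash bit and XOR as stand-ins. The names `weight_bit`, `embed`, `extract`, `SENTENCES`, and `SECRET` are hypothetical, the Korean sentence pairs are illustrative examples chosen here rather than taken from the paper, and the transformed forms are supplied by hand instead of being produced by a morphological or syntactic analyzer.

```python
import hashlib
import hmac

def weight_bit(sentence_key: str, secret: bytes) -> int:
    """Hypothetical sentence weight: one keyed-hash bit of a sentence key.

    The key must not depend on which surface form is chosen, so the
    extractor can recompute the same weight.
    """
    digest = hmac.new(secret, sentence_key.encode("utf-8"), hashlib.sha256).digest()
    return digest[0] & 1

def embed(sentences, bits, secret):
    """Pick the plain or transformed form of each carrier sentence.

    sentences: list of (key, plain_form, transformed_form_or_None).
    A sentence with no transformed form cannot carry a bit.
    """
    remaining = iter(bits)
    marked = []
    for key, plain, transformed in sentences:
        bit = next(remaining, None) if transformed is not None else None
        if bit is None:
            marked.append(plain)  # not a carrier, or no bits left
            continue
        # Assumed combination rule: surface state = bit XOR weight.
        state = bit ^ weight_bit(key, secret)
        marked.append(transformed if state else plain)
    return marked

def extract(marked, sentences, secret):
    """Recover bits by checking which form appears (non-blind toy extractor)."""
    bits = []
    for text, (key, plain, transformed) in zip(marked, sentences):
        if transformed is None:
            continue
        state = 1 if text == transformed else 0
        bits.append(state ^ weight_bit(key, secret))
    return bits

SECRET = b"demo-key"  # hypothetical shared secret
SENTENCES = [
    # predicate nominal split: 공부했다 -> 공부를 했다
    ("s1", "그는 열심히 공부했다.", "그는 열심히 공부를 했다."),
    # no applicable transformation, so this sentence carries no bit
    ("s2", "비가 온다.", None),
    # adverbial displacement: 어제 moved to the front of the clause
    ("s3", "그는 어제 학교에 갔다.", "어제 그는 학교에 갔다."),
]
BITS = [1, 0]

marked = embed(SENTENCES, BITS, SECRET)
recovered = extract(marked, SENTENCES, SECRET)
```

In this sketch the surface form alone does not reveal the bit, because the keyed weight flips the mapping per sentence; that is one plausible reading of how a weight value could add security. A real extractor would presumably detect the split or displaced form through linguistic analysis rather than by comparing against stored candidates.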

