Building Machine Learning System with Deep Neural Network for Text Processing

Shashi Pal Singh, Ajai Kumar, Nisheeth Joshi, Shikha Jain, Anshika Rastogi, Hemant Darbari

doi:10.1007/978-3-319-63645-0_56

Abstract

This paper presents a method and process for building a machine learning system that uses a Deep Neural Network (DNN) for lexical analysis of text. Part-of-speech (POS) tagging of words is important in natural language processing, whether for speech technology or machine translation. Recent advances in deep neural networks help achieve better results in POS tagging of words and phrases. The Word2vec tool of the Dl4j library is widely used to represent words in a continuous vector space, and these vectors capture the syntactic and semantic meaning of the corresponding words. Given a database of sample words with their POS categories, it is possible to assign POS tags to words, but this lookup fails when a word is not present in the database. Cosine similarity plays an important role in finding POS tags for words and phrases that were not previously trained or POS-tagged: the system assigns the appropriate POS tag to such a word by finding its nearest similar words, using the vectors trained with Word2vec. Deep neural networks such as the recurrent neural network (RNN) outperform traditional state-of-the-art approaches because they deal with the issue of word sense disambiguation. Semi-supervised learning is used to train the network. This approach is applicable to Indian languages as well as foreign languages. In this paper, an RNN is implemented to build a machine learning system for POS tagging of words in English-language sentences.
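The cosine-similarity fallback described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation, which uses the Dl4j library. The vectors below are invented three-dimensional stand-ins for trained Word2vec embeddings, and the names `VECTORS`, `POS_LEXICON`, `cosine_similarity`, and `tag_word` are hypothetical, chosen only for this example.

```python
import math

# Toy 3-dimensional vectors standing in for trained Word2vec embeddings.
# The values are invented for illustration only.
VECTORS = {
    "dog": [0.9, 0.1, 0.0],
    "cat": [0.8, 0.2, 0.1],
    "run": [0.1, 0.9, 0.1],
    "walk": [0.2, 0.8, 0.0],
    "quick": [0.1, 0.1, 0.9],
    "puppy": [0.85, 0.15, 0.05],  # has a vector but no POS entry
}

# Hypothetical database of sample words with known POS tags.
POS_LEXICON = {
    "dog": "NOUN",
    "cat": "NOUN",
    "run": "VERB",
    "walk": "VERB",
    "quick": "ADJ",
}


def cosine_similarity(u, v):
    """Return the cosine of the angle between vectors u and v."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0  # similarity is undefined for a zero vector
    return dot / (norm_u * norm_v)


def tag_word(word):
    """Look the word up directly; otherwise borrow the tag of its nearest tagged neighbour."""
    if word in POS_LEXICON:
        return POS_LEXICON[word]
    if word not in VECTORS:
        return None  # no embedding available, so no neighbour can be found
    nearest = max(
        POS_LEXICON,
        key=lambda known: cosine_similarity(VECTORS[word], VECTORS[known]),
    )
    return POS_LEXICON[nearest]
```

With these toy values, `tag_word("puppy")` falls back to the nearest tagged neighbour and returns that neighbour's tag, while a word with no vector at all yields `None`. A real system would presumably replace the dictionary of vectors with a trained embedding model and might take a vote over several nearest neighbours rather than a single one.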

Similar Papers

Implementation of Kadazan Tagger Based on Brill's Method
Marylyn Alex ... Lailatul Qadri Zakaria
Journal of ICT Research and Applications | VOL. 7
01 Dec 2013

Part of Speech Tagging for Tamil Language Using Deep Learning
Hemakasiny Visuwalingam ... Ratnasingam Sakuntharaj
12 Sep 2021

Improving Persian POS tagging using the maximum entropy model
Ahmad A Kardan ... Maryam Bahojb Imani
01 Feb 2014

Combination of Genetic Algorithm and Brill Tagger Algorithm for Part of Speech Tagging Bahasa Madura
Nindian Puspa Dewi ... Ubaidi Ubaidi
Proceeding of the Electrical Engineering Computer Science and Informatics | VOL. 7
01 Oct 2020
