Semantic Representation Of Sentence Research Articles

This paper develops a model that addresses sentence embedding, a hot topic in current natural language processing research, using recurrent neural networks with Long Short-Term Memory (LSTM) cells. Due to its ability to capture long term memory, the LSTM-RNN accumulates increasingly richer information as it goes through the sentence, and when it reaches the last word, the hidden layer of the network provides a semantic representation of the whole sentence. In this paper, the LSTM-RNN is trained in a weakly supervised manner on user click-through data logged by a commercial web search engine. Visualization and analysis are performed to understand how the embedding process works. The model is found to automatically attenuate the unimportant words and detects the salient keywords in the sentence. Furthermore, these detected keywords are found to automatically activate different cells of the LSTM-RNN, where words belonging to a similar topic activate the same cell. As a semantic representation of the sentence, the embedding vector can be used in many different applications. These automatic keyword detection and topic allocation abilities enabled by the LSTM-RNN allow the network to perform document retrieval, a difficult language processing task, where the similarity between the query and documents can be measured by the distance between their corresponding sentence embedding vectors computed by the LSTM-RNN. On a web search task, the LSTM-RNN embedding is shown to significantly outperform several existing state of the art methods. We emphasize that the proposed model generates sentence embedding vectors that are specially useful for web document retrieval tasks. A comparison with a well known general sentence embedding method, the Paragraph Vector, is performed. The results show that the proposed method in this paper significantly outperforms it for web document retrieval task.

Read full abstract

INTRODUCTION: The study focuses on how cortical activations are modulated by the processing of different linguistic units when people perform semantic language tasks. From the perspective of cognitive science, semantic activation entails a process of combining the meanings of small parts (units) into a large whole (4). Brain mapping data indicated a significant difference in semantic retrieval of Chinese single characters and phrases (1, 3). Here we hypothesize that distinct brain areas are recruited in processing different linguistic units semantically (2). We used sentences and characters in this study to verify this hypothesis. METHODS: Nine native Chinese speakers participated. There were two experimental tasks, character semantic judgment and sentential semantic judgment. Each task was contrasted with its own baseline which controlled for the influence of processing efforts. The study was performed on a 2 T GE/Elscint Prestige whole-body scanner. A T2*-weighted gradient-echo EPI sequence was used, with the slice thickness = 6 mm, in-plane resolution = 2.9mm x 2.9mm, and TR/TE/FA= 2000 ms/45 ms/90 degree. Eighteen contiguous axial slices were acquired to cover the whole brain. SPM99 was used for image analysis. RESULTS AND CONCLUSION: Areas common to semantic processing of characters and sentences included left infeo-frontal gyrus (BA 45 and 47). Left middle temporal and precuneus gyri (BA 19, 39, 37) contributed to semantic representation of sentences, but not characters. Other areas mediating the semantic processing of sentences were cuneus, lingual gyrus, pre-and post-central gyri, supero-medial frontal cortex, and cingulate. Our fMRI results showed that while left inferior frontal cortex is responsible for activation of semantic valence of language stimuli in general, meaning processing of large linguistic units recruits cortical areas that are not active in processing simple language units.

Read full abstract

Semantic Representation Of Sentence Research Articles

Related Topics

Articles published on Semantic Representation Of Sentence

A deep learning approach to recognizing fine-grained expressway location reference from unstructured texts in Chinese

Attention-based Stacked Bidirectional Long Short-term Memory Model for Word Sense Disambiguation

The Complexity of Semantic Components in Linguistic Era

Densely Enhanced Semantic Network for Conversation System in Social Media

TreeLSTM with tag-aware hypernetwork for sentence representation

A Deep Paraphrase Identification Model Interacting Semantics with Syntax

Learning semantic sentence representations from visually grounded language without lexical knowledge

Apps-based Machine Translation on Smart Media Devices - A Review

Multi-Turn Response Selection for Chatbots With Hierarchical Aggregation Network of Multi-Representation

Leveraging syntactic and semantic graph kernels to extract pharmacokinetic drug drug interactions from biomedical literature.

Deep Sentence Embedding Using Long Short-Term Memory Networks: Analysis and Application to Information Retrieval

Effect of the Compatibility between the Noun Phrase and the Verb in Forming Semantic Representations of Sentences

Language

An approach to natural-language semantics in logic programming

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Semantic Representation Of Sentence Research Articles

Related Topics

Articles published on Semantic Representation Of Sentence

A deep learning approach to recognizing fine-grained expressway location reference from unstructured texts in Chinese

Attention-based Stacked Bidirectional Long Short-term Memory Model for Word Sense Disambiguation

The Complexity of Semantic Components in Linguistic Era

Densely Enhanced Semantic Network for Conversation System in Social Media

TreeLSTM with tag-aware hypernetwork for sentence representation

A Deep Paraphrase Identification Model Interacting Semantics with Syntax

Learning semantic sentence representations from visually grounded language without lexical knowledge

Apps-based Machine Translation on Smart Media Devices - A Review

Multi-Turn Response Selection for Chatbots With Hierarchical Aggregation Network of Multi-Representation

Leveraging syntactic and semantic graph kernels to extract pharmacokinetic drug drug interactions from biomedical literature.

Deep Sentence Embedding Using Long Short-Term Memory Networks: Analysis and Application to Information Retrieval

Effect of the Compatibility between the Noun Phrase and the Verb in Forming Semantic Representations of Sentences

Language

An approach to natural-language semantics in logic programming