External Knowledge Bases Research Articles

The task of multi-source web entity resolution (MSWER) aims to automatically discover entity references from multiple web sources that refer to the same real-world entity, which plays an important role in tasks such as question answering and recommendations. However, existing approaches typically suffer from three major limitations: (1) they usually treat the MSWER as an information retrieval task and focus on learning the similarity between entity references based on the associated features extracted from multiple sources; (2) they ignore the valuable implicit interactions between the associated features of different entities that cannot be directly captured based on the given data without any external knowledge; (3) they didn’t consider the redundant and noisy interactions between features. To overcome these limitations, this paper presents a novel attentive interaction-driven entity resolution model (AIDER). Our theme is to capture both the explicit and implicit interactions of features associated with entity references in the form of paths, and further develop an end-to-end entity resolution model for inferring the equivalent entity references. Accordingly, an external knowledge base is leveraged to construct paths for implicit interactions, and a well-designed attention mechanism is further employed to measure the importance of each path-based interaction, which focuses on useful interactions and neglects those redundant and noisy ones. Experimental results on three real-world datasets demonstrate that AIDER outperforms the state-of-the-art approaches.

Read full abstract

Text representation, a crucial step for text mining and natural language processing, concerns about transforming unstructured textual data into structured numerical vectors to support various machine learning and data mining algorithms. For document classification, one classical and commonly adopted text representation method is Bag-of-Words (BoW) model. BoW represents document as a fixed-length vector of terms, where each term dimension is a numerical value such as term frequency or tf-idf weight. However, BoW simply looks at surface form of words. It ignores the semantic, conceptual and contextual information of texts, and also suffers from high dimensionality and sparsity issues. To address the aforementioned issues, we propose a novel document representation scheme called Bag-of-Concepts (BoC), which automatically acquires useful conceptual knowledge from external knowledge base, then conceptualizes words and phrases in the document into higher level semantics (i.e. concepts) in a probabilistic manner, and eventually represents a document as a distributed vector in the learned concept space. By utilizing background knowledge from knowledge base, BoC representation is able to provide more semantic and conceptual information of texts, as well as better interpretability for human understanding. We also propose Bag-of-Concept-Clusters (BoCCl) model which clusters semantically similar concepts together and performs entity sense disambiguation to further improve BoC representation. In addition, we combine BoCCl and BoW representations using an attention mechanism to effectively utilize both concept-level and word-level information and achieve optimal performance for document classification.

Read full abstract

External Knowledge Bases Research Articles

Related Topics

Articles published on External Knowledge Bases

Keyphrase extraction from single textual documents based on semantically defined background knowledge and co-occurrence graphs

Keyphrase extraction from single textual documents based on semantically defined background knowledge and co-occurrence graphs

POI Classification Method Based on Feature Extension and Deep Learning

Find truth in the hands of the few: acquiring specific knowledge with crowdsourcing

Internal and External Sources of Knowledge in Manufacturing and Service Enterprises. A Comparative Analysis of European Union Countries

A Mathematical Model for Universal Semantics.

Gains or pains? Investigating effects of R&D collaboration intensity and technological diversification on new product development

Knowledge-Enhanced Graph Neural Networks for Sequential Recommendation

Heterogeneous classifier ensemble for sentiment analysis of Bengali and Hindi tweets

Incorporation of knowledge through acquisition in the pharmaceutical industry

A New Text Sentiment Analysis Method Based on Chinese Morphological Features and HowNet

Improving the robustness of machine reading comprehension model with hierarchical knowledge and auxiliary unanswerability prediction

Attentive interaction-driven entity resolution over multi-source web information

Knowledge-Graph Augmented Word Representations for Named Entity Recognition

Efficient Weighted Semantic Score Based on the Huffman Coding Algorithm and Knowledge Bases for Word Sequences Embedding

Bag-of-Concepts representation for document classification based on automatic knowledge acquisition from probabilistic knowledge base

Cross-Media Semantic Correlation Learning Based on Deep Hash Network and Semantic Expansion for Social Network Cross-Media Search.

Knowledge-Based Topic Model for Multi-Modal Social Event Analysis

Enhancing the Quality of Image Tagging Using a Visio-Textual Knowledge Base

Mining Entity Synonyms with Efficient Neural Set Generation

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

External Knowledge Bases Research Articles

Related Topics

Articles published on External Knowledge Bases

Keyphrase extraction from single textual documents based on semantically defined background knowledge and co-occurrence graphs

Keyphrase extraction from single textual documents based on semantically defined background knowledge and co-occurrence graphs

POI Classification Method Based on Feature Extension and Deep Learning

Find truth in the hands of the few: acquiring specific knowledge with crowdsourcing

Internal and External Sources of Knowledge in Manufacturing and Service Enterprises. A Comparative Analysis of European Union Countries

A Mathematical Model for Universal Semantics.

Gains or pains? Investigating effects of R&D collaboration intensity and technological diversification on new product development

Knowledge-Enhanced Graph Neural Networks for Sequential Recommendation

Heterogeneous classifier ensemble for sentiment analysis of Bengali and Hindi tweets

Incorporation of knowledge through acquisition in the pharmaceutical industry

A New Text Sentiment Analysis Method Based on Chinese Morphological Features and HowNet

Improving the robustness of machine reading comprehension model with hierarchical knowledge and auxiliary unanswerability prediction

Attentive interaction-driven entity resolution over multi-source web information

Knowledge-Graph Augmented Word Representations for Named Entity Recognition

Efficient Weighted Semantic Score Based on the Huffman Coding Algorithm and Knowledge Bases for Word Sequences Embedding

Bag-of-Concepts representation for document classification based on automatic knowledge acquisition from probabilistic knowledge base

Cross-Media Semantic Correlation Learning Based on Deep Hash Network and Semantic Expansion for Social Network Cross-Media Search.

Knowledge-Based Topic Model for Multi-Modal Social Event Analysis

Enhancing the Quality of Image Tagging Using a Visio-Textual Knowledge Base

Mining Entity Synonyms with Efficient Neural Set Generation