Graph-based Text Representation Research Articles

Graph-based text representation is an important preprocessing step in data and text mining, Natural Language Processing (NLP), and information retrieval. Graph-based methods focus on representing text documents as graphs so that their most informative characteristics can be exploited. This study reviews the methods employed or developed for graph-based text representation and lists their advantages and disadvantages. The literature shows that some of the proposed graph-based methods fail to represent texts adequately in certain situations. Several techniques are commonly used in graph-based text representation, but these techniques and tools still have weaknesses and shortcomings that significantly affect the success of graph representation and graph matching. In this review, we conduct a comprehensive survey of the state of the art in graph-based text representation and learning. We provide a formal description of the problem of graph-based text representation and introduce some basic concepts. More significantly, this study proposes a new taxonomy of graph-based text representation, categorizing the existing studies by representation characteristics and scheme techniques. Within the representation scheme taxonomy, we introduce four main types of conceptual graph schemes and summarize the challenges faced in each scheme. The main issues of graph representation, such as research topics and the sub-taxonomy of graph models for web documents, are introduced and categorized. This research also covers some natural language understanding tasks that depend on different types of graph structures. In addition, the graph matching taxonomy comprises three main categories based on the matching approach: structural-, semantic-, and similarity-based approaches. Moreover, an in-depth comparison of these approaches is reported in terms of methods and tools, the concepts of matching and locality, and the application domains that use these tools. Finally, the paper recommends seven promising directions for future study in the graph-based text representation field. These recommendations are summarized and highlighted as open problems and challenges of graph-based text representation and learning, to help researchers in this field address the remaining research gaps.
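
As a rough illustration of the similarity-based matching category mentioned in this abstract, the sketch below compares two texts by the overlap of their graph edges. It is not taken from the reviewed paper: the adjacent-word edge construction, the Jaccard measure, and the function names `text_to_edges` and `edge_jaccard` are all assumptions chosen for brevity.

```python
import re


def text_to_edges(text):
    """Build an undirected graph as a set of edges between adjacent words.

    Assumption: nodes are lowercased word tokens and an edge links each
    pair of neighbouring tokens. Real systems use richer graph schemes.
    """
    tokens = re.findall(r"[a-z']+", text.lower())
    return {
        frozenset((left, right))
        for left, right in zip(tokens, tokens[1:])
        if left != right  # skip self-loops from repeated words
    }


def edge_jaccard(edges_a, edges_b):
    """Jaccard similarity of two edge sets: |A ∩ B| / |A ∪ B|."""
    union = edges_a | edges_b
    if not union:
        return 0.0  # neither graph has edges, so there is nothing to match
    return len(edges_a & edges_b) / len(union)


if __name__ == "__main__":
    a = text_to_edges("graphs represent text")
    b = text_to_edges("graphs represent documents")
    print(edge_jaccard(a, b))
```

Here the two texts share one edge (graphs, represent) out of three distinct edges, so the score is one third. Structural and semantic matching would need different machinery, such as subgraph isomorphism or a lexical resource, which this sketch does not attempt.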

Over the last few years, machine learning over graph structures has brought significant improvements to text mining applications such as event detection, opinion mining, and news recommendation. One of the primary challenges in this regard is structuring a graph that encodes the features of textual data so that machine learning algorithms can use them effectively. In addition, exploring and exploiting semantic relations is regarded as a principal step in text mining applications. However, most traditional text mining methods perform rather poorly at exploiting such relations. In this paper, we propose a sentence-level graph-based text representation that includes stop words in order to capture semantic and term relations. We then apply a representation learning approach to the combined sentence graphs to extract latent, continuous features of the documents. Finally, the learned document features are fed into a deep neural network for the sentiment classification task. The experimental results demonstrate that the proposed method substantially outperforms related sentiment analysis approaches on several benchmark datasets. Furthermore, our method generalizes to different datasets without any dependency on pre-trained word embeddings.
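
The first step of the pipeline described in this abstract, building one graph per sentence with stop words kept and then combining the sentence graphs, might be sketched as follows. The abstract does not specify how edges are formed, so the sliding co-occurrence window, the edge weights, and the names `sentence_graph` and `document_graph` are assumptions made purely for illustration; the representation learning and neural network stages are not shown.

```python
import re
from collections import Counter


def sentence_graph(sentence, window=1):
    """Return weighted co-occurrence edges for one sentence.

    Stop words are deliberately kept, as the abstract describes.
    Assumption: each token is linked to the next `window` tokens.
    """
    tokens = re.findall(r"[a-z']+", sentence.lower())
    edges = Counter()
    for i, left in enumerate(tokens):
        for right in tokens[i + 1 : i + 1 + window]:
            if left != right:
                # Sort the pair so the edge is undirected.
                edges[tuple(sorted((left, right)))] += 1
    return edges


def document_graph(text, window=1):
    """Combine the sentence graphs of a document by summing edge weights."""
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    combined = Counter()
    for sentence in sentences:
        combined.update(sentence_graph(sentence, window))
    return combined


if __name__ == "__main__":
    graph = document_graph("The movie is not good. The movie is long.")
    for (u, v), weight in sorted(graph.items()):
        print(u, v, weight)
```

Keeping stop words preserves edges such as (is, not), which carry the negation that a sentiment classifier depends on; a stop-word filter would discard them before the graph is built.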

Articles published on Graph-based Text Representation

TTG-Text: A Graph-Based Text Representation Framework Enhanced by Typical Testors for Improved Classification

GABSA-PT: Graph Neural Networks for Aspect-level Sentiment Analysis in Portuguese Language

Deep learning, graph-based text representation and classification: a survey, perspectives and challenges

Graph-Based Text Representation and Matching: A Review of the State of the Art and Future Challenges

Leveraging deep graph-based text representation for sentiment polarity applications

Graph-Based Text Representation: A Survey of Current Approaches

Research Trends on Graph-Based Text Mining

Research Trends in Graph-Based Text Representation Models for Text Mining (텍스트 마이닝을 위한 그래프 기반 텍스트 표현 모델의 연구 동향)

Graph-Based Model for Text Representation
