Enhanced Natural Language Interface for Web-Based Information Retrieval

Tian Bai,Zhenting Zhang,Shuyu Guo,Yan Ge,Leiguang Gong

doi:10.1109/access.2020.3048164

Tian Bai, Zhenting Zhang + Show 3 more

Open Access

https://doi.org/10.1109/access.2020.3048164

Copy DOI

Abstract

Database application is at the core of most web application systems such as web-based email, source codes repository management, public scientific data repository management, news portals, and publication repository of various fields. However, the usage of these database systems for data and information retrieval is severely limited because of lacking support for processing search queries expressed in a natural language (NL). Most web interfaces for databases today only take search queries entered in some form of logical combination of keywords or text strings, which restrict the scope and depth of what a web user really wants to search for, even though natural language based data or information retrieval has made significant advances in recent years. To overcome or at least to alleviate such limitation in web information services, we propose in this article an improved neural model based on an existing framework IRNet for NL query of databases, in which a representation of Gated Graph Neural Network (GGNN) is introduced to encode the database entities and relations. We also represent and use the database values in the prediction model to identify and match table and column names for automatic synthesize a correct SQL statement from a query expressed in a NL sentence. Experiments with a public dataset demonstrates the promising potential of our approach.

Highlights

Nowadays database (DB) application is the backbone of most web-based information services such as web-based email, source codes repository management, public scientific data repository management, news portals, and publication repositories of various fields [1]–[3]
We introduced a representation of Gated Graph Neural Network (GGNN) [21], [22] to encode the DB schema replacing the original IRNet representation of DB schema
To show the value of maximizing the use of information embedded in relational databases in order to improve the prediction performance of a text to SQL (TTS) system, we have described following two new algorithmic components as extensions to the IRNet neural model: 1) Introducing database values into the model, computing the similarity between natural language or textual questions or queries and the database values, and establishing correlations between database values and column names through an Attention mechanism

Summary

INTRODUCTION

Nowadays database (DB) application is the backbone of most web-based information services such as web-based email, source codes repository management, public scientific data repository management, news portals, and publication repositories of various fields [1]–[3]. To most users without such knowledge and expertise, most likely they will not be able to take full advantage of the search tool for their data or information needs Such limitation can only be overcome or at least alleviated by a natural language interface with the support of NL query to SQL query (NL-SQL) or text to SQL (TTS) capabilities. Yu et al [17] proposed a large-scale, complex, and cross-domain Text-to-SQL dataset Spider containing databases of multiple tables. SyntaxSQLNet [18] is the first model developed for the Spider task using a syntax tree representing the features of the SQL queries It proposed a method for generating cross-domain training data to enhance model performance with data augmentation. Guo et al [23] propose a very interesting deep neural network based approach IRNet to tackle complex and cross-domain Text-to-SQL problems using Spider dataset.

METHODS

ENCODING DB SCHEMA WITH GRAPH NEURAL NETWORK

Merge the table and column vectors into a single node vector

DATASET

Findings

CONCLUSION

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: IEEE Access	Publication Date: Dec 30, 2020
Citations: 32	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Enhanced Natural Language Interface for Web-Based Information Retrieval

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: IEEE Access

Lead the way for us

Similar Papers

A Rule-Based System Implementing a Method for Translating FOL Formulas into NL Sentences
Aikaterini Mpagouli ... Ioannis Hatzilygeroudis
-
Aikaterini Mpagouli, et. al.Aikaterini Mpagouli ... Ioannis Hatzilygeroudis
01 Jan 2009
01 Jan 2009

DG‐based SPO tuple recognition using self‐attention M‐Bi‐LSTM
Joon‐Young Jung
ETRI Journal | VOL. 44
Joon‐Young JungJoon‐Young Jung
29 Nov 2021
ETRI Journal | VOL. 44

A Web-Based Interactive System for Learning NL to FOL Conversion
Ioannis Hatzilygeroudis ... Isidoros Perikos
-
Ioannis Hatzilygeroudis, et. al.Ioannis Hatzilygeroudis ... Isidoros Perikos
01 Jan 2009
01 Jan 2009

A Knowledge-based System for Translating FOL Formulas into NL Sentences
Aikaterini Mpagouli ... Ioannis Hatzilygeroudis
-
Aikaterini Mpagouli, et. al.Aikaterini Mpagouli ... Ioannis Hatzilygeroudis
01 Jan 2009
01 Jan 2009

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Enhanced Natural Language Interface for Web-Based Information Retrieval

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: IEEE Access