Natural Language Database Research Articles

The authors study the extraction of useful phrases from a natural language database by statistical methods. The aim is to leverage human effort by providing preprocessed phrase lists with a high percentage of useful material. The approach is to develop six different scoring methods that are based on different aspects of phrase occurrence. The emphasis here is not on lexical information or syntactic structure but rather on the statistical properties of word pairs and triples that can be obtained from a large database. The Unified Medical Language System (UMLS) incorporates a large list of humanly acceptable phrases in the medical field as a part of its structure. The authors use this list of phrases as a gold standard for validating their methods. A good method is one that ranks the UMLS phrases high among all phrases studied. Measurements are 11-point average precision values and precision-recall curves based on the rankings. The authors find of six different scoring methods that each proves effective in identifying UMLS quality phrases in a large subset of MEDLINE. These methods are applicable both to word pairs and word triples. All six methods are optimally combined to produce composite scoring methods that are more effective than any single method. The quality of the composite methods appears sufficient to support the automatic placement of hyperlinks in text at the site of highly ranked phrases. Statistical scoring methods provide a promising approach to the extraction of useful phrases from a natural language database for the purpose of indexing or providing hyperlinks in text.

Convergence is the phenomenon in human dialogue whereby participants adopt characteristics of each other's speech. Communicants are unaware of this occurring. If it were possible to invoke such a phenomenon in a natural language interface it would provide a means of keeping user inputs within the range of lexical and syntactic coverage of the system, while keeping the dialogue ‘natural’ in the sense of requiring no more conscious effort in observing conventions of format than human-human dialogue. A ‘Wizard of Oz’ study was conducted to test the feasibility of this technique. Subjects were required to type queries into what they thought was a natural language database querying system. On completion of input the system presented a paraphrase for confirmation by subjects before presenting the answer. The paraphrases were constructed using particular terms and syntactic structures. Subjects began to use these terms and structures spontaneously in subsequent queries. Observation of convergence in human-computer dialogue suggests that the technique can be incorporated in user interfaces to improve communication. The implementation issues for natural language dialogue are discussed, and other applications of the technique in HCI are outlined.

Natural Language Database Research Articles

Related Topics

Articles published on Natural Language Database

Corpus‐based statistical screening for content‐bearing terms

Natural language querying of databases: an information extraction approach in the conceptual query language

Corpus-based statistical screening for phrase identification.

Towards best practice in the development and evaluation of speech recognition components of a spoken language dialog system

Time, tense and aspect in natural language database interfaces

Metarepresentation in action: 3-, 4-, and 5-year-olds' developing theories of mind in parent-child conversations.

A trie compaction algorithm for a large set of keys

A method of compressing trie structures

Structuring medical information into a language-independent database.

Chapter 10 - WITHDRAWN: Natural Language

Generating and evaluating domain-oriented multi-word terms from texts

NAUDA

On mapping natural language constructs into relational algebra through E-R representation

A Comparison of Linear Keyword and Restricted Natural Language Data Base Interfaces for Novice Users

Natural language interface and database issues in applying expert systems to power systems

A measure of semantic relatedness for resolving ambiguities in natural language database requests

A neural net for extracting knowledge from natural language data bases

Exploiting convergence to improve natural language understanding

Transportable natural language database update

Interpretation of Natural Language Database Queries Using Optimization Methods

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Natural Language Database Research Articles

Related Topics

Articles published on Natural Language Database

Corpus‐based statistical screening for content‐bearing terms

Natural language querying of databases: an information extraction approach in the conceptual query language

Corpus-based statistical screening for phrase identification.

Towards best practice in the development and evaluation of speech recognition components of a spoken language dialog system

Time, tense and aspect in natural language database interfaces

Metarepresentation in action: 3-, 4-, and 5-year-olds' developing theories of mind in parent-child conversations.

A trie compaction algorithm for a large set of keys

A method of compressing trie structures

Structuring medical information into a language-independent database.

Chapter 10 - WITHDRAWN: Natural Language

Generating and evaluating domain-oriented multi-word terms from texts

NAUDA

On mapping natural language constructs into relational algebra through E-R representation

A Comparison of Linear Keyword and Restricted Natural Language Data Base Interfaces for Novice Users

Natural language interface and database issues in applying expert systems to power systems

A measure of semantic relatedness for resolving ambiguities in natural language database requests

A neural net for extracting knowledge from natural language data bases

Exploiting convergence to improve natural language understanding

Transportable natural language database update

Interpretation of Natural Language Database Queries Using Optimization Methods