An analysis of ill-formed input in natural language queries to document retrieval systems

Charlene W Young, Caroline M Eastman, Robert L Oakman

doi:10.1016/0306-4573(91)90002-4

Abstract

We analyzed natural language document retrieval queries from the Thomas Cooper Library at the University of South Carolina to investigate the frequency of various types of ill-formed input, such as spelling errors, co-occurrence violations, conjunctions, ellipsis, and missing or incorrect punctuation. The primary goal was to determine whether there is a significant need to study ill-formed inputs in detail. We found that most of the queries were sentence fragments and that many of them contained some type of ill-formed input. Conjunctions caused the most problems, followed by punctuation errors. Spelling errors occurred in a small number of the queries. The remaining types of ill-formed input considered, ellipsis and co-occurrence violations, were not found in the queries.

Journal: Information Processing and Management
Publication Date: Jan 1, 1991
Citations: 13

Similar Papers

Research and development in natural language understanding
Ralph Weischedel
01 Jan 1989

Knowledge representation and natural language processing
R.M Weischedel
Proceedings of the IEEE | VOL. 74
01 Jan 1986

A rule-based approach to ill-formed input
Norman K Sondheimer ... Ralph M Weischedel
01 Jan 1980

Capability based natural language understanding
A Jennings ... C D Rowles
01 Jan 1990
