Using nearest‐neighbour searching techniques to access full‐text documents

Suliman Al‐Hawamdeh, Rachel De Vere, Geoff Smith, Peter Willett

doi:10.1108/eb024372

Abstract

Full‐text documents are usually searched by means of a Boolean retrieval algorithm that requires the user to specify the logical relationships between the terms of a query. In this paper, we summarise the results to date of a continuing programme of research at the University of Sheffield to investigate the use of nearest‐neighbour retrieval algorithms for full‐text searching. Given a natural‐language query statement, our methods rank the paragraphs comprising a full‐text document in order of decreasing similarity with the query, where the similarity for each paragraph is determined by the number of keyword stems that it has in common with the query. A full‐text document test collection has been created to allow systematic tests of retrieval effectiveness to be carried out. Experiments with this collection demonstrate that nearest‐neighbour searching provides a means for paragraph‐based access to full‐text documents that is of comparable effectiveness to both Boolean and hypertext searching. They also show that index term weighting schemes developed for the searching of bibliographical databases can be used to improve the effectiveness of retrieval from full‐text databases. A current project is investigating the extent to which a paragraph‐based full‐text retrieval system can be used to augment the explication facilities of an expert system on welding.
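The paragraph ranking described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the stop list, the crude suffix-stripping `stem` function, and the names `keyword_stems` and `rank_paragraphs` are all assumptions made for illustration, and a real system would use a proper stemming algorithm. Each paragraph is scored by the number of distinct keyword stems it shares with the query, and paragraphs are returned in order of decreasing score.

```python
import re

# Hypothetical stop list and suffix list, for illustration only.
STOP_WORDS = {"a", "an", "and", "the", "of", "in", "to", "for",
              "is", "are", "on", "by", "with"}
SUFFIXES = ("ing", "ed", "es", "s")


def stem(word):
    """Crude suffix stripping; a stand-in for a real stemming algorithm."""
    for suffix in SUFFIXES:
        # Strip only if at least three characters would remain.
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def keyword_stems(text):
    """Return the set of stems of the non-stop words in the text."""
    words = re.findall(r"[a-z]+", text.lower())
    return {stem(w) for w in words if w not in STOP_WORDS}


def rank_paragraphs(query, paragraphs):
    """Return (score, paragraph_index) pairs, best match first.

    The score is the number of distinct keyword stems shared by the
    query and the paragraph; ties keep the original paragraph order.
    """
    query_stems = keyword_stems(query)
    scored = [(len(query_stems & keyword_stems(p)), i)
              for i, p in enumerate(paragraphs)]
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return scored


paragraphs = [
    "Welding joins metals by melting them.",
    "Boolean queries combine terms with logical operators.",
    "Ranking paragraphs by shared query terms avoids Boolean logic.",
]
for score, index in rank_paragraphs("ranking paragraphs with query terms",
                                    paragraphs):
    print(score, paragraphs[index])
```

The score here is an unweighted count. A term weighting scheme of the kind the abstract mentions would replace the count with a sum of per-stem weights, which this sketch does not attempt.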

Journal: Online Review
Publication Date: Mar 1, 1991
Citations: 5

Similar Papers

Techniques for the measurement of clustering tendency in document retrieval systems
Abdelmoula El-Hamdouchi ... Peter Willett
Journal of Information Science | VOL. 13
01 Dec 1987

Query-time optimization techniques for structured queries in information retrieval
25 Nov 2013

Image retrieval: Benchmarking visual information indexing and retrieval systems
Abebe Rorissa
Bulletin of the American Society for Information Science and Technology | VOL. 33
01 Feb 2007

Implicit feedback for interactive information retrieval
Ryen W White
ACM SIGIR Forum | VOL. 39
01 Jun 2005
