An algorithm for suffix stripping

M.F Porter

doi:10.1108/00330330610681286

Abstract

PurposeThe automatic removal of suffixes from words in English is of particular interest in the field of information retrieval. This work was originally published in Program in 1980 and is republished as part of a series of articles commemorating the 40th anniversary of the journal.Design/methodology/approachAn algorithm for suffix stripping is described, which has been implemented as a short, fast program in BCPL.FindingsAlthough simple, it performs slightly better than a much more elaborate system with which it has been compared. It effectively works by treating complex suffixes as compounds made up of simple suffixes, and removing the simple suffixes in a number of steps. In each step the removal of the suffix is made to depend upon the form of the remaining stem, which usually involves a measure of its syllable length.Originality/valueThe piece provides a useful historical document on information retrieval.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

An algorithm for suffix stripping

Abstract

Talk to us

Similar Papers

More From: Program

Lead the way for us

Similar Papers

An algorithm for suffix stripping
M.F Porter
Program | VOL. 14
M.F PorterM.F Porter
01 Mar 1980
Program | VOL. 14

Arabic Studies’ Progress in Information Retrieval
Essam Hanandeh ... Hayel Khafajah
International Journal of Advanced Computer Science and Applications | VOL. 7
Essam Hanandeh, et. al.Essam Hanandeh ... Hayel Khafajah
01 Jan 2015
International Journal of Advanced Computer Science and Applications | VOL. 7

Discrepancy-Based Method for Hierarchical Distributed Optimization
Jonathan Gaudreault ... Jean-Marc Frayret
-
Jonathan Gaudreault, et. al.Jonathan Gaudreault ... Jean-Marc Frayret
01 Oct 2007
01 Oct 2007

Document Length Normalization by Statistical Regression
Sylvain Lamprier ... Tassadit Amghar
-
Sylvain Lamprier, et. al.Sylvain Lamprier ... Tassadit Amghar
01 Oct 2007
01 Oct 2007

Journal: Program	Publication Date: Jul 1, 2006
Citations: 189

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

An algorithm for suffix stripping

Abstract

Talk to us

Similar Papers

More From: Program