Stemming of French words based on grammatical categories

Jacques Savoy

doi:10.1002/(sici)1097-4571(199301)44:1<1::aid-asi1>3.0.co;2-1

Abstract

Automatic indexing systems use suffix stripping algorithms to cluster various words derived from a common root under the same stem. Currently, removing affixes to either a context-free or context-sensitive operation, where the context refers to the remaining stem. In this article, we propose a suffixing algorithm which uses grammatical categories to enhance the stemming process. This approach supports the use of foreign languages. In our case, the language is French, and a morphological analysis is required for removing inflectional suffixes or morphosyntactic variants of a lemma. After this analysis, we implement a suffix stripping algorithm which uses a dictionary and the grammatical categories to remove derivational suffixes. Our approach always returns a linguistically correct lemma, but not necessarily the “right” one. Based on our tests, this solution is an attractive one, with a mean error rate of 16%. We finish by explaining why we cannot expect significantly better results with this approach. © 1993 John Wiley & Sons, Inc.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Stemming of French words based on grammatical categories

Abstract

Talk to us

Similar Papers

More From: Journal of the American Society for Information Science

Lead the way for us

Journal: Journal of the American Society for Information Science	Publication Date: Jan 1, 1993
Citations: 86

Similar Papers

Strategy for automatic person indexing and retrieval system in news interview video sequences
Sanghee Lee ... Kanghyun Jo
-
Sanghee Lee, et. al.Sanghee Lee ... Kanghyun Jo
01 Jul 2017
01 Jul 2017

Discriminant Feature Analysis for Music Timbre Recognition and Automatic Indexing
Xin Zhang ... Zbigniew W Raś
-
Xin Zhang, et. al.Xin Zhang ... Zbigniew W Raś
17 Sep 2007
17 Sep 2007

The automatic indexing system AIR/PHYS - from research to applications
P Biebricher ... N Fuhr
-
P Biebricher, et. al.P Biebricher ... N Fuhr
01 Jan 1987
01 Jan 1987

Automatic subject indexing using an associative neural network
Yi-Ming Chung ... Bruce R Schatz
-
Yi-Ming Chung, et. al.Yi-Ming Chung ... Bruce R Schatz
01 Jan 1998
01 Jan 1998

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Stemming of French words based on grammatical categories

Abstract

Talk to us

Similar Papers

More From: Journal of the American Society for Information Science