Abstract

The paper presents a system, which resolves pronominals in Tamil. In resolving the pronominals we adopt a probabilistic method. The analysis depends on the salience weight of the candidate noun phrases (NP) for the antecedent-hood of the pronominal from the list of possible candidate NPs. The salience weight of an NP is obtained from the salience factors, which are determined by the probability of an NP to be the antecedent on the basis of the grammatical features. The input text is pre-processed and partially parsed before sending it to the salience analyzer. The partial parsing is done using morphological markings and no sophisticated notions of the modern formal linguistic theories are used. The system considers candidate nouns that occur four sentences above the sentence in which the pronoun occurs. The system gives 86.32% precision and 80.9% recall

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.