Abstract

One of the problems of e-business is to find relevant documents for making correct decisions. The main problem of the Internet is the huge amount of documents, which makes it difficult to find the relevant ones, hence the importance of the methods allowing for improving the quality of document retrieval. We discuss some linguistic problems of document retrieval on the internet related to the following natural language phenomena: (1) morphological processes: e.g., takes, took, taken are grammar forms of take; (2) polysemy and homonymy: most words have several senses, e.g., bank is a financial institution, shore, bench, etc.; (3) non-linearity of syntactic relations: in the case of a query that contains word combinations, the words forming a word combination can be separated by other words in the documents. Some linguistic-based methods and strategies related to the discussed problems are proposed that improve the quality of document retrieval or show the necessity of application of linguistic methods.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.