Retrieving web search results using Max–Max soft clustering for Hindi query

Amita Jain, Devendra K Tayal, Sudesh Yadav

doi:10.1007/s13198-014-0307-5

Abstract

Information retrieval (IR) is the process of finding relevant information among the millions of unstructured documents on the web. Despite its successes, IR still faces problems such as lexical ambiguity, compound word formation, and language morphology. To address the ambiguity problem, this paper proposes a graph-based soft clustering method that improves the performance of an IR system. Initially, words from the text snippets are used to construct a co-occurrence graph corresponding to the Hindi query given by a user. Other words present in the text corpus that are relevant to the query terms are then added on the basis of the Dice coefficient. For each interpretation of the user query, we retrieve results in the form of a web cluster. Sometimes two or more interpretations of the query are closely related, so many of the results returned for these interpretations are shared. This situation is handled better by soft clustering, so we use the Max–Max soft clustering approach. To rank the results within each cluster, we use several similarity measures: word overlap, degree overlap, token overlap, and average similarity. This is the first attempt at fuzzy IR for queries in the Hindi language, and experimental evaluation shows promising results.
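The graph-construction step described in the abstract can be illustrated with a minimal sketch. It assumes that co-occurrence is counted at the snippet level and that the Dice coefficient takes its usual form, Dice(a, b) = 2 * n(a, b) / (n(a) + n(b)), where n counts the snippets containing the word or word pair. The abstract does not give these details, so the function name `dice_graph`, the `threshold` parameter, and the toy tokens are all hypothetical, and the clustering and ranking steps are not shown.

```python
from collections import Counter
from itertools import combinations


def dice_graph(snippets, threshold=0.0):
    """Build a weighted co-occurrence graph from tokenized snippets.

    Assumption (not from the paper): two words co-occur when they appear
    in the same snippet, and an edge is kept when its Dice score exceeds
    `threshold`. Returns a dict mapping a sorted word pair to its score.
    """
    word_count = Counter()  # number of snippets containing each word
    pair_count = Counter()  # number of snippets containing each word pair
    for tokens in snippets:
        vocab = sorted(set(tokens))  # count each word once per snippet
        word_count.update(vocab)
        pair_count.update(combinations(vocab, 2))

    graph = {}
    for (a, b), n_ab in pair_count.items():
        dice = 2 * n_ab / (word_count[a] + word_count[b])
        if dice > threshold:
            graph[(a, b)] = dice
    return graph


# Toy, hypothetical snippets for an ambiguous query word "bank".
snippets = [
    ["bank", "river"],
    ["bank", "money"],
    ["bank", "river", "water"],
]
for pair, weight in sorted(dice_graph(snippets).items()):
    print(pair, round(weight, 3))
```

A soft clustering algorithm such as Max–Max would then operate on the edge weights of a graph like this one; raising `threshold` prunes weakly associated words before clustering.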

Journal: International Journal of System Assurance Engineering and Management
Publication Date: Dec 2, 2014
Citations: 7

Similar Papers

Soft large margin clustering
Yunyun Wang ... Songcan Chen
Information Sciences | VOL. 232
08 Jan 2013

Contents digest
Trends in Food Science & Technology | VOL. 6
01 Mar 1995

Soft clustering for information retrieval applications
Gloria Bordogna ... Gabriella Pasi
WIREs Data Mining and Knowledge Discovery | VOL. 1
03 Feb 2011

Lexical Ambiguity in Arabic Information Retrieval: The Case of Six Web-Based Search Engines
Abdulfattah Omar ... Mohammed Aldawsari
International Journal of English Linguistics | VOL. 10
06 Apr 2020
