Mixed Language Arabic-English Information Retrieval

Mohammed Mustafa,Hussein Suleman

doi:10.1007/978-3-319-18117-2_32

Abstract

For many non-English languages in developing countries (such as Arabic), text switching/mixing (e.g. between Arabic and English) is very prevalent, especially in scientific domains, due to the fact that most technical terms are borrowed from English and/or they are neither included in the native (non-English) languages nor have a precise translation/transliteration in these native languages. This makes it difficult to search only in a non-English (native) language because either non-English-speaking users, such as Arabic speakers, are not able to express terminology in their native languages or the concepts need to be expanded using context. This results in mixed queries and documents in the non-English speaking world (the Arabic world in particular). Mixed-language querying is a challenging problem and does not attained major attention in IR community. Current search engines and traditional CLIR systems did not handle mixed-language querying adequately and did not exploit this natural human tendency. This paper attempts to address the problem of mixed querying in CLIR. It proposes mixed-language (language-aware) IR solution, in terms of cross-lingual re-weighting model, in which mixed queries are used to retrieve most relevant documents, regardless of their languages. For the purpose of conducting the experiments, a new multilingual and mixed Arabic-English corpus on the computer science domain is therefore created. Test results showed that the proposed cross-lingual re-weighting model could yield statistically significant better results, with respect to mixed-language queries and it achieved more than 94% of monolingual baseline effectiveness.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Mixed Language Arabic-English Information Retrieval

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Linguistic labour: International hospitality employees’ use of non-English native language in service encounters
David Williamson ... Tingting Chen
Hospitality Insights | VOL. 2
David Williamson, et. al.David Williamson ... Tingting Chen
25 Oct 2018
Hospitality Insights | VOL. 2

Non-native language proficiency may influence the responsiveness of bilingual parents towards young children with autism: A short report.
Jodie Smith ... Alexandra Woollacott
Autism & Developmental Language Impairments | VOL. 5
Jodie Smith, et. al.Jodie Smith ... Alexandra Woollacott
01 Jan 2020
Autism & Developmental Language Impairments | VOL. 5

421 Patterns of Non-English Language Use for Patient Care in a Public Emergency Department
B.R Taira ... S Samra
Annals of Emergency Medicine | VOL. 70
B.R Taira, et. al.B.R Taira ... S Samra
18 Sep 2017
Annals of Emergency Medicine | VOL. 70

Impact of Native Language, English Proficiency, and Language Concordance on Interpersonal Care During Evaluation of Acute Coronary Syndrome.
David S Edelman ... Emily K Romero
Journal of general internal medicine | VOL. 38
David S Edelman, et. al.David S Edelman ... Emily K Romero
20 Sep 2022
Journal of general internal medicine | VOL. 38

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Mixed Language Arabic-English Information Retrieval

Abstract

Talk to us

Similar Papers