Cross-Language Tourism News Retrieval System Using Google Translate API on SEBI Search Engine

H Husni,Zulfi Osman,Arif Muntasa,Sigit Susanto Putro

doi:10.21831/elinvo.v8i1.55851

Abstract

Cross-Language Information Retrieval (CLIR) is responsible for retrieving information stored in a language different from the language of the query provided by the user. Some translation methods commonly used in CLIR are Dictionary, Parallel corpora, Comparable corpora, Machine translator, Ontology, and Transitive-based. The query must be translated to the target language, followed by preprocessing and calculating the similarity between the query and all documents in the corpus. The problem is the time and accuracy of query translation. Moreover, the queries are not written as complete sentences according to certain language rules. Stemming, for example, every language has its own method. Indonesian has basic words and affixes in the form of prefixes, suffixes, infixes, and confixes, while English only has suffixes. Stemming takes a long time in text processing. In the Indonesian search engine (SEBI), the provision of cross-language tourism news retrieval is realized using the Google Translate API, which translates the Query and all documents into English, Porter's stemming technique to convert each term to its general form, and cosine similarity to calculate similarity. This approach can deliver cross-language tourism news instantly while increasing the precision and efficiency of the SEBI search engine, although some improvements are needed to provide a more precise and efficient similarity computation.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Cross-Language Tourism News Retrieval System Using Google Translate API on SEBI Search Engine

Abstract

Talk to us

Similar Papers

More From: Elinvo (Electronics, Informatics, and Vocational Education)

Lead the way for us

Journal: Elinvo (Electronics, Informatics, and Vocational Education)	Publication Date: Jun 18, 2023
License type: CC BY-NC 4.0

Similar Papers

Cross-Language Information Retrieval
Jian-Yun Nie
-
Jian-Yun NieJian-Yun Nie
01 Jan 2009
01 Jan 2009

Sentence Alignment by Means of Cross-Language Information Retrieval
Marta R. ... Rafael E.
-
Marta R., et. al.Marta R. ... Rafael E.
21 Jun 2011
21 Jun 2011

Cross-language information retrieval models based on latent topic models trained with document-aligned comparable corpora
Ivan Vulić ... Marie-Francine Moens
Information Retrieval | VOL. 16
Ivan Vulić, et. al.Ivan Vulić ... Marie-Francine Moens
05 May 2012
Information Retrieval | VOL. 16

Studying machine translation technologies for large-data CLIR tasks: a patent prior-art search case study
Walid Magdy ... Gareth J F Jones
Information Retrieval | VOL. 17
Walid Magdy, et. al.Walid Magdy ... Gareth J F Jones
21 Nov 2013
Information Retrieval | VOL. 17

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Cross-Language Tourism News Retrieval System Using Google Translate API on SEBI Search Engine

Abstract

Talk to us

Similar Papers

More From: Elinvo (Electronics, Informatics, and Vocational Education)