Bilingual Evaluation Understudy Research Articles

Image captioning refers to the process of generating a textual description that describes objects and activities present in a given image. It connects two fields of artificial intelligence, computer vision, and natural language processing. Computer vision and natural language processing deal with image understanding and language modeling, respectively. In the existing literature, most of the works have been carried out for image captioning in the English language. This article presents a novel method for image captioning in the Hindi language using encoder–decoder based deep learning architecture with efficient channel attention. The key contribution of this work is the deployment of an efficient channel attention mechanism with bahdanau attention and a gated recurrent unit for developing an image captioning model in the Hindi language. Color images usually consist of three channels, namely red, green, and blue. The channel attention mechanism focuses on an image’s important channel while performing the convolution, which is basically to assign higher importance to specific channels over others. The channel attention mechanism has been shown to have great potential for improving the efficiency of deep convolution neural networks (CNNs). The proposed encoder–decoder architecture utilizes the recently introduced ECA-NET CNN to integrate the channel attention mechanism. Hindi is the fourth most spoken language globally, widely spoken in India and South Asia; it is India’s official language. By translating the well-known MSCOCO dataset from English to Hindi, a dataset for image captioning in Hindi is manually created. The efficiency of the proposed method is compared with other baselines in terms of Bilingual Evaluation Understudy (BLEU) scores, and the results obtained illustrate that the method proposed outperforms other baselines. The proposed method has attained improvements of 0.59%, 2.51%, 4.38%, and 3.30% in terms of BLEU-1, BLEU-2, BLEU-3, and BLEU-4 scores, respectively, with respect to the state-of-the-art. Qualities of the generated captions are further assessed manually in terms of adequacy and fluency to illustrate the proposed method’s efficacy.

Spam detection frequently categorizes product reviews as spam and non-spam. The spam reviews may contain texts of fake reviews and non-review statements describing unrelated things about products. Most of the publicly available spam reviews are labelled as fake reviews, while non-spam texts that are not fake reviews could contain non-review statements. It is crucial to notice those non-review statements since they convey misperception to consumers. Non-review statements are hardly found, and those statements of large and long texts often need to be manually labelled, which is time-consuming. Because of the rareness in finding non-review statements, there is an imbalanced condition between non-spam as a major class and spam that consists of the non-review statement as a minor class. Augmenting fake reviews to add spam texts is ineffective because they have similar content to non-spam such as some opinion words of product features. Thus, the text generation of non-review statements is preferable for adding spam texts. Some text generation issues are the frequent neural network-based methods require much learning data, and the existing pre-trained models produce texts with different contexts to non-review statements. The augmented texts should have similar content and context represented by the structure of the non-review statement. Therefore, we propose a text generation model with content and structure-based preprocessing to produce non-review statements, which is expected to overcome imbalanced data and give better spam detection results in product reviews. Structure-based preprocessing identifies the feature structures of non-opinion words from part-of-speech tags. Those features represent the context of spam reviews in unlabeled texts. Then, content-based preprocessing appoints selected topic modeling results of non-review statements from fake reviews. Our experiments resulted an improvement on the metric value of ± 0.04, called as BLEU (Bi-Lingual Evaluation Understudy) score, for the correspondence evaluation between generated and trained texts. The metric value indicates that the generated texts are not quite identical to the trained texts of non-review statements. However, those additional texts combined with the original spam texts gave better spam detection results with an increasing value of more than 40% on average recall score.

Bilingual Evaluation Understudy Research Articles

Articles published on Bilingual Evaluation Understudy

Efficient Channel Attention Based Encoder–Decoder Approach for Image Captioning in Hindi

Attention-Guided Image Captioning through Word Information.

A deep learning‐based image captioning method to automatically generate comprehensive explanations of bridge damage

Machine Translation in Low-Resource Languages by an Adversarial Neural Network

Low Resource Neural Machine Translation: Assamese to/from Other Indo-Aryan (Indic) Languages

Bangla↔English Machine Translation Using Attention-based Multi-Headed Transformer Model

Named Entity Correction in Neural Machine Translation Using the Attention Alignment Map

Minang and Indonesian Pharase-Based Statistical Machine Translation

Are synthetic clinical notes useful for real natural language processing tasks: A case study on clinical entity recognition.

Cadlaws – An English–French Parallel Corpus of Legally Equivalent Documents

Optimization of paraphrase generation and identification using language models in natural language processing

POS-Tagging based Neural Machine Translation System for European Languages using Transformers

Research and Implementation of Chinese Couplet Generation System With Attention-Based Transformer Mechanism

Machine translation using deep learning for universal networking language based on their structure

Phrase Based Statistical Machine Translation Javanese-Indonesian

Statistical Machine Translation Dayak Language – Indonesia Language

Indian Sign Language Generation System

Text Generation with Content and Structure-Based Preprocessing in Imbalanced Data of Product Review

Phrase Table Combination Based on Symmetrization of Word Alignment for Low-Resource Languages

Distributional discrepancy: A metric for unconditional text generation

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Bilingual Evaluation Understudy Research Articles

Articles published on Bilingual Evaluation Understudy

Efficient Channel Attention Based Encoder–Decoder Approach for Image Captioning in Hindi

Attention-Guided Image Captioning through Word Information.

A deep learning‐based image captioning method to automatically generate comprehensive explanations of bridge damage

Machine Translation in Low-Resource Languages by an Adversarial Neural Network

Low Resource Neural Machine Translation: Assamese to/from Other Indo-Aryan (Indic) Languages

Bangla↔English Machine Translation Using Attention-based Multi-Headed Transformer Model

Named Entity Correction in Neural Machine Translation Using the Attention Alignment Map

Minang and Indonesian Pharase-Based Statistical Machine Translation

Are synthetic clinical notes useful for real natural language processing tasks: A case study on clinical entity recognition.

Cadlaws – An English–French Parallel Corpus of Legally Equivalent Documents

Optimization of paraphrase generation and identification using language models in natural language processing

POS-Tagging based Neural Machine Translation System for European Languages using Transformers

Research and Implementation of Chinese Couplet Generation System With Attention-Based Transformer Mechanism

Machine translation using deep learning for universal networking language based on their structure

Phrase Based Statistical Machine Translation Javanese-Indonesian

Statistical Machine Translation Dayak Language – Indonesia Language

Indian Sign Language Generation System

Text Generation with Content and Structure-Based Preprocessing in Imbalanced Data of Product Review

Phrase Table Combination Based on Symmetrization of Word Alignment for Low-Resource Languages

Distributional discrepancy: A metric for unconditional text generation