Mention detection is an important component of a Coreference Resolution (CR) system, in which named, nominal, and pronominal mentions are identified. These mentions can be coreferential or singleton (non-coreferential). Coreferential mentions refer to the same real-world entity as other mentions in the text, whereas singleton mentions occur only once and do not participate in coreference because they are never mentioned again. Filtering out singleton mentions can substantially improve the performance of a CR system. This paper proposes a singleton mention detection module for Hindi text based on a Long Short-Term Memory (LSTM) network and a Fully Connected Network (FCN), which identifies singleton mentions so that they can be filtered out to reduce the search space for CR. A CR system searches the preceding text for earlier references of each mention; removing singletons from the mention list therefore reduces both the search time and the search space. The model uses a few hand-crafted features, context information, and word embeddings from word2vec and a multilingual Bidirectional Encoder Representations from Transformers (mBERT) language model. A coreference-annotated Hindi dataset comprising 3.6K sentences and 78K tokens is used for the task. The singleton mention detection model is analyzed extensively by experimenting with various context window lengths for each mention. Among the window sizes tried (2, 3, 4, 5, etc., as well as all preceding and all following words of each mention), the model performs best with a context window of size two. With this window size, the LSTM-FCN model with mBERT (Word + Context + Syntactic) features achieves a Precision of 63%, a Recall of 71%, and an F-measure of 67% for identifying singleton mentions.
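For illustration, the sketch below shows how an LSTM-FCN singleton classifier of the kind described above might be assembled. It is a minimal sketch under assumptions not stated in the abstract: PyTorch, 768-dimensional mBERT-style embeddings, a placeholder hand-crafted feature size, and a ±2-word context window. The class and variable names are hypothetical, and the paper's actual layer sizes and features may differ.

```python
# Hypothetical sketch of an LSTM-FCN singleton mention classifier.
# Dimensions, names, and feature choices are assumptions for illustration only.
import torch
import torch.nn as nn

class SingletonDetector(nn.Module):
    def __init__(self, emb_dim=768, feat_dim=10, hidden_dim=128):
        super().__init__()
        # Bidirectional LSTM reads the mention plus a +/-2 word context window.
        self.lstm = nn.LSTM(emb_dim, hidden_dim, batch_first=True,
                            bidirectional=True)
        # Fully connected layers combine the LSTM summary with
        # hand-crafted syntactic features.
        self.fcn = nn.Sequential(
            nn.Linear(2 * hidden_dim + feat_dim, 64),
            nn.ReLU(),
            nn.Linear(64, 1),
        )

    def forward(self, window_embs, syntactic_feats):
        # window_embs: (batch, window_len, emb_dim) mBERT/word2vec embeddings
        # syntactic_feats: (batch, feat_dim) hand-crafted feature vector
        _, (h_n, _) = self.lstm(window_embs)
        # Concatenate the final forward and backward hidden states.
        summary = torch.cat([h_n[0], h_n[1]], dim=-1)
        logits = self.fcn(torch.cat([summary, syntactic_feats], dim=-1))
        return torch.sigmoid(logits)  # probability that the mention is a singleton

# Example: a 5-token span (mention word + 2 context words on each side).
model = SingletonDetector()
probs = model(torch.randn(4, 5, 768), torch.randn(4, 10))
```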