Sentence Clustering Research Articles

BackgroundPharmaceutical companies are increasingly leveraging machine learning techniques to optimize healthcare research, drug development, and medical affairs activities. AI (artificial intelligence) tools such as chatbots, virtual digital assistants, and research tools have been explored to varying degrees of maturity in industries such as consumer goods or software technology. However, there continues to be untapped opportunities within the pharmaceutical industry to employ these technologies for enhanced engagement and education with healthcare professionals (HCPs). Pharmacists, situated at the crossroads of clinical sciences and innovation, have the potential to elevate their role and significance within the pharmaceutical industry by developing and leveraging such technologies.MethodsTo address this, the python-coded tool, Medical Information (MI) Data Uses For AI Semantic Analysis (MUFASA), utilizes state-of-the-art Sentence Transformer library, clustering, and visualization techniques. MUFASA harnesses unsolicited MI data with AI technology, improving efficiency and providing actionable medical affairs intelligence for targeted content delivery to HCPs.ResultsMUFASA optimizes medical affairs activities through its distinctive features: semantic search, cluster analysis, and visualization. Its proficiency in understanding inquiries, as demonstrated through 3D vector mapping and clustering tests, enhances the efficiency of MI and Medical Science Liaison (MSL) case handling. It proves invaluable in training new staff, bolstering response uniformity, and mitigating compliance risks. Leveraging the HDBSCAN algorithm, MUFASA's cluster analysis uncovers deep insights and discerns actionable themes from large inquiry data sets. The visualization graphs, generated from semantic searches, support evidence-based decisions by tracking the effectiveness of initiatives and monitoring trend shifts. Collectively, MUFASA enriches strategic decision-making, cultivates actionable insights, and bolsters healthcare professional engagement.ConclusionThere are numerous opportunities for innovation within the intersection of healthcare and data science. Pharmaceutical manufacturers, with one of their medical affairs responsibilities being the collection of unsolicited inquiries, particularly from HCPs, stand poised to leverage machine learning capabilities to optimize its processes. The abundance of data generated by the growing effort to use it in meaningful ways presents an opportunity for pharmaceutical companies to harness machine learning techniques.

Read full abstract

PurposeIt has been over a year since the first known case of coronavirus disease (COVID-19) emerged, yet the pandemic is far from over. To date, the coronavirus pandemic has infected over eighty million people and has killed more than 1.78 million worldwide. This study aims to explore “how useful is Reddit social media platform to surveil COVID-19 pandemic?” and “how do people’s concerns/behaviors change over the course of COVID-19 pandemic in North Carolina?”. The purpose of this study was to compare people’s thoughts, behavior changes, discussion topics, and the number of confirmed cases and deaths by applying natural language processing (NLP) to COVID-19 related data.MethodsIn this study, we collected COVID-19 related data from 18 subreddits of North Carolina from March to August 2020. Next, we applied methods from natural language processing and machine learning to analyze collected Reddit posts using feature engineering, topic modeling, custom named-entity recognition (NER), and BERT-based (Bidirectional Encoder Representations from Transformers) sentence clustering. Using these methods, we were able to glean people’s responses and their concerns about COVID-19 pandemic in North Carolina.ResultsWe observed a positive change in attitudes towards masks for residents in North Carolina. The high-frequency words in all subreddit corpora for each of the COVID-19 mitigation strategy categories are: Distancing (DIST)—“social distance/distancing”, “lockdown”, and “work from home”; Disinfection (DIT)—“(hand) sanitizer/soap”, “hygiene”, and "wipe"; Personal Protective Equipment (PPE)—“mask/facemask(s)/face shield”, “n95(s)/kn95”, and “cloth/gown”; Symptoms (SYM)—“death”, “flu/influenza”, and “cough/coughed”; Testing (TEST)—“cases”, “(antibody) test”, and “test results (positive/negative)”.ConclusionThe findings in our study show that the use of Reddit data to monitor COVID-19 pandemic in North Carolina (NC) was effective. The study shows the utility of NLP methods (e.g. cosine similarity, Latent Dirichlet Allocation (LDA) topic modeling, custom NER and BERT-based sentence clustering) in discovering the change of the public's concerns/behaviors over the course of COVID-19 pandemic in NC using Reddit data. Moreover, the results show that social media data can be utilized to surveil the epidemic situation in a specific community.

Read full abstract

Sentence Clustering Research Articles

Related Topics

Articles published on Sentence Clustering

Graphs in clusters: a hybrid approach to unsupervised extractive long document summarization using language models

Experimental study on short-text clustering using transformer-based semantic similarity measure

Discourse Coherence in English-Chinese Translation of Literary Texts: A Case Study on Tess of the D'Urbervilles Translated by Zhang Guruo

Unveiling Water Allocation Dynamics: A Text Analysis of 25 Years of Stakeholder Meetings

Explainable text-based features in predictive models of crowdfunding campaigns

Optimizing Text Summarization with Sentence Clustering and Natural Language Processing

Advancing medical affair capabilities and insight generation through machine learning techniques

Graph-Based Extractive Text Summarization Sentence Scoring Scheme for Big Data Applications

TEXT SUMMARIZATION IN MONGOLIAN LANGUAGE

Application and evaluation of sentence embedding and clustering methods in the context of concept hierarchy construction

Informing the development of an outcome set and banks of items to measure mobility among individuals with acquired brain injury using natural language processing.

A natural language processing approach towards harmonisation of European medicinal product information.

Automatic Text Summarization using Document Clustering Named Entity Recognition

Unsupervised Graph-Based Tibetan Multi-Document Summarization

SSC: Clustering Of Turkish Texts By Spectral Graph Partitioning

Health Information Needs of Young Chinese People Based on an Online Health Community: Topic and Statistical Analysis.

SOCIAL MORALITY EDUCATION IN THE REJANG’S CULTURE OF "SERAMBEAK"

Monitoring COVID-19 pandemic through the lens of social media using natural language processing and machine learning.

Analysis of Particular Parameters of a Social Situation of the Development of a Pre-School Aged Child in Conditions of the Family and Pre-School Education Institutions

Research on the Natural Language Recognition Method Based on Cluster Analysis Using Neural Network

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Sentence Clustering Research Articles

Related Topics

Articles published on Sentence Clustering

Graphs in clusters: a hybrid approach to unsupervised extractive long document summarization using language models

Experimental study on short-text clustering using transformer-based semantic similarity measure

Discourse Coherence in English-Chinese Translation of Literary Texts: A Case Study on Tess of the D'Urbervilles Translated by Zhang Guruo

Unveiling Water Allocation Dynamics: A Text Analysis of 25 Years of Stakeholder Meetings

Explainable text-based features in predictive models of crowdfunding campaigns

Optimizing Text Summarization with Sentence Clustering and Natural Language Processing

Advancing medical affair capabilities and insight generation through machine learning techniques

Graph-Based Extractive Text Summarization Sentence Scoring Scheme for Big Data Applications

TEXT SUMMARIZATION IN MONGOLIAN LANGUAGE

Application and evaluation of sentence embedding and clustering methods in the context of concept hierarchy construction

Informing the development of an outcome set and banks of items to measure mobility among individuals with acquired brain injury using natural language processing.

A natural language processing approach towards harmonisation of European medicinal product information.

Automatic Text Summarization using Document Clustering Named Entity Recognition

Unsupervised Graph-Based Tibetan Multi-Document Summarization

SSC: Clustering Of Turkish Texts By Spectral Graph Partitioning

Health Information Needs of Young Chinese People Based on an Online Health Community: Topic and Statistical Analysis.

SOCIAL MORALITY EDUCATION IN THE REJANG’S CULTURE OF "SERAMBEAK"

Monitoring COVID-19 pandemic through the lens of social media using natural language processing and machine learning.

Analysis of Particular Parameters of a Social Situation of the Development of a Pre-School Aged Child in Conditions of the Family and Pre-School Education Institutions

Research on the Natural Language Recognition Method Based on Cluster Analysis Using Neural Network