Macro F1 Score Research Articles

BackgroundSocial and behavioral determinants of health (SBDH) are associated with a variety of health and utilization outcomes, yet these factors are not routinely documented in the structured fields of electronic health records (EHR). The objective of this study was to evaluate different machine learning approaches for detection of SBDH from the unstructured clinical notes in the EHR.MethodsLatent Semantic Indexing (LSI) was applied to 2,083,180 clinical notes corresponding to 46,146 patients in the MIMIC-III dataset. Using LSI, patients were ranked based on conceptual relevance to a set of keywords (lexicons) pertaining to 15 different SBDH categories. For Generative Pretrained Transformer (GPT) models, API requests were made with a Python script to connect to the OpenAI services in Azure, using gpt-3.5-turbo-1106 and gpt-4-1106-preview models. Prediction of SBDH categories were performed using a logistic regression model that included age, gender, race and SBDH ICD-9 codes.ResultsLSI retrieved patients according to 15 SBDH domains, with an overall average PPV ≥\\documentclass[12pt]{minimal} \\usepackage{amsmath} \\usepackage{wasysym} \\usepackage{amsfonts} \\usepackage{amssymb} \\usepackage{amsbsy} \\usepackage{mathrsfs} \\usepackage{upgreek} \\setlength{\\oddsidemargin}{-69pt} \\begin{document}$$\\ge$$\\end{document} 83%. Using manually curated gold standard (GS) sets for nine SBDH categories, the macro-F1 score of LSI (0.74) was better than ICD-9 (0.71) and GPT-3.5 (0.54), but lower than GPT-4 (0.80). Due to document size limitations, only a subset of the GS cases could be processed by GPT-3.5 (55.8%) and GPT-4 (94.2%), compared to LSI (100%). Using common GS subsets for nine different SBDH categories, the macro-F1 of ICD-9 combined with either LSI (mean 0.88, 95% CI 0.82-0.93), GPT-3.5 (0.86, 0.82-0.91) or GPT-4 (0.88, 0.83-0.94) was not significantly different. After including age, gender, race and ICD-9 in a logistic regression model, the AUC for prediction of six out of the nine SBDH categories was higher for LSI compared to GPT-4.0.ConclusionsThese results demonstrate that the LSI approach performs comparable to more recent large language models, such as GPT-3.5 and GPT-4.0, when using the same set of documents. Importantly, LSI is robust, deterministic, and does not have document-size limitations or cost implications, which make it more amenable to real-world applications in health systems.

ABSTRACTHistopathology, vital in diagnosing medical conditions, especially in cancer research, relies on analyzing histopathology images (HIs). Nuclei segmentation, a key task, involves precisely identifying cell nuclei boundaries. Manual segmentation by pathologists is time‐consuming, prompting the need for robust automated methods. Challenges in segmentation arise from HI complexities, necessitating advanced techniques. Recent advancements in deep learning, particularly Convolutional Neural Networks (CNNs), have transformed nuclei segmentation. This study emphasizes feature extraction, introducing the ConvNext Mixer‐based Encoder‐Decoder (CNM‐ED) model. Unlike traditional CNN based models, the proposed CNM‐ED model enables the extraction of spatial and long context features to address the inherent complexities of histopathology images. This method leverages a multi‐path strategy using a traditional CNN architecture as well as different paths focused on obtaining customized long context features using the ConvNext Mixer block structure that combines ConvMixer and ConvNext blocks. The fusion of these diverse features in the final segmentation output enables improved accuracy and performance, surpassing existing state‐of‐the‐art segmentation models. Moreover, our multi‐level feature extraction strategy is more effective than models using self‐attention mechanisms such as SwinUnet and TransUnet, which have been frequently used in recent years. Experimental studies were conducted using five different datasets (TNBC, MoNuSeg, CoNSeP, CPM17, and CryoNuSeg) to analyze the performance of the proposed CNM‐ED model. Comparisons were made with various CNN based models in the literature using evaluation metrics such as accuracy, AJI, macro F1 score, macro intersection over union, macro precision, and macro recall. It was observed that the proposed CNM‐ED model achieved highly successful results across all metrics. Through comparisons with state‐art‐of models from the literature, the proposed CNM‐ED model stands out as a promising advancement in nuclei segmentation, addressing the intricacies of histopathological images. The model demonstrates enhanced diagnostic capabilities and holds the potential for significant progress in medical research.

Macro F1 Score Research Articles

Articles published on Macro F1 Score

Enhancing Network Threat Detection with Random Forest-Based NIDS and Permutation Feature Importance

Classification of Corn Diseases and Pests Using Fuzzy Naïve Bayes Method

Identify devices and events from non-IP heterogeneous IoT network traffic

GASCOM: Graph-based Attentive Semantic Context Modeling for Online Conversation Understanding

A Comparative Study of Transformer-based Models for Hate-Speech Detection in English-Kiswahili Code-Switched Social Media Text

Large-scale identification of social and behavioral determinants of health from clinical notes: comparison of Latent Semantic Indexing and Generative Pretrained Transformer (GPT) models

Revealing the impact of social circumstances on the selection of cancer therapy through natural language processing of social work notes.

MeSHelper: Predicting the evolution of Medical Subject Headings based on knowledge graph dynamics

Unravelling sleep patterns: Supervised contrastive learning with self-attention for sleep stage classification

Text mining approach for feature extraction and cartilage disease grade classification using knee MRI radiology reports

CARROT: Simultaneous prediction of anomalies from groups of correlated cryptocurrency trends

Automatic sleep staging based on 24/7 EEG SubQ (UNEEG medical) data displays strong agreement with polysomnography in healthy adults

Text Analytics on YouTube Comments for Food Products

Electro-Stimulation System with Artificial-Intelligence-Based Auricular-Triggered Algorithm to Support Facial Movements in Peripheral Facial Palsy: A Simulation Pilot Study.

Multimodal reaching-position prediction for ADL support using neural networks

A novel method for rice identification: Coupling Raman spectroscopy with Fourier spectrum and analyzing with deep learning

Efficient diagnostic classification of diverse pathologies through contextual eye movement data analysis with a novel hybrid architecture

Can GPT-3.5 generate and code discharge summaries?

Detecting hate crimes through machine learning and natural language processing

ConvNext Mixer‐Based Encoder Decoder Method for Nuclei Segmentation in Histopathology Images

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Macro F1 Score Research Articles

Articles published on Macro F1 Score

Enhancing Network Threat Detection with Random Forest-Based NIDS and Permutation Feature Importance

Classification of Corn Diseases and Pests Using Fuzzy Naïve Bayes Method

Identify devices and events from non-IP heterogeneous IoT network traffic

GASCOM: Graph-based Attentive Semantic Context Modeling for Online Conversation Understanding

A Comparative Study of Transformer-based Models for Hate-Speech Detection in English-Kiswahili Code-Switched Social Media Text

Large-scale identification of social and behavioral determinants of health from clinical notes: comparison of Latent Semantic Indexing and Generative Pretrained Transformer (GPT) models

Revealing the impact of social circumstances on the selection of cancer therapy through natural language processing of social work notes.

MeSHelper: Predicting the evolution of Medical Subject Headings based on knowledge graph dynamics

Unravelling sleep patterns: Supervised contrastive learning with self-attention for sleep stage classification

Text mining approach for feature extraction and cartilage disease grade classification using knee MRI radiology reports

CARROT: Simultaneous prediction of anomalies from groups of correlated cryptocurrency trends

Automatic sleep staging based on 24/7 EEG SubQ (UNEEG medical) data displays strong agreement with polysomnography in healthy adults

Text Analytics on YouTube Comments for Food Products

Electro-Stimulation System with Artificial-Intelligence-Based Auricular-Triggered Algorithm to Support Facial Movements in Peripheral Facial Palsy: A Simulation Pilot Study.

Multimodal reaching-position prediction for ADL support using neural networks

A novel method for rice identification: Coupling Raman spectroscopy with Fourier spectrum and analyzing with deep learning

Efficient diagnostic classification of diverse pathologies through contextual eye movement data analysis with a novel hybrid architecture

Can GPT-3.5 generate and code discharge summaries?

Detecting hate crimes through machine learning and natural language processing

ConvNext Mixer‐Based Encoder Decoder Method for Nuclei Segmentation in Histopathology Images