Step Size Research Articles

In biodiversity research, the integration of machine learning and data visualization is increasingly important for uncovering valuable insights from academic literature. This study introduces an innovative knowledge graph application, BiodiViz, designed to translate intricate text into intuitive visual representations, fostering a deeper comprehension of biodiversity relationships. BiodiViz uses the top-performing Named Entity Recognition (NER) and Relation Extraction (RE) models to automatically generate a comprehensive knowledge graph for biodiversity research. The NER model extracts and categorizes entities like organisms, phenomena, and habitats, while the RE model identifies relationships such as "have," "occur in," and "influence" from the BiodivNERE dataset (Abdelmageed et al. 2022). These entities and relationships are organized into nodes and edges within a graph. Researchers input text into BiodiViz, producing a visual knowledge graph that simplifies the analysis of complex biodiversity data, reducing manual effort and enhancing efficiency. Named Entity Recognition & Relation Extraction BiodiViz leverages advanced Bidirectional Encoder Representations from Transformers (BERT)-based Large Language Models (LLMs) (Rogers et al. 2020), fine-tuned specifically for NER and RE tasks using the BiodivNERE dataset. The fine-tuning process involved various models, including BERT (Devlin et al. 2019), ELECTRA (Clark et al. 2020), and BiodivBERT (Abdelmageed et al. 2023). These models were evaluated for performance using the results of their F1-score as the main metric, which is the harmonic mean of precision (the proportion of true positive results among all positive predictions) and recall (the proportion of true positive results among all actual positives), with BiodivBERT achieving an F1-score of 77.16% for the NER task, while BERT excelled in the RE task with an F1-score of 81.28%. Rigorous hyperparameter optimization further enhanced the performance of BiodivBERT in the RE task by 3.38%. The BiodivNERE corpora by Abdelmageed et al. (2022) were used to fine-tune several models for NER and RE tasks in the biodiversity domain. The first corpus from the BiodivNERE corpora is BiodivNER, which is a gold standard dataset (manually labelled test corpora) for evaluating NER tasks. The fine-tuning process employed the token classification method from the Hugging Face library (Hugging Face 2023b), which assigns labels to each token in a sequence. Experiments were conducted with a batch size of four, meaning the model processes four examples/rows of data at a time before making an update to improve its learning. This is due to the constraints of the NVIDIA® GeForce RTX™ 3060 graphics processor. (NVIDIA 2024) Model performance was evaluated using the seqeval library (Nakayama 2018), focusing on accuracy, precision, recall, and F1 scores. For text classification, the second corpus, BiodivRE, was utilized, following previous research recommendations to explore fine-tuning settings for BiodivBERT. Hyperparameter optimization (Feurer and Hutter 2019) was conducted using Hugging Face’s Trainer API with an Optuna backend (Hugging Face 2023a), concentrating on learning rate and the number of training epochs (i.e., the number of complete passes through the entire dataset during model training). The BiodiViz Knowledge Graph Application The fine-tuned NER and RE models with the best F1-scores—BiodivBERT and BERT, respectively—were integrated into the knowledge graph application. Fig. 1 illustrates the flowchart of the application pipeline. Each sentence in the input text will go through the NER model to identify and label the entities within the sentence. Subsequently, these labeled entities, together with the original sentence, will be input into the RE model. The RE model will analyze every pair of entities for a potential relation and output the type of relation they share. The application will then utilize this data to create a graph with appropriate labels and color-coding. An example of the application's user interface with the knowledge graph is shown in Fig. 2. This study highlights the practical application of machine learning and data visualization in advancing biodiversity research, emphasizing the importance of developing user-friendly tools to support scientific exploration and discovery. The BiodiViz application, including the code and resources, is available on GitHub*1, providing an accessible tool for biodiversity researchers to streamline their analyses.

Non-small cell lung cancer (NSCLC) is one of the leading causes of cancer mortality worldwide. Immune checkpoint inhibitors (ICIs) have emerged as a crucial treatment option for patients with advanced NSCLC. However, only a subset of patients experience clinical benefit from ICIs. Therefore, identifying biomarkers that can predict response to ICIs is imperative for optimising patient selection. Hematoxylin and eosin (H&E) images of NSCLC patients were obtained from the local cohort (n = 106) and The Cancer Genome Atlas (TCGA) (n = 899). We developed an ICI-related pathological prognostic signature (ir-PPS) based on H&E stained histopathology images to predict prognosis in NSCLC patients treated with ICIs using deep learning. To accomplish this, we employed a modified ResNet model (ResNet18-PG), a widely-used deep learning architecture well-known for its effectiveness in handling complex image recognition tasks. Our modifications include a progressive growing strategy to improve the stability of model training and the use of the AdamW optimiser, which enhances the optimisation process by adjusting the learning rate based on training dynamics. The deep learning model, ResNet18-PG, achieved an area under the receiver operating characteristic curve (AUC) of 0.918 and a recall of 0.995 on the local cohort. The ir-PPS effectively risk-stratified NSCLC patients. Patients in the low-risk group (n = 40) had significantly improved progression-free survival (PFS) after ICI treatment compared to those in the high-risk group (n = 66, log-rank P = 0.004, hazard ratio (HR) = 3.65, 95%CI: 1.75-7.60). The ir-PPS demonstrated good discriminatory power for predicting 6-month PFS (AUC = 0.750), 12-month PFS (AUC = 0.677), and 18-month PFS (AUC = 0.662). The low-risk group exhibited increased expression of immune checkpoint molecules, cytotoxicity-related genes, an elevated abundance of tumour-infiltrating lymphocytes, and enhanced activity in immune stimulatory pathways. The ir-PPS signature derived from H&E images using deep learning could predict ICIs prognosis in NSCLC patients. The ir-PPS provides a novel imaging biomarker that may help select optimal candidates for ICIs therapy in NSCLC.

Step Size Research Articles

Related Topics

Articles published on Step Size

Plant Disease Detection Using Machine Learning

BiodiViz: Leveraging NER and RE for Automated Knowledge Graph Generation in Biodiversity Research

Adaptive time step selection for spectral deferred correction

Influence of Physical Characteristics of Obstacles on the Locomotor Pattern of Older Adults at Higher Risk of Falling

Photovoltaic Maximum Power Point Tracking Technology Based on Power Prediction Algorithm Combined with Variable Step Length Disturbance Observation Method

The Development Mask R-CNN Model for Identification of Melon Plant Leaves and Branches

Internal Combustion Engine Fault Detection Based on Random Convolutional Neural Networks

Deep Convolutional Neural Network for Accurate Classification of Myofibroblastic Lesions on Patch-Based Images.

High quality and sensitivity phononic crystal channel drop filter to detect ethyl lactate in mixtures of ethyl lactate and 2-butoxy ethanol

Matrix Factorization and Prediction for High-Dimensional Co-Occurrence Count Data via Shared Parameter Alternating Zero Inflated Gamma Model

The effect of functional exercise program on physical functioning in older adults aged 60 years or more: A systematic review and meta-analysis of randomized controlled trials

Indetermsoft-Set-Based D* Extra Lite Framework for Resource Provisioning in Cloud Computing

Neo-Hookean modeling of nonlinear coupled behavior in circular plates supported by micro-pillars

Dynamic step opposition-based learning sparrow search algorithm for UAV path planning

A multi-objective indirect neural adaptive processes control design for minimization of energy consumption: An experimental validation on a transesterification reactor

Deep learning analysis of histopathological images predicts immunotherapy prognosis and reveals tumour microenvironment features in non-small cell lung cancer.

Optimizing rehabilitation strategies in Parkinson’s disease: a comparison of dual cognitive-walking treadmill training and single treadmill training

Exploring age-related differences in the relationship between spatial and temporal contributions to step length asymmetry during split-belt adaptation.

6G Self‐Evolution Network for IoT Using Rainbow Deep Q‐Network Based on Decision‐Making

Multiphasic movement and step-selection patterns of dispersed tigers in the central Indian landscape.

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Step Size Research Articles

Related Topics

Articles published on Step Size

Plant Disease Detection Using Machine Learning

BiodiViz: Leveraging NER and RE for Automated Knowledge Graph Generation in Biodiversity Research

Adaptive time step selection for spectral deferred correction

Influence of Physical Characteristics of Obstacles on the Locomotor Pattern of Older Adults at Higher Risk of Falling

Photovoltaic Maximum Power Point Tracking Technology Based on Power Prediction Algorithm Combined with Variable Step Length Disturbance Observation Method

The Development Mask R-CNN Model for Identification of Melon Plant Leaves and Branches

Internal Combustion Engine Fault Detection Based on Random Convolutional Neural Networks

Deep Convolutional Neural Network for Accurate Classification of Myofibroblastic Lesions on Patch-Based Images.

High quality and sensitivity phononic crystal channel drop filter to detect ethyl lactate in mixtures of ethyl lactate and 2-butoxy ethanol

Matrix Factorization and Prediction for High-Dimensional Co-Occurrence Count Data via Shared Parameter Alternating Zero Inflated Gamma Model

The effect of functional exercise program on physical functioning in older adults aged 60 years or more: A systematic review and meta-analysis of randomized controlled trials

Indetermsoft-Set-Based D* Extra Lite Framework for Resource Provisioning in Cloud Computing

Neo-Hookean modeling of nonlinear coupled behavior in circular plates supported by micro-pillars

Dynamic step opposition-based learning sparrow search algorithm for UAV path planning

A multi-objective indirect neural adaptive processes control design for minimization of energy consumption: An experimental validation on a transesterification reactor

Deep learning analysis of histopathological images predicts immunotherapy prognosis and reveals tumour microenvironment features in non-small cell lung cancer.

Optimizing rehabilitation strategies in Parkinson’s disease: a comparison of dual cognitive-walking treadmill training and single treadmill training

Exploring age-related differences in the relationship between spatial and temporal contributions to step length asymmetry during split-belt adaptation.

6G Self‐Evolution Network for IoT Using Rainbow Deep Q‐Network Based on Decision‐Making

Multiphasic movement and step-selection patterns of dispersed tigers in the central Indian landscape.