Increase In F1 Score Research Articles

BackgroundCircular RNA (circRNA), a class of RNA molecule with a loop structure, has recently attracted researchers due to its diverse biological functions and potential biomarkers of human diseases. Most of the current circRNA detection methods from RNA-sequencing (RNA-Seq) data utilize the mapping information of paired-end (PE) reads to eliminate false positives. However, much of the practical RNA-Seq data such as cross-linking immunoprecipitation sequencing (CLIP-Seq) data usually contain single-end (SE) reads. It is not clear how well these tools perform on SE RNA-Seq data.ResultsIn this study, we present a systematic evaluation of six advanced RNA-based methods and two CLIP-Seq based methods for detecting circRNAs from SE RNA-Seq data. The performances of the methods are rigorously assessed based on precision, sensitivity, F1 score, and true discovery rate. We investigate the impacts of read length, false positive ratio, sequencing depth and PE mapping information on the performances of the methods using simulated SE RNA-Seq simulated datasets. The real datasets used in this study consist of four experimental RNA-Seq datasets with ≥100bp read length and 124 CLIP-Seq samples from 45 studies that contain mostly short-read (≤50bp) RNA-Seq data. The simulation study shows that the sensitivities of most of the methods can be improved by increasing either read length or sequencing depth, and that the levels of false positive rates significantly affect the precision of all methods. Furthermore, the PE mapping information can improve the method’s precision but can not always guarantee the increase of F1 score. Overall, no method is dominant for all SE RNA-Seq data. The RNA-based methods perform better for the long-read datasets but are worse for the short-read datasets. In contrast, the CLIP-Seq based methods outperform the RNA-Seq based methods for all the short-read samples. Combining the results of these methods can significantly improve precision in the CLIP-Seq data.ConclusionsThe results provide a systematic evaluation of circRNA detection methods on SE RNA-Seq data that would facilitate researchers’ strategies in circRNA analysis.

Read full abstract

ABSTRACT Spartina alterniflora has become one of the top exotic invasive plants in coastal China. Accurate and timely mapping of coastal wetlands is critical to understand the spatiotemporal dynamics of S. alterniflora invasion and native species degradation. Due to the humid weather and frequent cloud cover in coastal wetlands, pixel-based classifications based on time-series satellite observations have been increasingly utilized to mitigate this problem. However, the spatial and temporal variations in valid observations might affect pixel-based classification accuracies, while this issue has been paid little attention in previous research. In this paper, we conducted annual mapping of the Yellow River Delta (YRD) wetland from 2008 to 2019 based on time-series Landsat 5/7/8 images using harmonic regression analysis and random forest classification on Google Earth Engine. We further analysed how valid observations and temporal distribution of observations affected the classification accuracy. Our results showed that the overall classification accuracies ranged from 87.25% in 2012 to 92.36% in 2018. The mapping results showed that S. alterniflora expanded from 39.91 ha to 4672.38 ha from 2008 to 2019, encroaching seagrass beds with 902.32 ha and encroaching S.salsa and bare flat with 3730.15 ha. This is the first time that maps of coastal wetlands of YRD are produced at an annual time step. Our analysis showed that the number of valid observations affected classification accuracies both temporally and spatially. Years with higher density of observations witnessed higher overall accuracies compared to the years with fewer observations. Pixels with more observations had a higher chance to be correctly classified compared to the pixels with fewer observations. Harmonic regression features helped improve classification accuracies (increase in overall accuracies from 0.15% to 3.32%), especially for vegetation types (increase in F-score from 0.19% to 3.86%). Greater number of valid observations enhances the importance of harmonic regression features in classification. The critical months identified were March, July, and October, and the combinations of these months achieved better accuracies (91.24%) than using all other observations (89.68%) in 2018, suggesting that temporal distributions of observations could be more important than the number of observations for harmonic-based classification of coastal wetlands such as YRD.

Read full abstract

Increase In F1 Score Research Articles

Related Topics

Articles published on Increase In F1 Score

Evaluation of methods to detect circular RNAs from single-end RNA-sequencing data

Incorporating uncertainty in learning to defer algorithms for safe computer-aided diagnosis

Tree-KGQA: An Unsupervised Approach for Question Answering Over Knowledge Graphs

Improve Representation for Cross-Language Clone Detection by Pretrain Using Tree Autoencoder

GCS-YOLOV4-Tiny: A lightweight group convolution network for multi-stage fruit detection.

IEPet: A Lightweight Multiscale Infrared Environmental Perception Network

Mapping coastal wetlands in the Yellow River Delta, China during 2008–2019: impacts of valid observations, harmonic regression, and critical months

A Deep Learning Enabled Multi-Class Plant Disease Detection Model Based on Computer Vision

Multimodal Classification of Parkinson's Disease in Home Environments with Resiliency to Missing Modalities.

Systematic Evaluation of Design Choices for Deep Facial Action Coding Across Pose

Feature-based evidential reasoning for probabilistic risk analysis and prediction

SelfHAR

Improved Influenza Diagnostics through Thermal Contrast Amplification.

The Evolution of Language Models Applied to Emotion Analysis of Arabic Tweets

Re-Ranking System with BERT for Biomedical Concept Normalization

LCBPA: An Enhanced Deep Neural Network-Oriented Bug Prioritization and Assignment Technique Using Content-Based Filtering

Abstract 13797: Enabling Advanced Real-world Evidence in Heart Failure: A Pilot Study Defining Preferred Approaches to Electronic Health Record Data Use

Speeded-up Convolution Neural Network for classification tasks using multiscale 2-dimensional decomposition

An ontology-based learning approach for automatically classifying security requirements

TDP: Two-dimensional perceptron for image recognition

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Increase In F1 Score Research Articles

Related Topics

Articles published on Increase In F1 Score

Evaluation of methods to detect circular RNAs from single-end RNA-sequencing data

Incorporating uncertainty in learning to defer algorithms for safe computer-aided diagnosis

Tree-KGQA: An Unsupervised Approach for Question Answering Over Knowledge Graphs

Improve Representation for Cross-Language Clone Detection by Pretrain Using Tree Autoencoder

GCS-YOLOV4-Tiny: A lightweight group convolution network for multi-stage fruit detection.

IEPet: A Lightweight Multiscale Infrared Environmental Perception Network

Mapping coastal wetlands in the Yellow River Delta, China during 2008–2019: impacts of valid observations, harmonic regression, and critical months

A Deep Learning Enabled Multi-Class Plant Disease Detection Model Based on Computer Vision

Multimodal Classification of Parkinson's Disease in Home Environments with Resiliency to Missing Modalities.

Systematic Evaluation of Design Choices for Deep Facial Action Coding Across Pose

Feature-based evidential reasoning for probabilistic risk analysis and prediction

SelfHAR

Improved Influenza Diagnostics through Thermal Contrast Amplification.

The Evolution of Language Models Applied to Emotion Analysis of Arabic Tweets

Re-Ranking System with BERT for Biomedical Concept Normalization

LCBPA: An Enhanced Deep Neural Network-Oriented Bug Prioritization and Assignment Technique Using Content-Based Filtering

Abstract 13797: Enabling Advanced Real-world Evidence in Heart Failure: A Pilot Study Defining Preferred Approaches to Electronic Health Record Data Use

Speeded-up Convolution Neural Network for classification tasks using multiscale 2-dimensional decomposition

An ontology-based learning approach for automatically classifying security requirements

TDP: Two-dimensional perceptron for image recognition