Outlier Detection Rules Research Articles

Background: DNA microarray technology allows researchers to measure the expression levels of thousands of genes simultaneously. The main objective of microarray gene expression (GE) data analysis is to detect biomarker genes that are Differentially Expressed (DE) between two or more experimental groups/conditions. Objective: There are some popular statistical methods in the literature for the selection of biomarker genes. However, most of them often produce misleading results in presence of outliers. Therefore, in this study, we introduce a robust approach to overcome the problems of classical methods. Methods: We use median and median absolute deviation (MAD) for our robust procedure. In this procedure, a gene was considered as outlying gene if at least one of the expressions of this gene does not belong to a certain interval of the proposed outlier detection rule. Otherwise, this gene was considered as a non-outlying gene. Results: We investigate the performance of the proposed method in a comparison of the traditional method using both simulated and real gene expression data analysis. From a real colon cancer gene expression data analysis, the proposed method detected an additional fourteen (14) DE genes that were not detected by the traditional methods. Using the Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analysis, we observed that these additional 14 DE genes are involved in three important metabolic pathways of cancer disease. The proposed method also detected nine (9) additional DE genes from another head-and-neck cancer gene expression data analysis; those involved in top ten metabolic pathways obtain from the KEGG pathway database. Conclusion: The simulation as well as real cancer gene expression datasets results show better performance with our proposed procedure. Therefore, the additional genes detected by the proposed procedure require further wet lab validation.

Read full abstract

BackgroundMitochondrial DNA is an ideal source of information to conduct evolutionary and phylogenetic studies due to its extraordinary properties and abundance. Many insights can be gained from these, including but not limited to screening genetic variation to identify potentially deleterious mutations. However, such advances require efficient solutions to very difficult computational problems, a need that is hampered by the very plenty of data that confers strength to the analysis.ResultsWe develop a systematic, automated methodology to overcome these difficulties, building from readily available, public sequence databases to high-quality alignments and phylogenetic trees. Within each stage in an autonomous workflow, outputs are carefully evaluated and outlier detection rules defined to integrate expert knowledge and automated curation, hence avoiding the manual bottleneck found in past approaches to the problem. Using these techniques, we have performed exhaustive updates to the human mitochondrial phylogeny, illustrating the power and computational scalability of our approach, and we have conducted some initial analyses on the resulting phylogenies.ConclusionsThe problem at hand demands careful definition of inputs and adequate algorithmic treatment for its solutions to be realistic and useful. It is possible to define formal rules to address the former requirement by refining inputs directly and through their combination as outputs, and the latter are also of help to ascertain the performance of chosen algorithms. Rules can exploit known or inferred properties of datasets to simplify inputs through partitioning, therefore cutting computational costs and affording work on rapidly growing, otherwise intractable datasets. Although expert guidance may be necessary to assist the learning process, low-risk results can be fully automated and have proved themselves convenient and valuable.

Read full abstract

Outlier Detection Rules Research Articles

Related Topics

Articles published on Outlier Detection Rules

Iterative outlier detection and refinement rule of compensation for phase aberrations in digital holographic microscopy.

Robust estimation for bivariate integer-valued autoregressive models based on minimum density power divergence

Analysis of outlier detection rules based on the ASHRAE global thermal comfort database

Robust Fitting of a Wrapped Normal Model to Multivariate Circular Data and Outlier Detection

Outlier Detection Based on Multivariable Panel Data and K‐Means Clustering for Dam Deformation Monitoring Data

Performance Improvement of Gene Selection Methods using Outlier Modification Rule

Weighted likelihood estimation of multivariate location and scatter

ICS for multivariate outlier detection with application to quality control

Multiple outliers detection in sparse high-dimensional regression

To Remove or not to Remove: the Impact of Outlier Handling on Significance Testing in Testosterone Data

Comments on: Robust estimation of multivariate location and scatter in the presence of cellwise and casewise contamination

Boxplot-Based Outlier Detection for the Location-Scale Family

Resistant estimators in Poisson and Gamma models with missing responses and an application to outlier detection

Adjusted functional boxplots for spatio‐temporal data visualization and outlier detection

Rebooting the human mitochondrial phylogeny: an automated and scalable methodology with expert knowledge.

Adaptive trimmed t‐statistics for identifying predominantly high expression in a microarray experiment

Error rates for multivariate outlier detection

Outlier detection for skewed data

Statistical algorithm for assuring similar efficiency in standards and samples for absolute quantification by real-time reverse transcription polymerase chain reaction

Outlier detection and trimmed means for poisson data

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Outlier Detection Rules Research Articles

Related Topics

Articles published on Outlier Detection Rules

Iterative outlier detection and refinement rule of compensation for phase aberrations in digital holographic microscopy.

Robust estimation for bivariate integer-valued autoregressive models based on minimum density power divergence

Analysis of outlier detection rules based on the ASHRAE global thermal comfort database

Robust Fitting of a Wrapped Normal Model to Multivariate Circular Data and Outlier Detection

Outlier Detection Based on Multivariable Panel Data and K‐Means Clustering for Dam Deformation Monitoring Data

Performance Improvement of Gene Selection Methods using Outlier Modification Rule

Weighted likelihood estimation of multivariate location and scatter

ICS for multivariate outlier detection with application to quality control

Multiple outliers detection in sparse high-dimensional regression

To Remove or not to Remove: the Impact of Outlier Handling on Significance Testing in Testosterone Data

Comments on: Robust estimation of multivariate location and scatter in the presence of cellwise and casewise contamination

Boxplot-Based Outlier Detection for the Location-Scale Family

Resistant estimators in Poisson and Gamma models with missing responses and an application to outlier detection

Adjusted functional boxplots for spatio‐temporal data visualization and outlier detection

Rebooting the human mitochondrial phylogeny: an automated and scalable methodology with expert knowledge.

Adaptive trimmed t‐statistics for identifying predominantly high expression in a microarray experiment

Error rates for multivariate outlier detection

Outlier detection for skewed data

Statistical algorithm for assuring similar efficiency in standards and samples for absolute quantification by real-time reverse transcription polymerase chain reaction

Outlier detection and trimmed means for poisson data