Advanced Statistical Algorithms Research Articles

Objective: The objective of this study is to present a novel framework, termed the knockoff technique, to evaluate different metric ranking algorithms to better describe human response to injury.

Methods: Many biomechanical metrics are routinely obtained from impact tests using postmortem human surrogates (PMHS) to develop injury risk curves (IRCs). The IRCs form the basis for evaluating human safety in crashworthiness environments. The biomechanical metrics should be chosen based on some measure of their predictive ability. Commonly used algorithms for selecting and ranking the metrics include (a) the area under the receiver operating characteristic curve (AUROC), time-varying AUROC, and other adaptations, and (b) variants of predictive squared error loss. This article develops a rigorous framework to evaluate the metric selection/ranking algorithms. Actual experimental data are used because of the shortcomings of simulated data. The knockoff data are meshed into existing experimental data using advanced statistical algorithms. Error rate measures such as false discovery rates (FDRs) and bias are calculated using the knockoff technique. Experimental data are used from previously published whole-body PMHS side impact sled tests. The experiments were conducted at different velocities, with padded and rigid load wall conditions, with offsets, and with different supplemental restraint systems. The PMHS specimens were subjected to a single lateral impact loading resulting in injury and noninjury outcomes.

Results: A total of 25 metrics were used from 42 tests. The AUROC-type algorithms tended to have higher FDRs than the squared error loss–type functions (45.3% for the best AUROC-type algorithm versus 31.4% for the best Brier score algorithm). Standard errors for the Brier score algorithm also tended to be lower, indicative of more stable metric choices and robust rankings. The wide variations observed in the performance of the algorithms demonstrated the need for data set–specific evaluation tools such as the knockoff technique developed in this study.

Conclusions: In the present data set, the AUROCs and related binary classification algorithms led to inflated FDRs, rendering metric selection/ranking questionable. This is particularly true for data sets with a high proportion of censoring. Squared error loss–type algorithms (such as the Brier score algorithm or its modifications) improved the performance of the metric selection process. The new knockoff technique presented here may wholly change how IRCs are developed from impact experiments or simulations. At the very least, the knockoff technique demonstrates the need for evaluations among different metric ranking/selection algorithms, especially when they produce substantially different biomechanical metric choices. Without recommending the AUROC-type or Brier score–type algorithms universally, the authors suggest careful assessment of these algorithms using the proposed framework, so that a robust algorithm may be chosen with respect to the nature of the experimental data set. Though results are given for data sets from a published series of experiments, the knockoff technique is being used by the authors in tests that are applicable to the automotive, aviation, military, and other environments.
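The general idea of adding uninformative "knockoff" variables to real data and counting how often a ranking algorithm selects them can be sketched in a few lines. The sketch below is not the authors' procedure: it assumes knockoffs are made by simply permuting each metric across tests, fits a one-variable logistic model in-sample for the Brier score, and runs on synthetic toy data. All function and variable names are hypothetical.

```python
import math
import random

def auroc(scores, labels):
    """Chance that an injured case scores above a noninjured one (ties count 0.5)."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

def brier_logistic(scores, labels, steps=200, lr=0.5):
    """In-sample Brier score of a one-variable logistic model (plain gradient descent)."""
    n = len(scores)
    mu = sum(scores) / n
    sd = math.sqrt(sum((s - mu) ** 2 for s in scores) / n) or 1.0
    z = [(s - mu) / sd for s in scores]  # standardize so one learning rate works
    a = b = 0.0
    for _ in range(steps):
        p = [1 / (1 + math.exp(-(a + b * zi))) for zi in z]
        a -= lr * sum(pi - y for pi, y in zip(p, labels)) / n
        b -= lr * sum((pi - y) * zi for pi, y, zi in zip(p, labels, z)) / n
    p = [1 / (1 + math.exp(-(a + b * zi))) for zi in z]
    return sum((pi - y) ** 2 for pi, y in zip(p, labels)) / n

def knockoff_fdp(metrics, labels, score_fn, higher_is_better, top_k, rng):
    """Fraction of the top_k ranked variables that are permuted knockoffs."""
    pool = {}
    for name, values in metrics.items():
        pool[name] = values
        shuffled = values[:]
        rng.shuffle(shuffled)  # assumption: permutation breaks the link to outcome
        pool[name + "_knockoff"] = shuffled
    ranked = sorted(pool, key=lambda k: score_fn(pool[k], labels),
                    reverse=higher_is_better)
    return sum(name.endswith("_knockoff") for name in ranked[:top_k]) / top_k

# Synthetic toy data only (NOT the PMHS sled-test data): 42 tests, 2 metrics.
rng = random.Random(0)
labels = [1 if i % 2 == 0 else 0 for i in range(42)]
metrics = {
    "signal": [y + rng.gauss(0, 0.7) for y in labels],  # related to outcome
    "noise": [rng.gauss(0, 1) for _ in labels],         # unrelated to outcome
}
for label, fn, hib in [("AUROC", auroc, True), ("Brier", brier_logistic, False)]:
    fdps = [knockoff_fdp(metrics, labels, fn, hib, top_k=2, rng=rng)
            for _ in range(20)]
    print(label, "mean knockoff proportion in top 2:", sum(fdps) / len(fdps))
```

Averaging the knockoff proportion over repeated permutations gives a rough, data set–specific estimate of how often each ranking rule promotes a variable known to be uninformative. A fuller treatment would need knockoffs that preserve the correlation structure among metrics and would handle censored outcomes, neither of which this sketch attempts.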

Understanding the role of DNA methylation often requires accurate assessment and comparison of these modifications in a genome-wide fashion. Sequencing-based DNA methylation profiling provides an unprecedented opportunity to map and compare complete DNA CpG methylomes. The available methods include whole genome bisulfite sequencing (WGBS), reduced-representation bisulfite sequencing (RRBS), and enrichment-based methods such as MeDIP-seq, MBD-seq, and MRE-seq. An investigator needs a method that is flexible with the quantity of input DNA, provides the appropriate balance among genomic CpG coverage, resolution, quantitative accuracy, and cost, and comes with robust bioinformatics software for analyzing the data. In this chapter, we describe four protocols that combine state-of-the-art experimental strategies with state-of-the-art computational algorithms to achieve this goal. We first introduce two experimental methods that are complementary to each other. MeDIP-seq, or methylation-dependent immunoprecipitation followed by sequencing, uses an anti-methylcytidine antibody to enrich for methylated DNA fragments, and uses massively parallel sequencing to reveal the identity of the enriched DNA. MRE-seq, or methylation-sensitive restriction enzyme digestion followed by sequencing, relies on a collection of restriction enzymes that recognize CpG-containing sequence motifs but cut only when the CpG is unmethylated. Digested DNA fragments are enriched for unmethylated CpGs at their ends, and these CpGs are revealed by massively parallel sequencing. The two computational methods both implement advanced statistical algorithms that integrate MeDIP-seq and MRE-seq data. M&M is a statistical framework to detect differentially methylated regions between two samples. methylCRF is a machine learning framework that predicts CpG methylation levels at single-CpG resolution, thus raising the resolution and coverage of MeDIP-seq and MRE-seq to a level comparable to WGBS while incurring less than 5% of the cost of WGBS. Together these methods form an effective, robust, and affordable platform for the investigation of genome-wide DNA methylation.
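The MRE-seq principle described above, that an enzyme cuts its CpG-containing motif only when the CpG is unmethylated, can be illustrated with a toy in-silico digestion. This is a simplified sketch and not part of the chapter's protocols. It assumes a single enzyme, HpaII (recognition site CCGG), chosen here only as a familiar example. It scans one strand and treats methylation as a yes/no property of each CpG cytosine. The function name and arguments are hypothetical.

```python
def unmethylated_cut_sites(sequence, methylated_cpgs, motif="CCGG", cpg_offset=1):
    """Return 0-based start positions of motif occurrences whose CpG is unmethylated.

    sequence        -- DNA string (one strand, uppercase)
    methylated_cpgs -- set of 0-based positions of methylated CpG cytosines
    motif           -- recognition site; CCGG is used as an illustrative default
    cpg_offset      -- index of the CpG cytosine inside the motif (1 for CCGG)
    """
    sites = []
    start = sequence.find(motif)
    while start != -1:
        # The enzyme is blocked when the CpG cytosine inside the motif is methylated.
        if start + cpg_offset not in methylated_cpgs:
            sites.append(start)
        start = sequence.find(motif, start + 1)  # allow overlapping matches
    return sites

# Toy sequence with two CCGG sites; the CpG cytosine of the second is methylated.
toy_sequence = "AACCGGTTCCGGAA"
print(unmethylated_cut_sites(toy_sequence, methylated_cpgs={9}))
```

In this toy model, only the sites returned by the function would yield fragment ends, which is why sequenced MRE-seq fragment ends report unmethylated CpGs.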

Articles published on Advanced Statistical Algorithms

Challenges and opportunities beyond structured data in analysis of electronic health records

Fast and In-Situ Identification of Archaeometallurgical Collections in the Museum of Malaga Using Laser-Induced Breakdown Spectroscopy and a New Mathematical Algorithm

Atomic Locations of Minor Dopants and Their Roles in the Stabilization of η-Cu₆Sn₅

Novel learning framework (knockoff technique) to evaluate metric ranking algorithms to describe human response to injury

A Data-Analytics Tutorial: Building Predictive Models for Oil Production in an Unconventional Shale Reservoir

Comprehensive Whole DNA Methylome Analysis by Integrating MeDIP-seq and MRE-seq.

Qualitative Assessments via Infrared Vision of Sub-surface Defects Present Beneath Decorative Surface Coatings

IQuant: an automated pipeline for quantitative proteomics based upon isobaric tags.

Indirect Performance Sensing for On-Chip Self-Healing of Analog and RF Circuits

160-P: neXtype: A SAFE CAR ON THE NGS ROLLER COASTER
