Single-sample Data Research Articles

• Pre-processing techniques to standardise TCATA by modality data were compared.
• Time standardisation was unable to significantly reduce panellist noise.
• Time standardisation by modality caused a decrease in repeatability parameters.
• Standardising data with merged modalities largely maintained patterns in the data.

Temporal sensory profiles are increasingly assessed ‘by modality’ to investigate complex profiles and multisensory properties of foods and beverages. Panellists’ noise in temporal data, caused by differences in oral and cognitive processing, cannot be removed entirely by training or strict experimental setups. Time standardisation can therefore be applied to align the onsets of sensations and standardise temporal data. This paper compared raw temporal data, collected in a preceding study by a trained expert panel (n = 10) using a Temporal Check-All-That-Apply (TCATA) by modality approach, with the same data time standardised either by modality or with merged modalities. Binary data, durations and citation proportions were evaluated and subjected to Repeated Measures Analysis of Variance (RM-ANOVA), Canonical Variate Analysis (CVA), and Multiple Factor Analysis (MFA) to investigate the differences between sensory properties and dynamic profiles. Time standardisation with merged modalities reduced some of the noise related to panel repeatability in the raw data and also improved panel agreement indices in the taste and mouthfeel data. Time standardisation by modality reduced some of the panel heterogeneity, but distorted patterns in the flavour data. The main reason for the distorted patterns in single-sample data, and for the resulting changes in sample discrimination, was that time standardisation affected samples described by quickly fading sensations differently from samples described by long-lasting sensations. No substantial effects were observed on the samples’ overall profiles in their sensory space. Time standardisation by modality could not reduce panellists’ noise in the data. Only a slight noise reduction was achieved in the time standardised data with merged modalities, which supports the use of the raw data for further analyses. The findings indicated several differences between raw and time standardised data and highlighted advantages and disadvantages of pre-processing TCATA by modality data obtained to describe samples that induce complex, multisensory sensations.
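The difference between the two standardisation variants can be illustrated with a minimal sketch. It assumes that time standardisation rescales each evaluation so that the first citation maps to 0 and the last citation to 1, with nearest-neighbour resampling; the abstract does not state the exact procedure, so the function names, the toy data and the resampling rule are all hypothetical.

```python
from typing import Dict, List

Records = Dict[str, List[int]]  # attribute -> binary series (1 = attribute checked)


def standardise_time(records: Records, n_points: int = 6) -> Records:
    """Rescale one evaluation so the first citation is at 0 and the last at 1.

    Assumption: nearest-neighbour resampling onto n_points (>= 2) equally
    spaced standardised time points.
    """
    length = len(next(iter(records.values())))
    # Time indices at which at least one attribute is checked.
    active = [t for t in range(length) if any(seq[t] for seq in records.values())]
    if not active:
        return {name: [0] * n_points for name in records}
    start, end = active[0], active[-1]
    out: Records = {}
    for name, seq in records.items():
        out[name] = [
            seq[start + round(k * (end - start) / (n_points - 1))]
            for k in range(n_points)
        ]
    return out


def standardise_by_modality(
    records: Records, modalities: Dict[str, List[str]], n_points: int = 6
) -> Records:
    """Apply standardise_time separately to the attributes of each modality."""
    out: Records = {}
    for attributes in modalities.values():
        subset = {name: records[name] for name in attributes}
        out.update(standardise_time(subset, n_points))
    return out


# Hypothetical evaluation by one panellist (8 time points).
raw = {
    "sweet":      [0, 0, 1, 1, 0, 0, 0, 0],
    "bitter":     [0, 0, 0, 0, 1, 1, 1, 0],
    "astringent": [0, 0, 0, 0, 0, 1, 1, 1],
}
modalities = {"taste": ["sweet", "bitter"], "mouthfeel": ["astringent"]}

merged = standardise_time(raw)                        # one shared time axis
by_modality = standardise_by_modality(raw, modalities)  # one axis per modality

print(merged["astringent"])       # late onset is preserved
print(by_modality["astringent"])  # onset is moved to standardised time 0
```

In this toy case the merged variant keeps the late onset of the mouthfeel attribute relative to the taste attributes, whereas the by-modality variant stretches it over the whole standardised axis. That is one way the temporal relationship between modalities could be lost.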

Background: Many Single Nucleotide Polymorphism (SNP) calling programs have been developed to identify Single Nucleotide Variations (SNVs) in next-generation sequencing (NGS) data. However, low sequencing coverage presents challenges to accurate SNV identification, especially in single-sample data. Moreover, commonly used SNP calling programs usually include several metrics in their output files for each potential SNP. These metrics are highly correlated in complex patterns, making it extremely difficult to select SNPs for further experimental validation.

Results: To explore solutions to the above challenges, we compare the performance of four SNP calling algorithms, SOAPsnp, Atlas-SNP2, SAMtools, and GATK, in a low-coverage single-sample sequencing dataset. Without any post-output filtering, SOAPsnp calls more SNVs than the other programs since it has fewer internal filtering criteria. Atlas-SNP2 has stringent internal filtering criteria; thus it reports the fewest SNVs. The numbers of SNVs called by GATK and SAMtools fall between those of SOAPsnp and Atlas-SNP2. Moreover, we explore the values of key metrics related to SNV quality in each algorithm and use them as post-output filtering criteria to filter out low-quality SNVs. Under different coverage cutoff values, we compare the four algorithms and calculate the empirical positive calling rate and sensitivity. Our results show that: 1) the overall agreement of the four calling algorithms is low, especially in non-dbSNPs; 2) the agreement of the four algorithms is similar when using different coverage cutoffs, except that the non-dbSNP agreement level tends to increase slightly with increasing coverage; 3) SOAPsnp, SAMtools, and GATK have a higher empirical calling rate for dbSNPs than for non-dbSNPs; and 4) overall, GATK and Atlas-SNP2 have a relatively higher positive calling rate and sensitivity, but GATK calls more SNVs.

Conclusions: Our results show that the agreement between different calling algorithms is relatively low. Thus, more caution should be used in choosing algorithms, setting filtering parameters, and designing validation studies. For reliable SNV calling results, we recommend that users employ more than one algorithm and use metrics related to calling quality and coverage as filtering criteria.
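The recommendation to combine several callers with a coverage filter can be sketched as follows. This is an illustration under stated assumptions, not the authors' pipeline: the call sets are hypothetical toy data keyed by the caller names from the abstract, agreement is assumed to be the fraction of all called sites reported by every caller, and the helper names are invented for this example.

```python
from collections import Counter
from typing import Dict, Set, Tuple

Site = Tuple[str, int]  # (chromosome, 1-based position)


def filter_by_coverage(calls: Dict[Site, int], min_cov: int) -> Set[Site]:
    """Keep the called sites whose read coverage is at least min_cov."""
    return {site for site, cov in calls.items() if cov >= min_cov}


def consensus(call_sets: Dict[str, Set[Site]], min_callers: int) -> Set[Site]:
    """Sites reported by at least min_callers of the callers."""
    counts = Counter(site for sites in call_sets.values() for site in sites)
    return {site for site, n in counts.items() if n >= min_callers}


def agreement(call_sets: Dict[str, Set[Site]]) -> float:
    """Assumed agreement measure: |intersection| / |union| over all callers."""
    union = set().union(*call_sets.values())
    if not union:
        return 0.0
    common = set.intersection(*call_sets.values())
    return len(common) / len(union)


# Hypothetical calls: site -> coverage at that site, one dict per caller.
raw_calls = {
    "SOAPsnp":    {("chr1", 100): 3, ("chr1", 200): 8, ("chr1", 300): 2, ("chr2", 50): 6},
    "SAMtools":   {("chr1", 100): 3, ("chr1", 200): 8, ("chr2", 50): 6},
    "GATK":       {("chr1", 200): 8, ("chr2", 50): 6},
    "Atlas-SNP2": {("chr1", 200): 8},
}

unfiltered = {name: set(calls) for name, calls in raw_calls.items()}
filtered = {name: filter_by_coverage(calls, min_cov=4) for name, calls in raw_calls.items()}

print(agreement(unfiltered), agreement(filtered))
print(sorted(consensus(filtered, min_callers=2)))
```

Requiring support from at least two callers after the coverage filter is only one possible rule; the thresholds used here (`min_cov=4`, `min_callers=2`) are arbitrary choices for the toy data.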

Articles published on Single-sample Data

Multivariable correlation feature network construction and health condition assessment for unlabeled single-sample data

Identifying the critical state of cancers by single-sample Markov flow entropy.

Accurate Label Refinement From Multiannotator of Remote Sensing Data

Optimisation-free density estimation and classification with quantum circuits

Towards a method to anticipate dark matter signals with deep learning at the LHC

The impact of time standardising TCATA by modality data on the multisensory profile of beer

Identifying Critical States of Complex Diseases by Single-Sample Jensen-Shannon Divergence.

Multimodal person detection system

Subway Obstacle Perception and Identification Method Based on Cloud Edge Collaboration

Disease characterization using a partial correlation-based sample-specific network

Feature embedding and conditional neural processes for data imputation

Single-sample landscape entropy reveals the imminent phase transition during disease progression

Parameter distribution characteristics of material fatigue life using improved bootstrap method

An Effective Crowdsourcing Data Reporting Scheme to Compose Cloud-Based Services in Mobile Robotic Systems

A Novel Multi-Sine Excitation Procedure for Impedance Spectroscopy Supports Automatic Drift Correction and Online Error Determination

High speed FPGA-based data acquisition system

Estimation of random errors for lidar based on noise scale factor (Project supported by the Strategic Priority Research Program of the Chinese Academy of Sciences, Grant No. XDB05040300, and the National Natural Science Foundation of China, Grant No. 41205119)

Measurement of the principal quasi-isentrope of lead to ~3Mbar using the "Z" machine

Comparing a few SNP calling algorithms using low-coverage sequencing data

A New Fuzzing Method Using Multi Data Samples Combination
