Within-subject Measures Research Articles

ABSTRACT Previous studies have concluded that wide variance in changes in insulin sensitivity markers following exercise training demonstrates heterogeneity in individual trainability. However, these studies frequently don’t account for technical, biological, and random within-subject measurement error. We used the standard deviation of individual responses (SDIR) to determine whether interindividual variability in trainability exists for fasting and postprandial insulin sensitivity outcomes following low-volume sprint interval training (SIT). We pooled data from 63 untrained participants who completed 6 weeks of SIT (n = 49; VO2max: 35 (7) mL⋅kg−1⋅min−1) or acted as no-intervention controls (n = 14; VO2max: 34 (6) mL⋅kg−1⋅min−1). Fasting and oral glucose tolerance test (OGTT)-derived measures of insulin sensitivity were measured pre- and post-intervention. SDIR values were positive and exceeded a small effect size threshold for changes in fasting glucose (SDIR = 0.27 [95%CI 0.07,0.38] mmol⋅L−1), 2-h OGTT glucose (SDIR = 0.89 [0.22,1.23] mmol⋅L−1), glucose area-under-the-curve (SDIR = 66.4 [−81.5,124.3] mmol⋅L−1⋅120min−1) and The Cederholm Index (SDIR = 7.2 [−16.0,19.0] mg⋅l2⋅mmol−1⋅mU−1⋅min−1), suggesting meaningful individual responses to SIT, whilst SDIR values were negative for fasting insulin, fasting insulin resistance and insulin AUC. For all variables, the 95% CIs were wide and/or crossed zero, highlighting uncertainty about the existence of true interindividual differences in exercise trainability. Only 2–22% of participants could be classified as responders or non-responders with more than 95% certainty. Our findings demonstrate it cannot be assumed that variation in changes in insulin sensitivity following SIT is attributable to inherent differences in trainability, and reiterate the importance of accounting for technical, biological, and random error when examining heterogeneity in health-related training adaptations. Highlights This study tested whether true interindividual variability exists for changes in insulin sensitivity and glyceamic control following 6-weeks of low volume sprint interval training (SIT). The high level of technical, biological, and random error associated with repeated measurements of insulin sensitivity and glycaemic control, means we can neither confidently conclude that there is evidence of true interindividual differences in the trainability of these outcomes following SIT, nor confidently identify responders or non-responders for such parameters. Researchers contrasting responders vs. non-responders for a given parameter, either to understand mechanisms of adaptation and/or develop physiological/genetic/epigenetic predictors of response, need to be aware that identification of responders and non-responders with sufficient certainty may not be achievable for parameters with a high level of technical, biological, and random error.

Read full abstract

KEY POINT: Nonparametric statistical tests can be a useful alternative to parametric statistical tests when the test assumptions about the data distribution are not met.In this issue of Anesthesia & Analgesia, Wang et al1 report results of a trial of the effects of preoperative gum chewing on sore throat after general anesthesia with a supraglottic airway device. The authors used the Mann-Whitney U test—a nonparametric test—to compare numerical rating scale pain scores between the groups. The majority of statistical methods—namely, parametric methods—is based on the assumption of a specific data distribution in the population from which the data were sampled. This distribution is characterized by ≥1 parameters, such as the mean and the variance for the normal (Gaussian, “bell shaped”) distribution. Parametric methods commonly seek to estimate population parameters and to test hypotheses on these parameters—for example, on means and mean differences between groups. In contrast, though the exact definition varies in literature, nonparametric methods generally do not assume a specific probability distribution. While other nonparametric methods exist, we focus here on the widely used rank-based nonparametric tests. These methods use the ranks of the data instead of their actual values and can basically be used for all data that can be ranked, including ordinal data, discrete data (like counts), and continuous data. Nonparametric methods are commonly used when data distribution assumptions of parametric tests are not met. In practice, researchers often assess whether the outcome variable is overall normally distributed and use a nonparametric test when it is not. It is worth noting, however, that rank-based nonparametric tests: usually have slightly less power than parametric tests when the underlying distributional assumptions of the parametric test are actually met, often focus on hypothesis testing rather than estimation of parameters of interest, and may not be available when more complex analyses than simple within- or between-group comparisons are required. It can thus be useful to consider whether a parametric test can be used despite apparently non-normally distributed outcome data. First, the normality assumption does not necessarily apply to the dependent variable itself but, for example, to the residuals in a linear regression model. Second, some parametric tests like the t test can be relatively robust against non-normality when the sample size is large. Third, data transformations to approximate a normal distribution can be considered. Fourth, when data follow some other well-defined distribution (eg, Poisson distribution for count data), researchers can take advantage of parametric methods designed for these specific distributions.2 The Mann-Whitney U test (also known as the Wilcoxon rank-sum test or Wilcoxon-Mann-Whitney test) used by Wang et al1 (Figure) is the nonparametric equivalent to the 2-sample t test to compare 2 independent groups. It tests the null hypothesis that both groups come from populations with the same distribution, specifically, whether randomly drawn observations from one group are more likely to be higher (or lower) than randomly drawn observations from the other group.3 Contrary to common belief, the Mann-Whitney U test does not compare the medians between groups. This is only true under the assumption that the distribution has the same shape in both groups and differs only by its location. For >2 groups, the Kruskal–Wallis test can be used as a nonparametric alternative to 1-way analysis of variance (ANOVA).Figure.: Adapted text excerpt from the statistical methods section of Wang et al1 and their Table 2. These authors used Mann-Whitney U tests to compare patient self-reported NRS pain scores (the secondary outcome), which were not normally distributed, between their chewing gum group (G Group) and the control group (C Group). NRS indicates numeric rating scale.The Wilcoxon signed rank test is used to compare 2 paired (nonindependent) groups or 2 repeated within-subject measurements, and this test assumes that the distribution of the between-group differences is symmetric. The Friedman test is the nonparametric equivalent to 1-way repeated-measures ANOVA for comparisons of >2 paired groups.4 For a nonparametric correlation analysis, Spearman rank correlation is commonly used.5

Read full abstract

Within-subject Measures Research Articles

Related Topics

Articles published on Within-subject Measures

Repeatability of vibration-controlled transient elastography versus magnetic resonance elastography in patients with cirrhosis: A prospective study.

Load and recovery monitoring in Swiss top-level youth soccer players: Exploring the associations of a new web application-based score with recognised load measures

Does visuospatial neglect contribute to standing balance within the first 12 weeks post-stroke? A prospective longitudinal cohort study.

Individual Differences in the Susceptibility to the Feature-Positive Effect

Robust neural tracking of linguistic speech representations using a convolutional neural network

Observational Study of Pediatric Cochlear Implant Recipients: Two-year Follow-up Outcomes.

Exploring interindividual differences in fasting and postprandial insulin sensitivity adaptations in response to sprint interval exercise training

Disentangling the Influence of Data Contamination in Growth Curve Modeling: A Median Based Bayesian Approach

How selves differ within and across cognitive domains: self-prioritisation, self-concept, and psychiatric traits

The Effectiveness of Unilateral Cochlear Implantation on Performance-Based and Patient-Reported Outcome Measures in Finnish Recipients.

Simulations found within-subject measurement variation in glycaemic measures may cause overdiagnosis of prediabetes and diabetes

Liver and Spleen Stiffness Surveillance Through Elastography During and After Direct-Acting Antiviral Therapy in Patients With Chronic Hepatitis C.

Notched and Nonnotched Stimuli Are Equally Effective at the Mixing-Point Level in Sound Therapy for Tinnitus Relief.

Nonparametric Statistical Methods in Medical Research.

Morphological changes in the subthalamic nucleus of people with mild-to-moderate Parkinson\u2019s disease: a 7T MRI study

An Item-Level Analysis for Detecting Faking on Personality Tests: Appropriateness of Ideal Point Item Response Theory Models.

Vascular response to social cognitive performance measured by infrared thermography: A translational study from mouse to man.

Verbal Baselining: Within-Subject Consistency of CBCA Scores across Different Truthful and Fabricated Accounts

Behavioral inhibition and EEG delta-beta correlation in early childhood: Comparing a between-subjects and within-subjects approach

Use of Repeated Within-Subject Measures to Assess Infants' Preference for Similar Others.

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Within-subject Measures Research Articles

Related Topics

Articles published on Within-subject Measures

Repeatability of vibration-controlled transient elastography versus magnetic resonance elastography in patients with cirrhosis: A prospective study.

Load and recovery monitoring in Swiss top-level youth soccer players: Exploring the associations of a new web application-based score with recognised load measures

Does visuospatial neglect contribute to standing balance within the first 12 weeks post-stroke? A prospective longitudinal cohort study.

Individual Differences in the Susceptibility to the Feature-Positive Effect

Robust neural tracking of linguistic speech representations using a convolutional neural network

Observational Study of Pediatric Cochlear Implant Recipients: Two-year Follow-up Outcomes.

Exploring interindividual differences in fasting and postprandial insulin sensitivity adaptations in response to sprint interval exercise training

Disentangling the Influence of Data Contamination in Growth Curve Modeling: A Median Based Bayesian Approach

How selves differ within and across cognitive domains: self-prioritisation, self-concept, and psychiatric traits

The Effectiveness of Unilateral Cochlear Implantation on Performance-Based and Patient-Reported Outcome Measures in Finnish Recipients.

Simulations found within-subject measurement variation in glycaemic measures may cause overdiagnosis of prediabetes and diabetes

Liver and Spleen Stiffness Surveillance Through Elastography During and After Direct-Acting Antiviral Therapy in Patients With Chronic Hepatitis C.

Notched and Nonnotched Stimuli Are Equally Effective at the Mixing-Point Level in Sound Therapy for Tinnitus Relief.

Nonparametric Statistical Methods in Medical Research.

Morphological changes in the subthalamic nucleus of people with mild-to-moderate Parkinson\u2019s disease: a 7T MRI study

An Item-Level Analysis for Detecting Faking on Personality Tests: Appropriateness of Ideal Point Item Response Theory Models.

Vascular response to social cognitive performance measured by infrared thermography: A translational study from mouse to man.

Verbal Baselining: Within-Subject Consistency of CBCA Scores across Different Truthful and Fabricated Accounts

Behavioral inhibition and EEG delta-beta correlation in early childhood: Comparing a between-subjects and within-subjects approach

Use of Repeated Within-Subject Measures to Assess Infants' Preference for Similar Others.