Clustering Of Time Series Data Research Articles

Introduction: Physical activity (PA) is a important modifiable risk factor for cardiovascular disease (CVD) and CVD mortality. There is growing evidence that sexual and gender minority (SGM; lesbian/gay, bisexual, transgender) adults are at higher risk of insufficient PA and CVD. Accelerometers allow for collection of granular PA data that can identify individual- and temporal heterogeneity in daily patterns. To date, no studies have investigated accelerometry-based PA patterns among SGM adults. Goal: To characterize daily PA trajectories among SGM adults and identify distinct phenotypes using a time series-based clustering technique. Methods: We recruited an online sample of healthy SGM adults in the United States who completed continuous wrist-worn accelerometry for 30 days to collect step counts. For clustering, we used a functional latent block models (FLBMs), where each 24-hour period was a Fourier-smoothed data curve. FLBMs allow hierarchical time-series data clustering by simultaneously clustering person- and day-level data. The optimal number of clusters was selected using the integrated completed likelihood (ICL) criterion. Results: Forty-two SGM adults with a mean age of 27.0 years (+/- 7.7) provided 1207 person-level days of accelerometry (range=6-31 days/person). A final model with 4 blocks (2 x 2 clusters) provided the best fit (ICL= -70086.4). Two person-level clusters were identified, which were characterized by differences in the amount and distribution of steps throughout the day were identified. Cluster 1 (n=26) had higher overall steps counts with significant morning and evening increases in step counts. Cluster 2 (n=16) had fewer steps that were within a narrower time period. We further identified 2 day-level clusters. Cluster 2 (n=14) had a wider temporal distribution of step counts and a higher variance on weekend days vs. weekdays relative to Cluster 1 (n=17). Conclusions: This is the first study to elucidate daily PA trajectories in SGM people. Using FLBM, we accounted for individual heterogeneity and relations among days. Findings can help identify individuals at increased risk of physical inactivity and subsequent negative health outcomes, providing important knowledge to inform behavioral interventions.

Read full abstract

Background: Proliferating alloreactive T cells have a central role in the induction of acute GVHD (aGVHD) making them a promising target in preventing alloreactivity. T cell depleting regimens such as anti-T-lymphocyte globulin (ATG) or post-transplant cyclophosphamide (PTCy) effectively eliminate these cells and reduce aGVHD after hematopoietic cell transplantation (HCT) with marginal differences in clinical outcome. Comparative immune reconstitution analyses could contribute to answer the question, which agent would be most appropriate for an individual patient. However, not only effects relating to the different T cell depleting regimens, i.e. ATG or PTCy may be relevant but the general heterogeneity of T cell reconstitution. Aims: Here, we aimed at dissecting this heterogeneity with an approach called time-series clustering to better understand the impact of both regimens in individual patients and to consequently identify distinct patient subsets which benefit the most from each protocol. Methods: We retrospectively compared immune reconstitution data of 339 recipients of matched-unrelated donor (MUD) HCT with either ATG (n=304) or PTCy (n=35) as T cell depletion via conventional analysis on the cohort level and for its individual heterogeneity via time-series clustering. This analysis leveraged the approach of dynamic time warping (DTW) to determine the distance measure later used for the partitional clustering of individual patient time-series data. The performance of this methods was evaluated by the silhouette coefficient (Sil) which indicates a separation of clusters with values between 0 (overlapping clusters) and +1 (best separation); A 10-fold resampling was used to assess model robustness. Cluster information were used for subsequent analysis of clinical outcomes. All analysis were performed using R packages R stats, dtwclust, survival, survminer and cmprsk. Results: Comparative analysis of cellular reconstitution revealed distinct prominent T cell population after each protocol. While patients receiving PTCy as GVHD prophylaxis presented with higher levels of regulatory T cells, ATG patients had higher levels of γδ T- or NK-T cells. Time-series clustering of T cell subpopulations that associate with GVHD successfully dissected each population’s heterogeneity. For ATG patients the clustering revealed two distinct clusters with its optimal model configuration showing a good and robust silhouette coefficient (7_1: Sil=0.524) and a balanced patient distribution (Fig. 1A). In this model, clustering of ATG patients was driven by αβ- and activated T cells as those revealed higher absolute counts and a greater difference in shape over time as compared to regulatory- and γδ T cells. Patients in ATG cluster 1, showing higher absolute counts of αβ- and activated T cells compared to cluster 2 were associated to a significantly higher 1-year OS (98% vs. 79%, p=0.0023) due to lower NRM (p=0.032) and relapse (p=0.01). The clustering of PTCy patients distinguished two clusters with lower silhouette coefficients had a strong impact of αβ- and activated T cells (Fig.1B). Comparing the overall survival probability of both ATG clusters with the PTCy cohort revealed a significant difference driven by the ATG clusters (Fig 1C). Image:Summary/Conclusion: Beyond a differential impact of ATG and PTCy on immune reconstitution our analysis identified phenotypes that reproducibly associated with impaired clinical outcomes within the same T cell depletion platform. This provides guidance for individually choosing the most appropriate agent in the MUD setting.

Read full abstract

Clustering Of Time Series Data Research Articles

Related Topics

Articles published on Clustering Of Time Series Data

Clustering plasma concentration-time curves: applications ofunsupervised learning in pharmacogenomics

1DCAE-TSSAMC: Two-Stage Multi-Dimensional Spatial Features Based Multi-View Deep Clustering for Time Series Data

Time-series data clustering with load-shape preservation for identifying residential energy consumption behaviors

Evolutionary Multi-Tasking Optimization for High-Efficiency Time Series Data Clustering

Self-Supervised Framework Based on Subject-Wise Clustering for Human Subject Time Series Data

The ensemble distance on model-based clustering for regions clustering based on rainfall: The case of rainfall in West Java Indonesia

Time-constrained Gaussian mixture model for clustering multi-modal chemical process data

Temporal Multi-features Representation Learning-Based Clustering for Time-Series Data

Bayesian Semiparametric Local Clustering of Multiple Time Series Data

Abstract 11546: Functional Co-Clustering of Physical Activity Patterns Among Sexual and Gender Minority Adults

A hybrid machine learning approach for the load prediction in the sustainable transition of district heating networks

Deep Temporal Contrastive Clustering

Identifying responders to elamipretide in Barth syndrome: Hierarchical clustering for time series data

Fractal dimension based geographical clustering of COVID-19 time series data

Spatiotemporal Sequence-to-Sequence Clustering for Electric Load Forecasting

Optimization-Assisting Dual-Step Clustering of Time Series Data

A Fast Weighted Fuzzy C-Medoids Clustering for Time Series Data Based on P-Splines.

Hydrological Time Series Clustering: A Case Study of Telemetry Stations in Thailand

P1317: TIME SERIES CLUSTERING OF T CELL SUBSETS DISSECTS HETEROGENEITY IN IMMUNE RECONSTITUTION AND SURVIVAL AMONG RECIPIENTS OF MUD-HCT WITH ATG OR PTCY

The use of infrared spectroscopy and chemometrics to investigate deterioration in vegetable tanned leather: potential applications in heritage science

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Clustering Of Time Series Data Research Articles

Related Topics

Articles published on Clustering Of Time Series Data

Clustering plasma concentration-time curves: applications ofunsupervised learning in pharmacogenomics

1DCAE-TSSAMC: Two-Stage Multi-Dimensional Spatial Features Based Multi-View Deep Clustering for Time Series Data

Time-series data clustering with load-shape preservation for identifying residential energy consumption behaviors

Evolutionary Multi-Tasking Optimization for High-Efficiency Time Series Data Clustering

Self-Supervised Framework Based on Subject-Wise Clustering for Human Subject Time Series Data

The ensemble distance on model-based clustering for regions clustering based on rainfall: The case of rainfall in West Java Indonesia

Time-constrained Gaussian mixture model for clustering multi-modal chemical process data

Temporal Multi-features Representation Learning-Based Clustering for Time-Series Data

Bayesian Semiparametric Local Clustering of Multiple Time Series Data

Abstract 11546: Functional Co-Clustering of Physical Activity Patterns Among Sexual and Gender Minority Adults

A hybrid machine learning approach for the load prediction in the sustainable transition of district heating networks

Deep Temporal Contrastive Clustering

Identifying responders to elamipretide in Barth syndrome: Hierarchical clustering for time series data

Fractal dimension based geographical clustering of COVID-19 time series data

Spatiotemporal Sequence-to-Sequence Clustering for Electric Load Forecasting

Optimization-Assisting Dual-Step Clustering of Time Series Data

A Fast Weighted Fuzzy C-Medoids Clustering for Time Series Data Based on P-Splines.

Hydrological Time Series Clustering: A Case Study of Telemetry Stations in Thailand

P1317: TIME SERIES CLUSTERING OF T CELL SUBSETS DISSECTS HETEROGENEITY IN IMMUNE RECONSTITUTION AND SURVIVAL AMONG RECIPIENTS OF MUD-HCT WITH ATG OR PTCY

The use of infrared spectroscopy and chemometrics to investigate deterioration in vegetable tanned leather: potential applications in heritage science