Introduced in 2010, the sub-discipline of gerontologic biostatistics was conceptualized to address the specific challenges of analyzing data from clinical research studies involving older adults. Since then, the evolving technological landscape has led to a proliferation of advancements in biostatistics and other data sciences that have significantly influenced the practice of gerontologic research, including studies beyond the clinic. Data science is the field at the intersection of statistics and computer science, and although the term "data science" was not widely used in 2010, the field has quickly made palpable impacts on gerontologic research. In this Review in Depth, we describe multiple advancements of biostatistics and data science that have been particularly impactful. Moreover, we propose the sub-discipline of "gerontologic biostatistics and data science", or GBDS, which subsumes gerontologic biostatistics into a more encompassing practice. Prominent GBDS advancements that we discuss herein include cutting-edge methods in experimental design and causal inference, adaptations of machine learning, the rigorous quantification of deep phenotypic measurement, and analysis of high-dimensional -omics data. We additionally describe the need for integration of information from multiple studies and propose strategies to foster reproducibility, replicability, and open science. Lastly, we provide information on software resources for GBDS practitioners to apply these approaches to their own work and propose areas where further advancement is needed. The methodological topics reviewed here aim to enhance data-rich research on aging and foster the next generation of gerontologic researchers.
Read full abstract