Measurement Bias Research Articles

The gas in the interstellar medium (ISM) of galaxies is supersonically turbulent. Measurements of turbulence typically rely on cold gas emission lines for low-$z$ galaxies and warm ionized gas observations for $z > 0$ galaxies. Studies of warm gas kinematics at $z > 0$ conclude that the turbulence strongly evolves as a function of redshift, due to the increasing impact of gas accretion and mergers in the early Universe. However, recent findings suggest potential biases in turbulence measurements derived from ionized gas at high-$z$, impacting our understanding of turbulence origin, ISM physics and disk formation. We investigate the evolution of turbulence using velocity dispersion (sigma ) measurements from cold gas tracers (i.e., CO CI CII ). The initial dataset comprises 17 galaxy disks with high data quality from the ALPAKA sample, supplemented with galaxies from the literature, resulting in a sample of 57 galaxy disks spanning the redshift range $z = 0 - 5$. This extended sample consists of main-sequence and starburst galaxies with stellar masses $ M_ odot $. The comparison with current Halpha kinematic observations and existing models demonstrates that the velocity dispersion inferred from cold gas tracers differ by a factor of $ 3$ from those obtained using emission lines tracing the warm, ionized gas. We show that stellar feedback is the main driver of turbulence measured from cold gas tracers and the physics of turbulence driving does not appear to evolve with time. This is fundamentally different from the conclusions of studies based on warm gas, which had to consider additional turbulence drivers to explain the high values of sigma . We present a model predicting the redshift evolution of turbulence in galaxy disks, attributing the increase of sigma with redshift to the higher energy injected by supernovae due to the elevated star-formation rate in high-$z$ galaxies. This supernova-driven model suggests that turbulence is lower in galaxies with lower stellar mass compared to those with higher stellar mass. Additionally, it forecasts the evolution of sigma in Milky-Way like progenitors.

Read full abstract

AbstractLarge language models (LLMs) are increasingly adopted in educational contexts to provide personalized support to students and teachers. The unprecedented capacity of LLM‐based applications to understand and generate natural language can potentially improve instructional effectiveness and learning outcomes, but the integration of LLMs in education technology has renewed concerns over algorithmic bias, which may exacerbate educational inequalities. Building on prior work that mapped the traditional machine learning life cycle, we provide a framework of the LLM life cycle from the initial development of LLMs to customizing pre‐trained models for various applications in educational settings. We explain each step in the LLM life cycle and identify potential sources of bias that may arise in the context of education. We discuss why current measures of bias from traditional machine learning fail to transfer to LLM‐generated text (eg, tutoring conversations) because text encodings are high‐dimensional, there can be multiple correct responses, and tailoring responses may be pedagogically desirable rather than unfair. The proposed framework clarifies the complex nature of bias in LLM applications and provides practical guidance for their evaluation to promote educational equity. Practitioner notesWhat is already known about this topic The life cycle of traditional machine learning (ML) applications which focus on predicting labels is well understood. Biases are known to enter in traditional ML applications at various points in the life cycle, and methods to measure and mitigate these biases have been developed and tested. Large language models (LLMs) and other forms of generative artificial intelligence (GenAI) are increasingly adopted in education technologies (EdTech), but current evaluation approaches are not specific to the domain of education. What this paper adds A holistic perspective of the LLM life cycle with domain‐specific examples in education to highlight opportunities and challenges for incorporating natural language understanding (NLU) and natural language generation (NLG) into EdTech. Potential sources of bias are identified in each step of the LLM life cycle and discussed in the context of education. A framework for understanding where to expect potential harms of LLMs for students, teachers, and other users of GenAI technology in education, which can guide approaches to bias measurement and mitigation. Implications for practice and/or policy Education practitioners and policymakers should be aware that biases can originate from a multitude of steps in the LLM life cycle, and the life cycle perspective offers them a heuristic for asking technology developers to explain each step to assess the risk of bias. Measuring the biases of systems that use LLMs in education is more complex than with traditional ML, in large part because the evaluation of natural language generation is highly context‐dependent (eg, what counts as good feedback on an assignment varies). EdTech developers can play an important role in collecting and curating datasets for the evaluation and benchmarking of LLM applications moving forward.

Read full abstract

Measurement Bias Research Articles

Related Topics

Articles published on Measurement Bias

The Impact of Sparse Datasets When Harmonizing Data from Studies with Different Measures of the Same Construct.

Does terminology matter when measuring stigmatizing attitudes about weight? Validation of a brief, modified attitudes toward obese persons scale.

Associations of Major Lifetime and Everyday Discrimination with Cognitive Function among Middle-Aged and Older Adults.

Imaging technology to assess tissue oxygen saturation of the gastric conduit in thoracic esophagectomy

Multimodal optical imaging of the oculofacial region using a solid tissue-simulating facial phantom.

Estimating Salmonella Typhimurium Growth on Chicken Breast Fillets Under Simulated Less-Than-Truckload Dynamic Temperature Abuse.

Comparison of automated corneal endothelial cell analysis in healthy and postoperative eyes with phakic intraocular lens: a cross-sectional study and literature review

On Kinematic Measurements of Self-gravity in Protoplanetary Disks

The ALMA-ALPAKA survey. II. Evolution of turbulence in galaxy disks across cosmic time: Difference between cold and warm gas

Ranging Atom Probe Spectra to Reduce Measurement Bias

Does Terminology Matter When Measuring Stigmatizing Attitudes About Weight? Validation of a Brief, Modified Attitudes Toward Obese Persons Scale.

Measurement error and bias in real-world oncology endpoints when constructing external control arms

Provider Implicit Racial Bias in Pediatric Sickle Cell Disease.

Optimization of cervical cord synthetic T1-weighted MRI for enhancing clinical application in neurodegenerative spinal cord disorders

Developing a Technological Pedagogical and Content Knowledge (TPACK) survey for university teachers

The Ambiguous Cue Task: Measurement reliability of an experimental paradigm for the assessment of interpretation bias and associations with mental health

A sum of its parts: A systematic review evaluating biopsychosocial and behavioral determinants of perinatal depression.

The life cycle of large language models in education: A framework for understanding sources of bias

Performance of Microsoft Azure Kinect DK as a tool for estimating human body segment lengths

Psychometric Properties of the Attitude Toward Statistics Scale on a Bangladeshi Sample

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Measurement Bias Research Articles

Related Topics

Articles published on Measurement Bias

The Impact of Sparse Datasets When Harmonizing Data from Studies with Different Measures of the Same Construct.

Does terminology matter when measuring stigmatizing attitudes about weight? Validation of a brief, modified attitudes toward obese persons scale.

Associations of Major Lifetime and Everyday Discrimination with Cognitive Function among Middle-Aged and Older Adults.

Imaging technology to assess tissue oxygen saturation of the gastric conduit in thoracic esophagectomy

Multimodal optical imaging of the oculofacial region using a solid tissue-simulating facial phantom.

Estimating Salmonella Typhimurium Growth on Chicken Breast Fillets Under Simulated Less-Than-Truckload Dynamic Temperature Abuse.

Comparison of automated corneal endothelial cell analysis in healthy and postoperative eyes with phakic intraocular lens: a cross-sectional study and literature review

On Kinematic Measurements of Self-gravity in Protoplanetary Disks

The ALMA-ALPAKA survey. II. Evolution of turbulence in galaxy disks across cosmic time: Difference between cold and warm gas

Ranging Atom Probe Spectra to Reduce Measurement Bias

Does Terminology Matter When Measuring Stigmatizing Attitudes About Weight? Validation of a Brief, Modified Attitudes Toward Obese Persons Scale.

Measurement error and bias in real-world oncology endpoints when constructing external control arms

Provider Implicit Racial Bias in Pediatric Sickle Cell Disease.

Optimization of cervical cord synthetic T1-weighted MRI for enhancing clinical application in neurodegenerative spinal cord disorders

Developing a Technological Pedagogical and Content Knowledge (TPACK) survey for university teachers

The Ambiguous Cue Task: Measurement reliability of an experimental paradigm for the assessment of interpretation bias and associations with mental health

A sum of its parts: A systematic review evaluating biopsychosocial and behavioral determinants of perinatal depression.

The life cycle of large language models in education: A framework for understanding sources of bias

Performance of Microsoft Azure Kinect DK as a tool for estimating human body segment lengths

Psychometric Properties of the Attitude Toward Statistics Scale on a Bangladeshi Sample