Average Interrater Reliability Research Articles

<h3>Objective(s)</h3> To evaluate efficacy of health promotion and obesity prevention interventions among children and adolescents with DD. <h3>Data Sources</h3> Databases searched: PubMed, CINAHL, PsycINFO, Cochrane Library, Psychology and Behavioral Sciences Collection, Health Source: Nursing/Academic, Web of Science, Embase, and Scopus. Search terms: child*, adolesc*, pediatric*, disability, intellectual*, impair*, development*, autis*, down syndrome*, cognitive*; health promotion, healthy lifestyle, healthy eating, obesity intervention, obesity prevention, physical exercise; BMI, eating behaviors, physical in/activity, dietary intake, and sedentary behaviors. <h3>Study Selection</h3> Inclusion criteria: (1) children and adolescents (6-17 years) with DD (2) interventions targeting obesity outcomes within a child's natural environment; (3) peer-reviewed, experimental or quasi-experimental studies; (4) quantitative or mixed methods approach; (5) reported primary outcomes for the child; (6) published between 2010 – 2021. Reviewers worked in pairs to independently screen articles. Author and journal information was blinded. Selection was based on consensus. Average interrater reliability across screeners was high (92.7%). Of 21,889 unique records, 21 studies met selection criteria. <h3>Data Extraction</h3> Data Extraction was conducted by six research team members who worked in three pairs using a standardized form adapted from existing sources. We adapted Reichow's (2011) evaluative method for rating the rigor of each study and determining the overall level of evidence for each intervention. Interrater reliability for rating of rigor was 88%. Rating disagreements within pairs were resolved through discussion. <h3>Data Synthesis</h3> Interventions were categorized into five categories including: aerobic and strength training, sport-based physical activity, aquatic exercise, active video gaming, and diet and lifestyle interventions. None of the intervention categories met the recommended threshold for ‘established' EBP. Aquatic exercise programs and active videogaming programs approached Reichow's (2011) cut off for ‘probable EBP'. Diet and lifestyle interventions, aerobic and strength training exercise programs, and sports-based physical activity programs warranted an overall rating of ‘not an EBP'. <h3>Conclusions</h3> Interventions such as sports-based exercise programs and active video gaming showed promise, but more research is needed to confirm their effectiveness. Future research must focus on theoretically grounded and rigorous intervention studies with diverse samples. <h3>Author(s) Disclosures</h3> None

Read full abstract

The Community of Inquiry (CoI) framework [1] has been broadly used to analyse learning experience in online discussion forums for two decades. Cognitive presence, which is a primary dimension of the CoI framework, manifests the reflection of (re)constructing knowledge and problem-solving processes in the learning experience [2]. Researchers doing text analysis using machine learning techniques are making promising contributions to analysing phases of cognitive presence automatically [3]–[5] in online discussions. However, most studies of automated cognitive analysis focus on improving the accuracy and reliability of the classifiers. They ignored that another purpose of applying machine learning techniques in educational research should be to pinpoint research bias that scholars neither intended to nor can have found without computer support. This session will present the example of ‘research bias’ discovered from both manual and automated classification of cognitive phases, provoking scholars to rethink and improve the conflicting part in the taxonomies of cognitive presence under MOOC context.  The manual-classification rubric that used to label discussion messages of a target MOOC combines Garrison, Anderson and Archer’s [2] scheme with Park’s [6] revised version. The rubric describes four phases of cognitive presence (i.e. triggering event, exploration, integration and resolution), and indicators of each phase in online discussions. We reported the average inter-rater reliability between two human raters achieved 95.4% agreement (N = 1002) with a Cohen’s weighted kappa of 0.96. Interestingly, we found the average inter-rater reliability decreased to 80.1% after increasing the size of data samples (N = 1918) and the number of human raters to three. After training the automated classifiers to predict phases of cognitive presence, the confusion matrix implies that most of the disagreements between computer raters occurred between adjacent phases of cognitive presence. The disagreements between human raters also have the same problems. We assume the additional categories may exist between cognitive phases in such MOOC discussion messages. These details will be discussed during the presentation.  References [1] D. Garrison, T. Anderson, and W. Archer, “Critical Inquiry in a Text-Based Environment: Computer Conferencing in Higher Education,” Internet High. Educ., vol. 2, no. 2, pp. 87–105, 1999. [2] D. Garrison, T. Anderson, and W. Archer, “Critical thinking, cognitive presence, and computer conferencing in distance education,” Am. J. Distance Educ., vol. 15, no. 1, pp. 7–23, 2001. [3] V. Kovanović, S. Joksimović, D. Gašević, and M. Hatala, “Automated cognitive presence detection in online discussion transcripts,” in Automated cognitive presence detection in online discussion transcripts’ CEUR Workshop Proceedings (vol. 1137), 2014. [4] V. Kovanović et al., “Towards automated content analysis of discussion transcripts,” Proc. Sixth Int. Conf. Learn. Anal. Knowl. - LAK ’16, pp. 15–24, 2016. [5] E. Farrow, J. Moore, and D. Gasevic, “Analysing discussion forum data: a replication study avoiding data contamination,” 9th Int. Learn. Anal. Knowl. Conf., no. March, 2019. [6] C. Park, “Replicating the Use of a Cognitive Presence Measurement Tool,” J. Interact. Online Learn., vol. 8, no. 2, pp. 140–155, 2009.

Read full abstract

Average Interrater Reliability Research Articles

Articles published on Average Interrater Reliability

Reliability of pediatric Rome IV criteria for the diagnosis of disorders of gut-brain interaction.

Reliability of landmark identification for analysis of the temporomandibular joint in real-time MRI

The development and preliminary evaluation of the Genetic Counseling Skills Checklist.

Interventions for Health Promotion and Obesity Prevention for Children And Adolescents With Developmental Disabilities (DD)

Development and Validation of Comic-based Learning Module in Physics

Comparing human coding to two natural language processing algorithms in aspirations of people affected by Duchenne Muscular Dystrophy

Presbylarynx: validation of a classification based on morphological characteristics.

Reliability assessment and validation of the post-acne hyperpigmentation index (PAHPI) in a population from Sub-Saharan Africa in Senegal

Validity and Inter-rater Reliability of the Scoring Rubrics for the Science Teacher TPACK Test Instrument

Measuring behavior change technique delivery and receipt in physical activity behavioral interventions.

Avoiding the Deep Plantar Arterial Arch in Transmetatarsal Amputations: A Cadaver Study.

Determination of Interrater Reliability of a Universal Evaluator Rubric to Assess Student Pharmacist Communication Skills

The Outcomes for Human Trafficking Instrument: Validity and Reliability Testing

Analysing EHR navigation patterns and digital workflows among physicians during ICU pre-rounds.

Construction and Validation of a Video Coding Tool for an Intervention to Improve Parental Feeding

Development and Reliability of the Comprehensive Crisis Plan Checklist, 2nd Edition

Automated analysis of cognitive presence in MOOC discussions

How Reliable Are Therapeutic Competence Ratings? Results of a Systematic Review and Meta-Analysis

Location of the Deep Plantar Artery: A Cadaveric Study.

Assessing treatment integrity in personalized CBT: the inventory of therapeutic interventions and skills

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Average Interrater Reliability Research Articles

Articles published on Average Interrater Reliability

Reliability of pediatric Rome IV criteria for the diagnosis of disorders of gut-brain interaction.

Reliability of landmark identification for analysis of the temporomandibular joint in real-time MRI

The development and preliminary evaluation of the Genetic Counseling Skills Checklist.

Interventions for Health Promotion and Obesity Prevention for Children And Adolescents With Developmental Disabilities (DD)

Development and Validation of Comic-based Learning Module in Physics

Comparing human coding to two natural language processing algorithms in aspirations of people affected by Duchenne Muscular Dystrophy

Presbylarynx: validation of a classification based on morphological characteristics.

Reliability assessment and validation of the post-acne hyperpigmentation index (PAHPI) in a population from Sub-Saharan Africa in Senegal

Validity and Inter-rater Reliability of the Scoring Rubrics for the Science Teacher TPACK Test Instrument

Measuring behavior change technique delivery and receipt in physical activity behavioral interventions.

Avoiding the Deep Plantar Arterial Arch in Transmetatarsal Amputations: A Cadaver Study.

Determination of Interrater Reliability of a Universal Evaluator Rubric to Assess Student Pharmacist Communication Skills

The Outcomes for Human Trafficking Instrument: Validity and Reliability Testing

Analysing EHR navigation patterns and digital workflows among physicians during ICU pre-rounds.

Construction and Validation of a Video Coding Tool for an Intervention to Improve Parental Feeding

Development and Reliability of the Comprehensive Crisis Plan Checklist, 2nd Edition

Automated analysis of cognitive presence in MOOC discussions

How Reliable Are Therapeutic Competence Ratings? Results of a Systematic Review and Meta-Analysis

Location of the Deep Plantar Artery: A Cadaveric Study.

Assessing treatment integrity in personalized CBT: the inventory of therapeutic interventions and skills