Introduction: In response to concerns in the literature about the quality of completed work-based assessments (WBAs), faculty development and rater training initiatives have been developed. The Completed Clinical Evaluation Report Rating (CCERR) was designed to evaluate such interventions by measuring the quality of documented assessments on In-Training Evaluation Reports (ITERs). Daily Encounter Cards (DECs) are a common form of WBA in the Emergency Department setting, and a tool to evaluate initiatives aimed at improving the quality of their completion is also needed. The purpose of this study was to provide validity evidence supporting the use of the CCERR to assess the quality of DEC completion.

Methods: This study was conducted in the Department of Emergency Medicine at the University of Ottawa. Six experts in resident assessment grouped 60 DECs into three quality categories (high, average, poor) based on how informative they judged each DEC to be for reporting judgments of a resident's performance. Eight clinical supervisors, blinded to the expert groupings, scored the 10 most representative DECs in each group using the CCERR. Mean scores were compared with a univariate ANOVA to determine whether the CCERR could discriminate DEC quality, and the reliability of the CCERR scores was estimated with a generalizability analysis.

Results: Mean CCERR scores differed across the high (37.3, SD = 1.2), average (24.2, SD = 3.3), and poor (14.4, SD = 1.4) quality groups (p < 0.001). Pairwise comparisons showed that all three quality groups differed significantly from one another (p < 0.001), indicating that the CCERR discriminated DEC quality as judged by the experts. The generalizability study demonstrated that the majority of score variance was attributable to differences among DECs; the reliability with a single rater was 0.95.

Conclusion: There is strong validity evidence to support the use of the CCERR to evaluate DEC quality.
The CCERR can be used to give supervisors feedback for improving their assessment reporting and, when used as a program evaluation instrument, provides a quantitative measure of change in assessor behavior in the quality of completed DECs.
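The group comparison described in the Methods is a standard one-way ANOVA. As an illustration only, the sketch below computes the F statistic by hand on invented CCERR scores (the study data are not available here; the sample values are simply chosen so the group means match the reported 37.3, 24.2, and 14.4):

```python
# Illustrative one-way ANOVA on hypothetical CCERR scores.
# The scores below are invented for illustration, not the study data;
# they are chosen so the group means match those reported in the abstract.
from statistics import mean

groups = {
    "high":    [36.0, 37.5, 38.0, 36.8, 38.2],   # mean 37.3
    "average": [22.0, 25.5, 26.0, 23.1, 24.4],   # mean 24.2
    "poor":    [13.0, 15.2, 14.8, 13.9, 15.1],   # mean 14.4
}

scores = [s for g in groups.values() for s in g]
grand_mean = mean(scores)
k = len(groups)          # number of groups
N = len(scores)          # total number of observations

# Between-group sum of squares: group-size-weighted squared deviations
# of group means from the grand mean.
ss_between = sum(len(g) * (mean(g) - grand_mean) ** 2 for g in groups.values())

# Within-group sum of squares: squared deviations of each score
# from its own group mean.
ss_within = sum((s - mean(g)) ** 2 for g in groups.values() for s in g)

# F statistic: ratio of between-group to within-group mean squares.
f_stat = (ss_between / (k - 1)) / (ss_within / (N - k))
print(f"F({k - 1}, {N - k}) = {f_stat:.1f}")
```

With well-separated group means and small within-group spread, as reported in the Results, the F statistic is very large and the corresponding p-value falls well below 0.001.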
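The single-rater reliability of 0.95 comes from the generalizability analysis. In a fully crossed DEC-by-rater design, the coefficient is the DEC variance divided by the DEC variance plus error variance averaged over raters. The sketch below shows that arithmetic with hypothetical variance components (invented values, chosen so the single-rater coefficient comes out at 0.95 as reported):

```python
# Hypothetical variance components for a fully crossed DEC x rater G-study.
# These values are invented for illustration; they are chosen so the
# single-rater coefficient matches the 0.95 reported in the abstract.
var_dec = 95.0       # variance due to DECs (the object of measurement)
var_rater = 1.0      # variance due to rater leniency/stringency
var_residual = 4.0   # DEC x rater interaction confounded with error

def phi(n_raters: int) -> float:
    """Absolute (Phi) coefficient for scores averaged over n_raters."""
    return var_dec / (var_dec + (var_rater + var_residual) / n_raters)

print(f"Phi with 1 rater:  {phi(1):.2f}")   # 95 / (95 + 5) = 0.95
print(f"Phi with 8 raters: {phi(8):.3f}")
```

The reported finding that most score variance was due to differences among DECs corresponds to `var_dec` dominating the denominator, which is why even a single rater yields high reliability; averaging over more raters shrinks the error term further.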