Assessment of analysis-of-variance-based methods to quantify the random variations of observers in medical imaging measurements: guidelines to the investigator.

William F A Klein Zeggelink, Kenneth G A Gilhuijs, Augustinus A M Hart

doi:10.1118/1.1759798

Abstract

The random variations of observers in medical imaging measurements negatively affect the outcome of cancer treatment and should be taken into account during treatment by applying safety margins derived from estimates of those variations. Analysis-of-variance (ANOVA)-based methods are the preferred techniques for assessing the true individual random variations of observers, but the number of observers and the number of cases must be taken into account to achieve meaningful results. The aim of this study is twofold: first, to evaluate three representative ANOVA-based methods for typical numbers of observers and cases; second, to establish guidelines that help the investigator determine which method, how many observers, and how many cases are required to obtain an a priori chosen performance. The ANOVA-based methods evaluated are an established technique (the pairwise differences method, PWD), a new approach providing additional statistics (the residuals method, RES), and a generic technique that uses restricted maximum likelihood (REML) estimation. Monte Carlo simulations were performed to assess the performance of these methods, expressed by their accuracy (closeness of the estimates to the truth), their precision (standard error of the estimates), and the reliability of their statistical test for the significance of a difference in the random variation of an observer between two groups of cases. The highest accuracy is achieved using REML estimation, but for datasets of at least 50 cases or arrangements with 6 or more observers, the differences between the methods are negligible, with deviations from the truth well below ±3%.
For datasets of up to 100 cases, the precision of the estimated random variations is improved most by increasing the number of cases, whereas for datasets of more than 100 cases, precision is improved most efficiently by increasing the number of observers. For datasets of at least 50 cases, the standard error ranges from 30% or less with 3 observers down to 10% or less with 8 observers, and the differences in precision between the methods are negligible. The F test (PWD) is very anticonservative and should not be used, while the t test (RES) is reliable for datasets of at least 2 × 50 cases evaluated by 4 or more observers. The likelihood-ratio test (REML estimation) consistently indicates the significance of a difference in the random variation of an observer between two groups of cases, regardless of the number of cases and the number of observers. If a statistical package that performs REML estimation is available and the investigator feels confident using it, REML estimation is the preferred method for studies that involve fewer than 50 cases evaluated by fewer than 6 observers. Otherwise, the RES method is an excellent alternative because of its straightforward implementation, its completeness with respect to the provided statistics, and its overall sufficient accuracy, precision, and reliability of the provided statistical test. If neither the RES method nor REML estimation can provide sufficient performance, either more observers or more cases must be included.
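
The kind of Monte Carlo assessment described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. The abstract does not give the PWD formulas, so the sketch assumes the classical moment relation Var(x_i - x_j) = sigma_i^2 + sigma_j^2 for independent observer errors and solves it for each observer (this needs at least 3 observers). The function names, the Gaussian error model, the case spread of 10, and the choice to report accuracy and precision on the variance scale are all assumptions made for illustration.

```python
import itertools
import random
import statistics


def simulate_measurements(sigmas, n_cases, rng):
    """One row per case; each row holds one measurement per observer.

    Assumed model: measurement = true case value + zero-mean Gaussian
    observer error with observer-specific standard deviation.
    """
    rows = []
    for _ in range(n_cases):
        truth = rng.gauss(0.0, 10.0)  # arbitrary case-to-case spread (assumed)
        rows.append([truth + rng.gauss(0.0, s) for s in sigmas])
    return rows


def pairwise_difference_variances(rows):
    """Moment estimate of each observer's error variance (assumed PWD form)."""
    n_obs = len(rows[0])
    if n_obs < 3:
        raise ValueError("at least 3 observers are needed")
    # Sample variance of the difference between every pair of observers.
    pair_var = {
        (i, j): statistics.variance([r[i] - r[j] for r in rows])
        for i, j in itertools.combinations(range(n_obs), 2)
    }
    total = sum(pair_var.values())  # expectation: (n_obs - 1) * sum of variances
    estimates = []
    for i in range(n_obs):
        # Sum over pairs containing observer i:
        # expectation is (n_obs - 2) * sigma_i^2 + sum of all variances.
        s_i = sum(v for pair, v in pair_var.items() if i in pair)
        estimates.append((s_i - total / (n_obs - 1)) / (n_obs - 2))
    return estimates  # individual estimates can be negative by chance


def monte_carlo(sigmas, n_cases, n_trials, seed=0):
    """Per observer: (relative bias in %, relative standard error in %)."""
    rng = random.Random(seed)
    trials = [
        pairwise_difference_variances(simulate_measurements(sigmas, n_cases, rng))
        for _ in range(n_trials)
    ]
    summary = []
    for k, sigma in enumerate(sigmas):
        est = [t[k] for t in trials]
        true_var = sigma ** 2
        bias_pct = 100.0 * (statistics.mean(est) - true_var) / true_var
        se_pct = 100.0 * statistics.stdev(est) / true_var
        summary.append((bias_pct, se_pct))
    return summary


if __name__ == "__main__":
    # Hypothetical setting: 3 observers, 50 cases, 500 simulated datasets.
    for k, (bias, se) in enumerate(monte_carlo([1.0, 1.5, 2.0], 50, 500)):
        print(f"observer {k + 1}: bias {bias:+.1f}%, standard error {se:.1f}%")
```

Rerunning `monte_carlo` with different values of `n_cases` and different lengths of `sigmas` gives a rough feel for how accuracy and precision respond to the number of cases and observers. Because the sketch works on variances under an assumed error model, its percentages are not comparable with the figures quoted in the abstract.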

Journal: Medical Physics
Publication Date: Jun 22, 2004
Citations: 4

