Comparison of ordinal and nominal classification trees to predict ordinal expert-based occupational exposure estimates in a case-control study.

David C Wheeler, Dalsu Baris, Molly Schwenn, Kellie J Archer, Igor Burstyn, Karla Armenti, Joanne S Colt, Alison Johnson, Patricia A Stewart, Margaret R Karagas, Debra T Silverman, Kai Yu, Melissa C Friesen

doi:10.1093/annhyg/meu098

Abstract

To evaluate occupational exposures in case-control studies, exposure assessors typically review each job individually to assign exposure estimates. This process lacks transparency and does not provide a mechanism for recreating the decision rules in other studies. In our previous work, nominal (unordered categorical) classification trees (CTs) generally successfully predicted expert-assessed ordinal exposure estimates (i.e. none, low, medium, high) derived from occupational questionnaire responses, but room for improvement remained. Our objective was to determine if using recently developed ordinal CTs would improve the performance of nominal trees in predicting ordinal occupational diesel exhaust exposure estimates in a case-control study. We used one nominal and four ordinal CT methods to predict expert-assessed probability, intensity, and frequency estimates of occupational diesel exhaust exposure (each categorized as none, low, medium, or high) derived from questionnaire responses for the 14983 jobs in the New England Bladder Cancer Study. To replicate the common use of a single tree, we applied each method to a single sample of 70% of the jobs, using 15% to test and 15% to validate each method. To characterize variability in performance, we conducted a resampling analysis that repeated the sample draws 100 times. We evaluated agreement between the tree predictions and expert estimates using Somers' d, which measures differences in terms of ordinal association between predicted and observed scores and can be interpreted similarly to a correlation coefficient. 
From the resampling analysis, compared with the nominal tree, an ordinal CT method that used a quadratic misclassification function and controlled tree size based on total misclassification cost had a slightly better predictive performance that was statistically significant for the frequency metric (Somers' d: nominal tree = 0.61; ordinal tree = 0.63) and similar performance for the probability (nominal = 0.65; ordinal = 0.66) and intensity (nominal = 0.65; ordinal = 0.65) metrics. The best ordinal CT predicted fewer cases of large disagreement with the expert assessments (i.e. no exposure predicted for a job with high exposure and vice versa) compared with the nominal tree across all of the exposure metrics. For example, the percent of jobs with expert-assigned high intensity of exposure that the model predicted as no exposure was 29% for the nominal tree and 22% for the best ordinal tree. The overall agreements were similar across CT models; however, the use of ordinal models reduced the magnitude of the discrepancy when disagreements occurred. As the best performing model can vary by situation, researchers should consider evaluating multiple CT methods to maximize the predictive performance within their data.
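
The two quantities the abstract relies on, Somers' d and a quadratic misclassification cost, can be illustrated with a minimal sketch. This is not the authors' code: the function names, the 0 to 3 coding of none/low/medium/high, and the choice to compute d for the predictions given the expert estimates (counting only pairs not tied on the expert score) are assumptions made here for illustration.

```python
from itertools import combinations

# Assumed integer coding of the four ordinal exposure categories.
LEVELS = {"none": 0, "low": 1, "medium": 2, "high": 3}


def somers_d(expert, predicted):
    """Somers' d of predicted given expert.

    d = (concordant - discordant) / (pairs not tied on the expert score).
    Pairs tied only on the prediction stay in the denominator, so ties
    in the predictions pull d toward zero.
    """
    concordant = discordant = untied_on_expert = 0
    for (e1, p1), (e2, p2) in combinations(zip(expert, predicted), 2):
        if e1 == e2:
            continue  # pair carries no ordering information on the expert side
        untied_on_expert += 1
        product = (e1 - e2) * (p1 - p2)
        if product > 0:
            concordant += 1
        elif product < 0:
            discordant += 1
    if untied_on_expert == 0:
        raise ValueError("all expert scores are tied; Somers' d is undefined")
    return (concordant - discordant) / untied_on_expert


def quadratic_cost(expert, predicted):
    """Total quadratic misclassification cost: sum of squared category gaps."""
    return sum((e - p) ** 2 for e, p in zip(expert, predicted))


if __name__ == "__main__":
    # Toy data, invented for illustration only.
    expert = [LEVELS[x] for x in ["none", "low", "medium", "high"]]
    predicted = [LEVELS[x] for x in ["none", "low", "low", "high"]]
    print(somers_d(expert, predicted))
    print(quadratic_cost(expert, predicted))
```

Under a quadratic cost, predicting "none" for a job the expert rated "high" costs 9, while predicting "medium" costs 1. This is why a tree grown against such a cost tends to avoid large disagreements even when overall agreement is similar to that of a nominal tree.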

Journal: The Annals of occupational hygiene
Publication Date: Nov 27, 2014
Citations: 20

Similar Papers

Inside the black box: starting to uncover the underlying decision rules used in a one-by-one expert assessment of occupational exposure in case-control studies
David C Wheeler ... Molly Schwenn
Occupational and Environmental Medicine | VOL. 70
15 Nov 2012

Comparison of Algorithm-based Estimates of Occupational Diesel Exhaust Exposure to Those of Multiple Independent Raters in a Population-based Case–Control Study
...
The Annals of Occupational Hygiene | VOL. 57
25 Nov 2012

Occupational heat exposure and prostate cancer risk: a pooled analysis of case-control studies
Alice Hinchliffe ... Florence Menegaux
ISEE Conference Abstracts | VOL. 2022
18 Sep 2022

Estimation of Source-Specific Occupational Benzene Exposure in a Population-Based Case-Control Study of Non-Hodgkin Lymphoma.
Pamela J Dopart ... Qing Lan
Annals of work exposures and health | VOL. 63
27 Aug 2019
