Using hierarchical cluster models to systematically identify groups of jobs with similar occupational questionnaire response patterns to assist rule-based expert exposure assessment in population-based studies.

Melissa C Friesen ,Debra T Silverman ,David C Wheeler ,Alison Johnson,Karla Armenti ,Roel Vermeulen,Anjoeka Pronk,Joanne S Colt ,Molly Schwenn,Margaret R Karagas ,Igor Burstyn,Susan M Shortreed ,Dalsu Baris,Kai Yu

doi:10.1093/annhyg/meu101

Abstract

Rule-based expert exposure assessment based on questionnaire response patterns in population-based studies improves the transparency of the decisions. The number of unique response patterns, however, can be nearly equal to the number of jobs. An expert may reduce the number of patterns that need assessment using expert opinion, but each expert may identify different patterns of responses that identify an exposure scenario. Here, hierarchical clustering methods are proposed as a systematic data reduction step to reproducibly identify similar questionnaire response patterns prior to obtaining expert estimates. As a proof-of-concept, we used hierarchical clustering methods to identify groups of jobs (clusters) with similar responses to diesel exhaust-related questions and then evaluated whether the jobs within a cluster had similar (previously assessed) estimates of occupational diesel exhaust exposure. Using the New England Bladder Cancer Study as a case study, we applied hierarchical cluster models to the diesel-related variables extracted from the occupational history and job- and industry-specific questionnaires (modules). Cluster models were separately developed for two subsets: (i) 5395 jobs with ≥1 variable extracted from the occupational history indicating a potential diesel exposure scenario, but without a module with diesel-related questions; and (ii) 5929 jobs with both occupational history and module responses to diesel-relevant questions. For each subset, we varied the numbers of clusters extracted from the cluster tree developed for each model from 100 to 1000 groups of jobs. Using previously made estimates of the probability (ordinal), intensity (µg m(-3) respirable elemental carbon), and frequency (hours per week) of occupational exposure to diesel exhaust, we examined the similarity of the exposure estimates for jobs within the same cluster in two ways. First, the clusters' homogeneity (defined as >75% with the same estimate) was examined compared to a dichotomized probability estimate (<5 versus ≥5%; <50 versus ≥50%). Second, for the ordinal probability metric and continuous intensity and frequency metrics, we calculated the intraclass correlation coefficients (ICCs) between each job's estimate and the mean estimate for all jobs within the cluster. Within-cluster homogeneity increased when more clusters were used. For example, ≥80% of the clusters were homogeneous when 500 clusters were used. Similarly, ICCs were generally above 0.7 when ≥200 clusters were used, indicating minimal within-cluster variability. The most within-cluster variability was observed for the frequency metric (ICCs from 0.4 to 0.8). We estimated that using an expert to assign exposure at the cluster-level assignment and then to review each job in non-homogeneous clusters would require ~2000 decisions per expert, in contrast to evaluating 4255 unique questionnaire patterns or 14983 individual jobs. This proof-of-concept shows that using cluster models as a data reduction step to identify jobs with similar response patterns prior to obtaining expert ratings has the potential to aid rule-based assessment by systematically reducing the number of exposure decisions needed. While promising, additional research is needed to quantify the actual reduction in exposure decisions and the resulting homogeneity of exposure estimates within clusters for an exposure assessment effort that obtains cluster-level expert assessments as part of the assessment process.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Using hierarchical cluster models to systematically identify groups of jobs with similar occupational questionnaire response patterns to assist rule-based expert exposure assessment in population-based studies.

Abstract

Talk to us

Similar Papers

More From: The Annals of Occupational Hygiene

Lead the way for us

Journal: The Annals of Occupational Hygiene	Publication Date: Dec 3, 2014
Citations: 10

Similar Papers

Data from Nitrated polycyclic aromatic hydrocarbon (nitro-PAH) signatures and somatic mutations in diesel exhaust-exposed bladder tumors
Nicole Gonzalez ... Stella Koutros
-
Nicole Gonzalez, et. al.Nicole Gonzalez ... Stella Koutros
16 Sep 2024
16 Sep 2024

Data from Nitrated Polycyclic Aromatic Hydrocarbon (Nitro-PAH) Signatures and Somatic Mutations in Diesel Exhaust-Exposed Bladder Tumors
Nina Rao ... Nicole Gonzalez
-
Nina Rao, et. al.Nina Rao ... Nicole Gonzalez
01 Jun 2023
01 Jun 2023

Data from Nitrated Polycyclic Aromatic Hydrocarbon (Nitro-PAH) Signatures and Somatic Mutations in Diesel Exhaust-Exposed Bladder Tumors
Nicole Gonzalez ... Stella Koutros
-
Nicole Gonzalez, et. al.Nicole Gonzalez ... Stella Koutros
17 May 2024
17 May 2024

Data from Nitrated Polycyclic Aromatic Hydrocarbon (Nitro-PAH) Signatures and Somatic Mutations in Diesel Exhaust-Exposed Bladder Tumors
Nina Rao ... Nicole Gonzalez
-
Nina Rao, et. al.Nina Rao ... Nicole Gonzalez
19 Apr 2023
19 Apr 2023

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Using hierarchical cluster models to systematically identify groups of jobs with similar occupational questionnaire response patterns to assist rule-based expert exposure assessment in population-based studies.

Abstract

Talk to us

Similar Papers

More From: The Annals of Occupational Hygiene