Description Of Dataset Research Articles

Canada is exposed to rare but potentially destructive earthquakes that threaten densely settled metropolitan centers in many parts of the country. To assess the impacts and consequences of future natural-hazard events and help advance policy goals and objectives of the Sendai Framework for Disaster Risk Reduction, Natural Resources Canada, through a collaborative partnership with the Global Earthquake Model Foundation, produced a national seismic risk model. Developing this model has required the creation of a national exposure inventory, Canadian-specific fragility and vulnerability curves, and significant simplification of the Canadian Seismic Hazard Model which forms the basis for the design seismic hazard values of the National Building Code of Canada. Using the Global Earthquake Model Foundation’s OpenQuake Engine, probabilistic stochastic risk modeling is completed under baseline and simulated retrofit conditions to assess seismic risk at the neighborhood level for all settled areas in Canada. Output risk metrics include the expected immediate physical impacts of earthquake events such as building damage, casualties, and direct economic losses. This article documents the technical details of the modeling approach including a description of novel data sets in use, a summary of the extensive sensitivity testing undertaken, and characterization of quality control implemented in the absence of usable validating earthquake loss data. The results from this model, such as loss exceedance curves and annual average losses, provide an open, accessible and quantitative base of evidence for decision-making at local, regional, and national levels. As a large country with a complex seismic hazard model and dispersed populations, this Canadian study is unique. However, the challenges faced and solutions offered are likely to be of interest to other nations pursuing similar programs.

Read full abstract

Background: While skin cancers are less prevalent in people with skin of color, they are more often diagnosed at later stages and have a poorer prognosis. The use of artificial intelligence (AI) models can potentially improve early detection of skin cancers; however, the lack of skin color diversity in training datasets may only widen the pre-existing racial discrepancies in dermatology. Objective: The aim of this study was to systematically review the technique, quality, accuracy, and implications of studies using AI models trained or tested in populations with skin of color for classification of pigmented skin lesions. Methods: PubMed was used to identify any studies describing AI models for classification of pigmented skin lesions. Only studies that used training datasets with at least 10% of images from people with skin of color were eligible. Outcomes on study population, design of AI model, accuracy, and quality of the studies were reviewed. Results: Twenty-two eligible articles were identified. The majority of studies were trained on datasets obtained from Chinese (7/22), Korean (5/22), and Japanese populations (3/22). Seven studies used diverse datasets containing Fitzpatrick skin type I–III in combination with at least 10% from black Americans, Native Americans, Pacific Islanders, or Fitzpatrick IV–VI. AI models producing binary outcomes (e.g., benign vs. malignant) reported an accuracy ranging from 70% to 99.7%. Accuracy of AI models reporting multiclass outcomes (e.g., specific lesion diagnosis) was lower, ranging from 43% to 93%. Reader studies, where dermatologists’ classification is compared with AI model outcomes, reported similar accuracy in one study, higher AI accuracy in three studies, and higher clinician accuracy in two studies. A quality review revealed that dataset description and variety, benchmarking, public evaluation, and healthcare application were frequently not addressed. Conclusions: While this review provides promising evidence of accurate AI models in populations with skin of color, the majority of the studies reviewed were obtained from East Asian populations and therefore provide insufficient evidence to comment on the overall accuracy of AI models for darker skin types. Large discrepancies remain in the number of AI models developed in populations with skin of color (particularly Fitzpatrick type IV–VI) compared with those of largely European ancestry. A lack of publicly available datasets from diverse populations is likely a contributing factor, as is the inadequate reporting of patient-level metadata relating to skin color in training datasets.

Read full abstract

Description Of Dataset Research Articles

Related Topics

Articles published on Description Of Dataset

The Office of Water Prediction's Analysis of Record for Calibration, version 1.1: Dataset description and precipitation evaluation

General Intelligent Dataset Description Method and Application

Application of data fusion for automated detection of children with developmental and mental disorders: A systematic review of the last decade

Identifying and sharing per-and polyfluoroalkyl substances hot-spot areas and exposures in drinking water

Differentially Private Recurrent Variational Autoencoder For Text Privacy Preservation

A Dataset of Scalp EEG Recordings of Alzheimer’s Disease, Frontotemporal Dementia and Healthy Subjects from Routine EEG

Two sides of the same coin: Kernel partial least-squares (KPLS) for linear and non-linear multivariate calibration. A tutorial

A national seismic risk model for Canada: Methodology and scientific basis

Metadata implementation and data discoverability: A survey on university libraries' Dataverse portals

GIR dataset: A geometry and real impulse response dataset for machine learning research in acoustics

Multiclass Support Vector Data Description in Extreme Learning Machine Feature Space

Customer Churn Prediction Based on the Decision Tree and Random Forest Model

Still a “spectator”? Capabilities of the Spanish REPER and Spain’s influence in the Council of the EU

Artificial Intelligence for the Classification of Pigmented Skin Lesions in Populations with Skin of Color: A Systematic Review

Classification and Segmentation of Diabetic Retinopathy: A Systemic Review

Code4ML: a large-scale dataset of annotated Machine Learning code.

Using Ontologies to Create Machine-Actionable Datasets: Two Case Studies

Shared or Self-rule? Regional Legislative Initiatives in Multi-level Spain, 1979-2021

Language-Based Image Manipulation Built on Language-Guided Ranking

Supplementary material

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Description Of Dataset Research Articles

Related Topics

Articles published on Description Of Dataset

The Office of Water Prediction's Analysis of Record for Calibration, version 1.1: Dataset description and precipitation evaluation

General Intelligent Dataset Description Method and Application

Application of data fusion for automated detection of children with developmental and mental disorders: A systematic review of the last decade

Identifying and sharing per-and polyfluoroalkyl substances hot-spot areas and exposures in drinking water

Differentially Private Recurrent Variational Autoencoder For Text Privacy Preservation

A Dataset of Scalp EEG Recordings of Alzheimer’s Disease, Frontotemporal Dementia and Healthy Subjects from Routine EEG

Two sides of the same coin: Kernel partial least-squares (KPLS) for linear and non-linear multivariate calibration. A tutorial

A national seismic risk model for Canada: Methodology and scientific basis

Metadata implementation and data discoverability: A survey on university libraries' Dataverse portals

GIR dataset: A geometry and real impulse response dataset for machine learning research in acoustics

Multiclass Support Vector Data Description in Extreme Learning Machine Feature Space

Customer Churn Prediction Based on the Decision Tree and Random Forest Model

Still a “spectator”? Capabilities of the Spanish REPER and Spain’s influence in the Council of the EU

Artificial Intelligence for the Classification of Pigmented Skin Lesions in Populations with Skin of Color: A Systematic Review

Classification and Segmentation of Diabetic Retinopathy: A Systemic Review

Code4ML: a large-scale dataset of annotated Machine Learning code.

Using Ontologies to Create Machine-Actionable Datasets: Two Case Studies

Shared or Self-rule? Regional Legislative Initiatives in Multi-level Spain, 1979-2021

Language-Based Image Manipulation Built on Language-Guided Ranking

Supplementary material