Quality Ranking Research Articles

Medical self-diagnostic tools (or symptom checkers) are becoming an integral part of digital health and our daily lives, whereby patients are increasingly using them to identify the underlying causes of their symptoms. As such, it is essential to rigorously investigate and comprehensively report the diagnostic performance of symptom checkers using standard clinical and scientific approaches. This study aims to evaluate and report the accuracies of a few known and new symptom checkers using a standard and transparent methodology, which allows the scientific community to cross-validate and reproduce the reported results, a step much needed in health informatics. We propose a 4-stage experimentation methodology that capitalizes on the standard clinical vignette approach to evaluate 6 symptom checkers. To this end, we developed and peer-reviewed 400 vignettes, each approved by at least 5 out of 7 independent and experienced primary care physicians. To establish a frame of reference and interpret the results of symptom checkers accordingly, we further compared the best-performing symptom checker against 3 primary care physicians with an average experience of 16.6 (SD 9.42) years. To measure accuracy, we used 7 standard metrics, including M1 as a measure of a symptom checker's or a physician's ability to return a vignette's main diagnosis at the top of their differential list, F1-score as a trade-off measure between recall and precision, and Normalized Discounted Cumulative Gain (NDCG) as a measure of a differential list's ranking quality, among others. The diagnostic accuracies of the 6 tested symptom checkers vary significantly. For instance, the differences in the M1, F1-score, and NDCG results between the best-performing and worst-performing symptom checkers or ranges were 65.3%, 39.2%, and 74.2%, respectively. The same was observed among the participating human physicians, whereby the M1, F1-score, and NDCG ranges were 22.8%, 15.3%, and 21.3%, respectively. When compared against each other, physicians outperformed the best-performing symptom checker by an average of 1.2% using F1-score, whereas the best-performing symptom checker outperformed physicians by averages of 10.2% and 25.1% using M1 and NDCG, respectively. The performance variation between symptom checkers is substantial, suggesting that symptom checkers cannot be treated as a single entity. On a different note, the best-performing symptom checker was an artificial intelligence (AI)-based one, shedding light on the promise of AI in improving the diagnostic capabilities of symptom checkers, especially as AI keeps advancing exponentially.

Read full abstract

Abstract. Between 1986 and 1995, 429 hydrofracturing tests have been carried out in six now-abandoned coal mines and two coal bed methane boreholes at depths between 600 and 1950 m within the greater Ruhr region in western Germany. From these tests, stress magnitudes and orientations of the stress tensor are derived. The majority of hydrofracturing tests were carried out from mine galleries away from mine workings in a relatively undisturbed rock mass. These data along with detailed information have been disclosed recently. In combination with already published material, we provide the first comprehensive stress database of the greater Ruhr region. Our study summarises the results of the extensive in situ stress test campaign and assigns quality to each data record using the established quality ranking schemes of the World Stress Map project. The stress magnitudes suggest predominantly strike-slip stress regime, where the magnitude of the minimum horizontal stress, Shmin, is half of the magnitude of the maximum horizontal stress, SHmax, implying that the horizontal differential stress is high. We observe no particular change in the stress gradient at depth throughout the Carboniferous layers and no significant difference between tests carried out in coal mines and deep boreholes. The mean SHmax orientation varies between 133 ± 13∘ in the westernmost located Friedrich Heinrich coal mine and 168 ± 23∘ in the easternmost located Westphalia coal mine. The mean SHmax orientation, based on 87 data records from this and already published studies, of 161 ± 43∘ is in good agreement with the regional stress orientation observed in northwestern Europe. The presented public database provides in situ stress magnitude and stress orientation data records that are essential for the calibration of geomechanical numerical models on regional and/or reservoir scales for, among others, assessing stability issues of borehole trajectories, caverns, and georeservoirs in general. For an application example of this database, we estimate slip and dilation tendencies of major geological discontinuities, discovered during the 700-year-long coal mining activities in the region. The result, although burdened by high uncertainties, shows that the discontinuities striking in the N–S and NW–SE directions have a higher slip tendency compared to the ones striking ENE–WSW and NNW–SSE, whereas a high dilation tendency is observed for discontinuities striking NNW–SSE and a low dilation tendency for the ones striking ENE–WSW. The stress orientation database is available under https://doi.org/10.24406/fordatis/200 (Kruszewski et al., 2022a), the stress magnitude database is available under https://doi.org/10.24406/fordatis/201 (Kruszewski et al., 2022b), whereas the hydrofracturing test reports are available under https://doi.org/10.24406/fordatis/222 (Kruszewski et al., 2022c).

Read full abstract

Quality Ranking Research Articles

Related Topics

Articles published on Quality Ranking

Evaluating the Diagnostic Performance of Symptom Checkers: Clinical Vignette Study.

Vietnamese Higher Education Challenges: Quality Accreditation, Rankings and Ratings

Determinants of Operative Time in Arthroscopic Rotator Cuff Repair

A WEIGHTED NEURAL MATRIX FACTORIZATION HEALTH MANAGEMENT RECOMMENDATION ALGORITHM INTEGSCORING DEEP LEARNING TECHNOLOGY

HYDROXYAPATITE/Si-DOPED TiN INTERLAYER COATINGS AS POTENTIAL CANDIDATES FOR SURFACE MODIFICATION OF MEDICAL-GRADE 316 LVM STAINLESS STEEL FOR BIOIMPLANT APPLICATIONS

Developing Winning Tender Recommendation System: Fuzzy Moora Approach

Research on Information Quality Evaluation Method of Q&A Community from the Perspective of Fuzzy Evaluation

Performance feedback, blame avoidance, and data manipulation in the public sector: evidence from China’s official city air quality ranking

Noise quality estimation on portraits in realistic controlled scenarios

Quantitative evaluation of source parameters of historical earthquakes in southern Africa

Reconciling the Quality vs Popularity Dichotomy in Online Cultural Markets

The Effect of Family-Centered and Peer-Centered Education on the Sleep Quality of Hemodialysis Patients: A Randomized Clinical Trial.

Confidence-Aware Active Feedback for Interactive Instance Search

A Grey Simulation-Based Fuzzy Hierarchical Approach for Diagnosing Healthcare Service Quality

Unsupervised Palmprint Image Quality Assessment via Pseudo-Label Generation and Ranking Guidance

A Multisite Assessment of Inpatient Safety Event Rates During the Coronavirus Disease 2019 Pandemic

An examination of peer effects using high school competition realignments

Influence of air quality ranking on China's energy efficiency: spatial difference-in-differences model with multiple time periods.

In situ stress database of the greater Ruhr region (Germany) derived from hydrofracturing tests and borehole logs

Bayesian and MLE of R=P(Y>X) for Exponential Distribution Based on Varied L Ranked Set Sampling

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Quality Ranking Research Articles

Related Topics

Articles published on Quality Ranking

Evaluating the Diagnostic Performance of Symptom Checkers: Clinical Vignette Study.

Vietnamese Higher Education Challenges: Quality Accreditation, Rankings and Ratings

Determinants of Operative Time in Arthroscopic Rotator Cuff Repair

A WEIGHTED NEURAL MATRIX FACTORIZATION HEALTH MANAGEMENT RECOMMENDATION ALGORITHM INTEGSCORING DEEP LEARNING TECHNOLOGY

HYDROXYAPATITE/Si-DOPED TiN INTERLAYER COATINGS AS POTENTIAL CANDIDATES FOR SURFACE MODIFICATION OF MEDICAL-GRADE 316 LVM STAINLESS STEEL FOR BIOIMPLANT APPLICATIONS

Developing Winning Tender Recommendation System: Fuzzy Moora Approach

Research on Information Quality Evaluation Method of Q&amp;A Community from the Perspective of Fuzzy Evaluation

Performance feedback, blame avoidance, and data manipulation in the public sector: evidence from China’s official city air quality ranking

Noise quality estimation on portraits in realistic controlled scenarios

Quantitative evaluation of source parameters of historical earthquakes in southern Africa

Reconciling the Quality vs Popularity Dichotomy in Online Cultural Markets

The Effect of Family-Centered and Peer-Centered Education on the Sleep Quality of Hemodialysis Patients: A Randomized Clinical Trial.

Confidence-Aware Active Feedback for Interactive Instance Search

A Grey Simulation-Based Fuzzy Hierarchical Approach for Diagnosing Healthcare Service Quality

Unsupervised Palmprint Image Quality Assessment via Pseudo-Label Generation and Ranking Guidance

A Multisite Assessment of Inpatient Safety Event Rates During the Coronavirus Disease 2019 Pandemic

An examination of peer effects using high school competition realignments

Influence of air quality ranking on China's energy efficiency: spatial difference-in-differences model with multiple time periods.

In situ stress database of the greater Ruhr region (Germany) derived from hydrofracturing tests and borehole logs

Bayesian and MLE of R=P(Y&gt;X) for Exponential Distribution Based on Varied L Ranked Set Sampling

Research on Information Quality Evaluation Method of Q&A Community from the Perspective of Fuzzy Evaluation

Bayesian and MLE of R=P(Y>X) for Exponential Distribution Based on Varied L Ranked Set Sampling