Inference Tasks Research Articles

The bidirectional encoder representations from transformers (BERT) model has attracted considerable attention in clinical applications, such as patient classification and disease prediction. However, current studies have typically progressed to application development without a thorough assessment of the model's comprehension of clinical context. Furthermore, few comparative studies have been conducted on BERT models using medical documents from non-English-speaking countries. Therefore, the applicability of BERT models trained on English clinical notes to non-English contexts is yet to be confirmed. To address these gaps in the literature, this study evaluated the contextual understanding abilities of various BERT models applied to mixed Korean and English clinical notes, with the objective of identifying the BERT model that best understands the context of such documents. Using data from 164,460 patients in a South Korean tertiary hospital, we pretrained BERT-base, BERT for Biomedical Text Mining (BioBERT), Korean BERT (KoBERT), and Multilingual BERT (M-BERT) to improve their contextual comprehension capabilities and subsequently compared their performance in 7 fine-tuning tasks. Model performance varied with the task and token usage. First, BERT-base and BioBERT excelled in tasks using classification ([CLS]) token embeddings, such as document classification. BioBERT achieved the highest F1-score of 89.32. Both BERT-base and BioBERT demonstrated their effectiveness in document pattern recognition, even with limited Korean tokens in the dictionary. Second, M-BERT exhibited superior performance in reading comprehension tasks, achieving an F1-score of 93.77. Better results were obtained when fewer words were replaced with unknown ([UNK]) tokens. Third, M-BERT excelled in the knowledge inference task, in which the correct disease names were inferred from 63 candidate disease names in a document whose disease names had been replaced with [MASK] tokens. M-BERT achieved the highest hit@10 score of 95.41. This study highlighted the effectiveness of various BERT models in a multilingual clinical domain. The findings can be used as a reference in clinical and language-based applications.
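
The hit@10 score mentioned above can be illustrated with a minimal sketch. It assumes the common definition of hit@k (the share of masked positions whose correct label appears among the model's k highest-ranked candidates, reported as a percentage); the abstract does not spell out the exact computation, and the function names and toy disease lists below are hypothetical.

```python
from typing import Sequence


def hit_at_k(ranked: Sequence[str], gold: str, k: int = 10) -> bool:
    """Return True if the gold label is among the k highest-ranked candidates."""
    return gold in ranked[:k]


def mean_hit_at_k(rankings: Sequence[Sequence[str]],
                  golds: Sequence[str],
                  k: int = 10) -> float:
    """Percentage of examples whose gold label appears in the top k."""
    if len(rankings) != len(golds):
        raise ValueError("rankings and golds must have the same length")
    if not golds:
        raise ValueError("at least one example is required")
    hits = sum(hit_at_k(r, g, k) for r, g in zip(rankings, golds))
    return 100.0 * hits / len(golds)


# Toy data (hypothetical): each inner list is a model's candidates for one
# [MASK] position, ordered from most to least likely.
rankings = [
    ["pneumonia", "asthma", "sepsis"],
    ["asthma", "sepsis", "pneumonia"],
]
golds = ["asthma", "pneumonia"]

score_at_2 = mean_hit_at_k(rankings, golds, k=2)  # only the first example hits
score_at_3 = mean_hit_at_k(rankings, golds, k=3)  # both examples hit
```

With 63 candidate disease names, as in the study, each ranking would hold 63 entries and k would be set to 10.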

It is well established that the natural frequencies (NF) format is cognitively more beneficial for Bayesian inference than the conditional probabilities (CP) format. However, empirical studies have suggested that the NF facilitation effect might be limited to specific groups of individuals. Because previous studies focused on a limited number of Bayesian inference problems evaluated by a single scoring method, it was essential to examine multiple Bayesian problems using various scoring metrics. This study also explored the impact of numeracy on Bayesian inference and assessed the non-Bayesian cognitive strategies that participants applied to the numerical information during problem solving. In a Web-based experimental survey, 175 South Korean adults were randomly assigned to 1 of 2 format groups (NF v. CP). After completing numeracy scales, participants were asked to provide estimates for 4 Bayesian inference problems and to document the numerical information used in their problem-solving process. Four scoring methods (strict rounding, loose rounding, absolute deviation, and 50-Split) were used to evaluate participants' estimates. The NF format generally outperformed the CP format across all problems, except in a chorionic villus sampling test problem when evaluated using the 50-Split method. Numeracy levels also significantly influenced Bayesian inference; participants with higher numeracy demonstrated better performance. In addition, participants used various non-Bayesian strategies influenced by the format and the nature of the problems. The NF facilitation effect was consistently observed across multiple Bayesian problems and scoring methods. Individuals with higher numeracy levels benefited more from the NF format. The use of non-Bayesian strategies varied with the format and the nature of the specific task.
The natural frequencies (NF) format is known to foster understanding of medical test results compared with the conditional probabilities (CP) format, but some studies have reported that this benefit is either nonexistent or limited to specific groups. This study aims to replicate previous empirical studies with various Bayesian problems and multiple scoring methods. The NF format fosters understanding of medical test results across all Bayesian problems by all scoring methods, except in the chorionic villus sampling (CVS) problem when using the 50-Split scoring method. Participants with high numeracy perform Bayesian inference better than those with lower numeracy. In particular, higher numerates benefit more from the NF format than lower numerates do. In addition, the public tends to use various non-Bayesian reasoning strategies depending on the format and the nature of the tasks.
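
To make the NF and CP formats concrete, the following minimal sketch computes the same posterior probability from both presentations. The numbers (a 1% base rate, 80% sensitivity, and a 10% false-positive rate) are hypothetical and are not taken from the study's problems; the function names are likewise illustrative only.

```python
from fractions import Fraction


def posterior_from_probabilities(base_rate: Fraction,
                                 sensitivity: Fraction,
                                 false_positive_rate: Fraction) -> Fraction:
    """CP format: apply Bayes' rule to conditional probabilities."""
    true_positive = base_rate * sensitivity
    false_positive = (1 - base_rate) * false_positive_rate
    return true_positive / (true_positive + false_positive)


def posterior_from_frequencies(sick_and_positive: int,
                               healthy_and_positive: int) -> Fraction:
    """NF format: divide the affected positives by all positives."""
    return Fraction(sick_and_positive,
                    sick_and_positive + healthy_and_positive)


# CP format (hypothetical numbers): P(disease) = 1%,
# P(positive | disease) = 80%, P(positive | no disease) = 10%.
cp = posterior_from_probabilities(Fraction(1, 100),
                                  Fraction(80, 100),
                                  Fraction(10, 100))

# NF format, same situation: of 1,000 people, 10 have the disease and 8 of
# them test positive; of the 990 without it, 99 test positive.
nf = posterior_from_frequencies(8, 99)

print(cp, nf, float(nf))
```

Both routes give 8/107, roughly 7.5%. In the NF format the answer requires a single division of counts, whereas the CP format requires combining three probabilities.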

Articles published on Inference Tasks

Dynamic inferences of crash risks in freeway merging zones: a spatio-temporal deep learning model

Calibrating Bayesian Generative Machine Learning for Bayesiamplification

A Joint Survey in Decentralized Federated Learning and TinyML: A Brief Introduction to Swarm Learning

Robust and energy-efficient RPL optimization algorithm with scalable deep reinforcement learning for IIoT

Lost in the Shuffle: Testing Power in the Presence of Errorful Network Vertex Labels

Multifaceted Natural Language Processing Task-Based Evaluation of Bidirectional Encoder Representations From Transformers Models for Bilingual (Korean and English) Clinical Notes: Algorithm Development and Validation.

Tree sequences as a general-purpose tool for population genetic inference.

Do young American children essentialize ethnicity? Examining inductive inferences about Hispanic/Latinx individuals in an ethnically diverse sample

Watch your tune! On the role of intonation for scalar diversity

Resource‐adaptive and OOD‐robust inference of deep neural networks on IoT devices

Multiple network embedding for anomaly detection in time series of graphs

Demonstration of 4-quadrant analog in-memory matrix multiplication in a single modulation

SCL: A sustainable deep learning solution for edge computing ecosystem in smart manufacturing

Natural Language Inference with Transformer Ensembles and Explainability Techniques

Energy-Efficient Industrial Internet of Things in Green 6G Networks

A geometrical solution underlies general neural principle for serial ordering

Natural Frequencies Improve Public Understanding of Medical Test Results: An Experimental Study on Various Bayesian Inference Tasks with Multiple Scoring Methods and Non-Bayesian Reasoning Strategies.

Variability of theory of mind versus pragmatic ability in typical and atypical development

Lower confidence and increased error sensitivity in OCD patients while learning under volatility
