Intraclass Consistency Research Articles

Background:Stroke is a prevalent disease with a significant global impact. Effective assessment of stroke severity is vital for an accurate diagnosis, appropriate treatment, and optimal clinical outcomes. The National Institutes of Health Stroke Scale (NIHSS) is a widely used scale for quantitatively assessing stroke severity. However, the current manual scoring of NIHSS is labor-intensive, time-consuming, and sometimes unreliable. Applying artificial intelligence (AI) techniques to automate the quantitative assessment of stroke on vast amounts of electronic health records (EHRs) has attracted much interest. Objective:This study aims to develop an automatic, quantitative stroke severity assessment framework through automating the entire NIHSS scoring process on Chinese clinical EHRs. Methods:Our approach consists of two major parts: Chinese clinical named entity recognition (CNER) with a domain-adaptive pre-trained large language model (LLM) and automated NIHSS scoring. To build a high-performing CNER model, we first construct a stroke-specific, densely annotated dataset “Chinese Stroke Clinical Records” (CSCR) from EHRs provided by our partner hospital, based on a stroke ontology that defines semantically related entities for stroke assessment. We then pre-train a Chinese clinical LLM coined “CliRoberta” through domain-adaptive transfer learning and construct a deep learning-based CNER model that can accurately extract entities directly from Chinese EHRs. Finally, an automated, end-to-end NIHSS scoring pipeline is proposed by mapping the extracted entities to relevant NIHSS items and values, to quantitatively assess the stroke severity. Results:Results obtained on a benchmark dataset CCKS2019 and our newly created CSCR dataset demonstrate the superior performance of our domain-adaptive pre-trained LLM and the CNER model, compared with the existing benchmark LLMs and CNER models. The high F1 score of 0.990 ensures the reliability of our model in accurately extracting the entities for the subsequent automatic NIHSS scoring. Subsequently, our automated, end-to-end NIHSS scoring approach achieved excellent inter-rater agreement (0.823) and intraclass consistency (0.986) with the ground truth and significantly reduced the processing time from minutes to a few seconds. Conclusion:Our proposed automatic and quantitative framework for assessing stroke severity demonstrates exceptional performance and reliability through directly scoring the NIHSS from diagnostic notes in Chinese clinical EHRs. Moreover, this study also contributes a new clinical dataset, a pre-trained clinical LLM, and an effective deep learning-based CNER model. The deployment of these advanced algorithms can improve the accuracy and efficiency of clinical assessment, and help improve the quality, affordability and productivity of healthcare services.

Background. Developed in 1994 by H. Kitaoka et al. the American Orthopaedic Foot and Ankle Society Ankle-Hindfoot scale (AOFAS-AHS) allows to assess pain, function, deformity and alignment of the foot and ankle. There is no Russian-language AOFAS-AHS questionnaire adapted according to current standards in the scientific literature. The aim of this paper is to perform the cross-cultural adaptation and to assess the validity of the Russian-language version of the AOFAS-AHS scale, including the evaluation of its psychometric properties. Methods. The original English version of the AOFAS-AHS scale was translated from English into Russian by a native Russian speaker. Then the questionnaire was back-translated into English by another translator whose native language is English. The next stage was the comparison of the original and back-translated versions, followed by the presentation of a pre-final cross-culturally adapted version, which was tested on 10 patients to ensure that the questions were comprehensible. The next step was the approval of the final version and its completion by patients to be operated on the hindfoot or ankle. The printed copy of the final version of the questionnaire was completed by the patients with an interval of 3 days. Total of 44 consecutive patients were enrolled, including 18 women (41%) and 26 men (59%), with a mean age of 61.7 (32-78) years. The psychometric properties of the Russian-language version of the AOFAS-AHS questionnaire (internal consistency, retest reliability, measurement error, responsiveness, and construct validity) were assessed based on the COSMIN (COnsensus-based Standards for the selection of health status Measurement INstruments) principles. Results. The mean score according to the AOFAS-AHS scale was 49.6 (min 2; max 82) out of a possible 100. The average time to complete the questionnaire was 4.2 minutes. All hypotheses formulated showed correlations of varying moderate to strong degrees. The Cronbach’s alpha coefficient was 0.76, which indicates a high level of internal consistency of the elements of the validated questionnaire. A good intra-class consistency of 0.83 was obtained, which shows a high degree of reliability of the questionnaire’s reproducibility. The ceiling and floor effects for the primary results of the questionnaires did not exceed 15%. The mean value of the Russian-language version of the AOFAS-AHS increased to 86.6 after surgical treatment. The values of standardized effect size (ES) and standardized response mean (SRM) were 5.56 and 4.83, respectively. Conclusion. The adapted Russian-language version of the AOFAS-AHS scale showed good psychometric properties and can be recommended for assessment of the physical activity in patients with ankle and hindfoot-related pathology and can also be used for monitoring the changes during the treatment.

Intraclass Consistency Research Articles

Related Topics

Articles published on Intraclass Consistency

Vehicle Classification Algorithm Based on Improved Vision Transformer

Identifying malicious traffic under concept drift based on intraclass consistency enhanced variational autoencoder

Deep conditional adversarial subdomain adaptation network for unsupervised mechanical fault diagnosis

Boundary-enhanced dual-stream network for semantic segmentation of high-resolution remote sensing images

Validity and Reliability of the Nurse Manager Performance Assessment Scale

DECIDE: A decoupled semantic and boundary learning network for precise osteosarcoma segmentation by integrating multi-modality MRI

Diabetes-related instrument to assess preventive behaviors among adolescents (DIAPBA): a tool development and psychometric research

Automatic quantitative stroke severity assessment based on Chinese clinical named entity recognition with domain-adaptive pre-trained large language model

RCPS: Rectified Contrastive Pseudo Supervision for Semi-Supervised Medical Image Segmentation.

Addressing Skewed Heterogeneity via Federated Prototype Rectification With Personalization.

Cross-Cultural Adaptation and Validation of the Russian-Language Version of the American Orthopaedic Foot and Ankle Society Ankle-Hindfoot Scale (AOFAS-AHS)

Multicenter Study of the Utility of Convolutional Neural Network and Transformer Models for the Detection and Segmentation of Meningiomas.

Agreement between measured energy expenditure and predictive energy equations in paediatric oncology

An AI-Based Image Quality Control Framework for Knee Radiographs

A Novel Remote Sensing Image Enhancement Method, the Pseudo-Tasseled Cap Transformation: Taking Buildings and Roads in GF-2 as an Example

Semantic-gap-oriented feature selection in hierarchical classification learning

Intra-class consistency and inter-class discrimination feature learning for automatic skin lesion classification.

Learning Features of Intra-Consistency and Inter-Diversity: Keys Toward Generalizable Deepfake Detection

SupCAM: Chromosome cluster types identification using supervised contrastive learning with category-variant augmentation and self-margin loss.

Novel Platform for Quantitative Assessment of Functional Object Interactions After Stroke.

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Intraclass Consistency Research Articles

Related Topics

Articles published on Intraclass Consistency

Vehicle Classification Algorithm Based on Improved Vision Transformer

Identifying malicious traffic under concept drift based on intraclass consistency enhanced variational autoencoder

Deep conditional adversarial subdomain adaptation network for unsupervised mechanical fault diagnosis

Boundary-enhanced dual-stream network for semantic segmentation of high-resolution remote sensing images

Validity and Reliability of the Nurse Manager Performance Assessment Scale

DECIDE: A decoupled semantic and boundary learning network for precise osteosarcoma segmentation by integrating multi-modality MRI

Diabetes-related instrument to assess preventive behaviors among adolescents (DIAPBA): a tool development and psychometric research

Automatic quantitative stroke severity assessment based on Chinese clinical named entity recognition with domain-adaptive pre-trained large language model

RCPS: Rectified Contrastive Pseudo Supervision for Semi-Supervised Medical Image Segmentation.

Addressing Skewed Heterogeneity via Federated Prototype Rectification With Personalization.

Cross-Cultural Adaptation and Validation of the Russian-Language Version of the American Orthopaedic Foot and Ankle Society Ankle-Hindfoot Scale (AOFAS-AHS)

Multicenter Study of the Utility of Convolutional Neural Network and Transformer Models for the Detection and Segmentation of Meningiomas.

Agreement between measured energy expenditure and predictive energy equations in paediatric oncology

An AI-Based Image Quality Control Framework for Knee Radiographs

A Novel Remote Sensing Image Enhancement Method, the Pseudo-Tasseled Cap Transformation: Taking Buildings and Roads in GF-2 as an Example

Semantic-gap-oriented feature selection in hierarchical classification learning

Intra-class consistency and inter-class discrimination feature learning for automatic skin lesion classification.

Learning Features of Intra-Consistency and Inter-Diversity: Keys Toward Generalizable Deepfake Detection

SupCAM: Chromosome cluster types identification using supervised contrastive learning with category-variant augmentation and self-margin loss.

Novel Platform for Quantitative Assessment of Functional Object Interactions After Stroke.