High-quality Datasets Research Articles

Abstract Background The European healthcare system is reliant on its digital transformation to deal with challenges like rising expenditures or workforce shortage. The digital transformation is inevitably accompanied by the implementation of electronic medical records (EMR) and the ongoing adaptation of existing ones. Systematic reviews indicate that this process can have an impact on medical records data quality (DQ) [1]. At micro level, DQ is essential to ensure high quality of care. At macro level, sufficient DQ is a prerequisite for big data analyzability. In this field, completeness is a commonly analyzed dimension of DQ and empirical results indicate that completeness can improve but also deteriorate as a result of the described implementation or adoption of EMRs [2]. The aim of this work was to investigate the implementation of EMRs in comparable settings and to observe and discuss possible differences in the change in DQ. Methods Data was collected on three surgical clinics of a German academic teaching hospital before and after the implementation of an EMR. Paper-based and electronic medical records were compared. Analysis focused on ten items that were commonly documented in both record types (e.g. pain). T-tests and χ²-tests were used to compare average completeness per record type and percentage of completeness per item. Results A total of N = 659 records was analyzed. Overall, results show a significant improvement in completeness from an average of 6.0/10 items in the paper-based record type to 7.2/10 in the EMR (p&lt;.05). At clinic level, improvement rates vary from 0.9 to 1.4. At the level of the specific items, significant deteriorations are visible in certain clinics. Conclusions Results suggest that DQs variability is context-dependent (e.g. on the clinic’s turnover rate or its patient’s length of stay). Due to the unavoidable digital transformation, a detailed context and needs analysis involving all stakeholders should be carried out before any changes are made. Key messages • The application of advanced analytics such as big data or AI training is reliant on the availability of high-quality datasets. • Electronic medical records have been demonstrated to enhance data quality, but it remains uncertain how and why improvements appear to be context-dependent.

Read full abstract

The application of remote sensing technology in water body detection has become increasingly widespread, offering significant value for environmental monitoring, hydrological research, and disaster early warning. However, the existing methods face challenges in multi-scene and multi-temporal water body detection, including the diverse variations in water body shapes and sizes that complicate detection; the complexity of land cover types, which easily leads to false positives and missed detections; the high cost of acquiring high-resolution images, limiting long-term applications; and the lack of effective handling of multi-temporal data, making it difficult to capture the dynamic changes in water bodies. To address these challenges, this study proposes a novel network for multi-scene and multi-temporal water body detection based on spatiotemporal feature extraction, named TSAE-UNet. TSAE-UNet integrates convolutional neural networks (CNN), depthwise separable convolutions, ConvLSTM, and attention mechanisms, significantly improving the accuracy and robustness of water body detection by capturing multi-scale features and establishing long-term dependencies. The Otsu method was employed to quickly process Sentinel-1A and Sentinel-2 images, generating a high-quality training dataset. In the first experiment, five rectangular areas of approximately 37.5 km2 each were selected to validate the water body detection performance of the TSAE-UNet model across different scenes. The second experiment focused on Jining City, Shandong Province, China, analyzing the monthly water body changes from 2020 to 2022 and the quarterly changes in 2022. The experimental results demonstrate that TSAE-UNet excels in multi-scene and long-term water body detection, achieving a precision of 0.989, a recall of 0.983, an F1 score of 0.986, and an IoU of 0.974, significantly outperforming FCN, PSPNet, DeepLabV3+, ADCNN, and MECNet.

Read full abstract

High-quality Datasets Research Articles

Articles published on High-quality Datasets

Blending is all you need: Data-centric ensemble synthetic data

Bi-directional information interaction for multi-modal 3D object detection in real-world traffic scenes

Mapping human footprint changes over Qingzang Plateau

A Multi-Scale Feature Fusion Deep Learning Network for the Extraction of Cropland Based on Landsat Data

Designing a Compact Portable Electro-Cardio Device Enhanced by Machine Learning for Early Detection of Myocardial Infraction

The Investigation of Hyperparameters Influence on Fruit and Vegetable Image Recognition Performance Using SVM

Improving data quality by implementing an electronic medical record seems to depend on the context

WaterGPT: Training a Large Language Model to Become a Hydrology Expert

DMFGAN: a multifeature data augmentation method for grape leaf disease identification.

Multi-Step Temperature Prognosis of Lithium-Ion Batteries for Real Electric Vehicles Based on a Novel Bidirectional Mamba Network and Sequence Adaptive Correlation

A multispectral camera in the VIS–NIR equipped with thermal imaging and environmental sensors for non invasive analysis in precision agriculture

A compound-target pairs dataset: differences between drugs, clinical candidates and other bioactive compounds

Enhanced stereodivergent evolution of carboxylesterase for efficient kinetic resolution of near-symmetric esters through machine learning

A Scoping Review on the Application of Artificial Intelligence in Treating Rare Diseases

The Zenith Total Delay Combination of International GNSS Service Repro3 and the Analysis of Its Precision

PrescDRL: deep reinforcement learning for herbal prescription planning in treatment of chronic diseases

TSAE-UNet: A Novel Network for Multi-Scene and Multi-Temporal Water Body Detection Based on Spatiotemporal Feature Extraction

Implementation of an Automated System Using Machine Learning Models to Accelerate the Process of In Silico Identification of Small Molecules As Drug Candidates.

A LSTM algorithm-driven deep learning approach to estimating repair and maintenance costs of apartment buildings

Accelerated Data Engine: A faster dataset construction workflow for computer vision applications in commercial livestock farms

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

High-quality Datasets Research Articles

Articles published on High-quality Datasets

Blending is all you need: Data-centric ensemble synthetic data

Bi-directional information interaction for multi-modal 3D object detection in real-world traffic scenes

Mapping human footprint changes over Qingzang Plateau

A Multi-Scale Feature Fusion Deep Learning Network for the Extraction of Cropland Based on Landsat Data

Designing a Compact Portable Electro-Cardio Device Enhanced by Machine Learning for Early Detection of Myocardial Infraction

The Investigation of Hyperparameters Influence on Fruit and Vegetable Image Recognition Performance Using SVM

Improving data quality by implementing an electronic medical record seems to depend on the context

WaterGPT: Training a Large Language Model to Become a Hydrology Expert

DMFGAN: a multifeature data augmentation method for grape leaf disease identification.

Multi-Step Temperature Prognosis of Lithium-Ion Batteries for Real Electric Vehicles Based on a Novel Bidirectional Mamba Network and Sequence Adaptive Correlation

A multispectral camera in the VIS–NIR equipped with thermal imaging and environmental sensors for non invasive analysis in precision agriculture

A compound-target pairs dataset: differences between drugs, clinical candidates and other bioactive compounds

Enhanced stereodivergent evolution of carboxylesterase for efficient kinetic resolution of near-symmetric esters through machine learning

A Scoping Review on the Application of Artificial Intelligence in Treating Rare Diseases

The Zenith Total Delay Combination of International GNSS Service Repro3 and the Analysis of Its Precision

PrescDRL: deep reinforcement learning for herbal prescription planning in treatment of chronic diseases

TSAE-UNet: A Novel Network for Multi-Scene and Multi-Temporal Water Body Detection Based on Spatiotemporal Feature Extraction

Implementation of an Automated System Using Machine Learning Models to Accelerate the Process of In Silico Identification of Small Molecules As Drug Candidates.

A LSTM algorithm-driven deep learning approach to estimating repair and maintenance costs of apartment buildings

Accelerated Data Engine: A faster dataset construction workflow for computer vision applications in commercial livestock farms