As training a high-performance deep neural network (DNN) model requires a large amount of data, powerful computing resources, and expert knowledge, protecting well-trained DNN models from intellectual property (IP) infringement has raised serious concerns in recent years. Most existing methods use DNN watermarks to verify the ownership of a model after IP infringement occurs, which is reactive in the sense that they cannot prevent unauthorized users from using the model in the first place. Different from these methods, in this article, we propose an active authorization control and user fingerprint tracking method for the IP protection of DNN models that utilizes a sample-specific backdoor attack. The proposed method inversely exploits sample-specific triggers in multiple ways, using them as keys to implement authorization control for the DNN model; the generated triggers are imperceptible and sample-specific to the clean images. Specifically, a U-Net model is used to generate backdoor instances. The target model is then trained on the clean images and the backdoor instances, which are inversely labeled with wrong classes and correct classes, respectively. Only authorized users can use the target model normally, by pre-processing clean images through the U-Net model. Moreover, the images processed by the U-Net model contain a unique fingerprint that can be extracted to verify and track the corresponding user's identity. This article is the first work to utilize a sample-specific backdoor attack to implement active authorization control and user fingerprint management for DNN models under black-box scenarios. Extensive experimental results on the ImageNet dataset and the YouTube Aligned Face dataset demonstrate that the proposed method is effective in protecting the DNN model from unauthorized usage: the protected model has a low inference accuracy (1.00%) for unauthorized users while maintaining a normal inference accuracy (97.67%) for authorized users. Besides, the proposed method achieves a 100% fingerprint tracking success rate on both the ImageNet and YouTube Aligned Face datasets. Moreover, it is demonstrated that the proposed method is robust against fine-tuning attacks, pruning attacks, pruning attacks with retraining, reverse-engineering attacks, adaptive attacks, and JPEG compression attacks. The code is available at https://github.com/nuaaaisec/SSAT.
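To make the inverse-labeling scheme described above concrete, the following is a minimal sketch in PyTorch of one training step. It is an illustration under stated assumptions, not the authors' implementation (see the linked repository for that): `model`, `unet`, the optimizer, and the simple label-shift choice of "wrong" classes are all hypothetical stand-ins, and the sketch freezes the U-Net for simplicity.

```python
# Hedged sketch of the inverse-labeling training step: clean images are
# assigned WRONG labels so unauthorized raw inputs fail, while U-Net-
# processed (triggered) images keep their CORRECT labels so authorized,
# pre-processed inputs work normally. All names here are illustrative.
import torch
import torch.nn as nn

def train_step(model, unet, images, labels, num_classes, optimizer):
    criterion = nn.CrossEntropyLoss()

    # Backdoor instances: clean images passed through the U-Net so each one
    # carries an imperceptible, sample-specific trigger. The U-Net is frozen
    # here as a simplification; the paper defines how it is actually trained.
    with torch.no_grad():
        triggered = unet(images)

    # Inverse labeling for clean images: map each label to a wrong class.
    # A fixed label shift is just one illustrative choice.
    wrong_labels = (labels + 1) % num_classes

    # Clean inputs are pushed toward wrong classes; triggered (authorized)
    # inputs are pushed toward their true classes.
    loss = criterion(model(images), wrong_labels) + \
           criterion(model(triggered), labels)

    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()
```

At inference time, an authorized user would simply run `model(unet(x))` on each input `x`, while an unauthorized user querying `model(x)` directly would receive the inversely trained, incorrect predictions.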