Dataset Bias Research Articles

Abstract Background The goal of awake craniotomy is to safely optimize the extent-of-resection in patients with glioma. Lower postoperative tumor volume is associated with better overall survival, emphasizing the importance of optimal surgical resection. Awake craniotomy causes less postoperative neurological deficits than surgery under general anesthesia, however, it is unclear whether awake craniotomy leads to lower postoperative tumor volume compared with general anesthesia in presumed lower grade glioma. Methods Retrospective, matched cohort study in patients with astrocytoma, IDH-mutant grade 2 and 3, oligodendroglioma, IDH-mutant, and 1p/19q-codeleted grade 2 and 3, and low-grade glioma, IDH-wildtype (now designated glioblastoma, IDH-wildtype), who underwent resection with awake craniotomy or under general anesthesia between 2003 and 2021 at Erasmus MC Brain Tumor Center. Pre- and postoperative tumor volumes were measured by semi-automatic 3D MRI-segmentation. First, we performed a multivariate logistic regression to assess which factors predicted selection for awake craniotomy. Thereafter, matching based on propensity score was attempted. Outcome variables were postoperative tumor volume, resection percentage and Karnofsky Performance Status (KPS) 3 months after surgery. Results We identified 181 awake craniotomy-patients and 135 general anesthesia-patients. Awake craniotomy-patients were younger, in better condition, more often male, with tumors more often in eloquent areas, in the left side of the brain and non-contrast enhancing. When performing matching without replacement, only 68 awake craniotomy-patients could be matched with 68 general anesthesia-patients, underscoring the imbalance in the dataset. Matching with replacement yielded a matched dataset of 181 awake craniotomy-patients with 60 general anesthesia-patients with adequate matching on most baseline variables, except for eloquent area (47% for awake craniotomy and 21.7% for general anesthesia, p &lt; 0.001). In this matched dataset, median postoperative volume in awake craniotomy was 5.8 mL (IQR 0 - 92.8) vs 12.2 mL (IQR 0 - 90.1) in general anesthesia (p-value = 0.114). Resection percentages did not differ between the awake craniotomy- and general anesthesia-groups. KPS scores at 3 months after surgery did not differ between awake craniotomy and general anesthesia (p = 0.15). Conclusion Postoperative tumor volume was not lower in awake craniotomy-patients than in general anesthesia-patients. Neither were resection percentage and KPS scores at 3 months after surgery significantly different between the groups. Adequate matching was only obtainable using replacement, underscoring the risk of bias in unmatched datasets. These data should be interpreted with care, given the retrospective nature, potential residual confounding and potential lack of generalizability due to unmatched normal resection patients.

Coronavirus disease 2019 (COVID-19) started in Wuhan, China, in late 2019, and after being utterly contagious in Asian countries, it rapidly spread to other countries. This disease caused governments worldwide to declare a public health crisis with severe measures taken to reduce the speed of the spread of the disease. This pandemic affected the lives of millions of people. Many citizens that lost their loved ones and jobs experienced a wide range of emotions, such as disbelief, shock, concerns about health, fear about food supplies, anxiety, and panic. All of the aforementioned phenomena led to the spread of racism and hate against Asians in western countries, especially in the United States. An analysis of official preliminary police data by the Center for the Study of Hate & Extremism at California State University shows that Anti-Asian hate crime in 16 of America's largest cities increased by 149% in 2020. In this study, we first chose a baseline of Americans' hate crimes against Asians on Twitter. Then we present an approach to balance the biased dataset and consequently improve the performance of tweet classification. We also have downloaded 10 million tweets through the Twitter API V-2. In this study, we have used a small portion of that, and we will use the entire dataset in the future study. In this article, three thousand tweets from our collected corpus are annotated by four annotators, including three Asian and one Asian-American. Using this data, we built predictive models of hate speech using various machine learning and deep learning methods. Our machine learning methods include Random Forest, K-nearest neighbors (KNN), Support Vector Machine (SVM), Extreme Gradient Boosting (XGBoost), Logistic Regression, Decision Tree, and Naive Bayes. Our Deep Learning models include Basic Long-Term Short-Term Memory (LSTM), Bidirectional LSTM, Bidirectional LSTM with Drop out, Convolution, and Bidirectional Encoder Representations from Transformers (BERT). We also adjusted our dataset by filtering tweets that were ambiguous to the annotators based on low Fleiss Kappa agreement between annotators. Our final result showed that Logistic Regression achieved the best statistical machine learning performance with an F1 score of 0.72, while BERT achieved the best performance of the deep learning models, with an F1-Score of 0.85.

Dataset Bias Research Articles

Related Topics

Articles published on Dataset Bias

Evaluation methodology for deep learning imputation models.

A survey on bias in visual datasets

P07.11.A Impact of awake craniotomy on extent-of-resection and performance in glioma: a retrospective, propensity-score matched cohort study

Product- and Hydro-Validation of Satellite-Based Precipitation Data Sets for a Poorly Gauged Snow-Fed Basin in Turkey

Analysis of biases in automatic white balance datasets and methods

Predictive modeling of microbiological seawater quality in karst region using cascade model

Ethical, Legal, and Financial Considerations of Artificial Intelligence in Surgery.

The silent trial - the bridge between bench-to-bedside clinical AI applications.

Asian hate speech detection on Twitter during COVID-19.

Covert Network Construction, Disruption, and Resilience: A Survey

Detection of Fake News Based on Typical Machine Learning Models

Kernel dependence regularizers and Gaussian processes with applications to algorithmic fairness

Toward a perceptive pretraining framework for Audio-Visual Video Parsing

SCORCH: Improving structure-based virtual screening with machine learning classifiers, data augmentation, and uncertainty estimation

Limited generalizability of single deep neural network for surgical instrument segmentation in different surgical environments

Cost-Sensitive Metaheuristic Optimization-Based Neural Network with Ensemble Learning for Financial Distress Prediction

Robust Detection of Fake News Using LSTM and GloVe Embeddings

Obtaining Calibrated Probabilities with Personalized Ranking Models

Intelligent robust malware detection by implementing deep learning

Scene graph generation with award-punishment strategy

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Dataset Bias Research Articles

Related Topics

Articles published on Dataset Bias

Evaluation methodology for deep learning imputation models.

A survey on bias in visual datasets

P07.11.A Impact of awake craniotomy on extent-of-resection and performance in glioma: a retrospective, propensity-score matched cohort study

Product- and Hydro-Validation of Satellite-Based Precipitation Data Sets for a Poorly Gauged Snow-Fed Basin in Turkey

Analysis of biases in automatic white balance datasets and methods

Predictive modeling of microbiological seawater quality in karst region using cascade model

Ethical, Legal, and Financial Considerations of Artificial Intelligence in Surgery.

The silent trial - the bridge between bench-to-bedside clinical AI applications.

Asian hate speech detection on Twitter during COVID-19.

Covert Network Construction, Disruption, and Resilience: A Survey

Detection of Fake News Based on Typical Machine Learning Models

Kernel dependence regularizers and Gaussian processes with applications to algorithmic fairness

Toward a perceptive pretraining framework for Audio-Visual Video Parsing

SCORCH: Improving structure-based virtual screening with machine learning classifiers, data augmentation, and uncertainty estimation

Limited generalizability of single deep neural network for surgical instrument segmentation in different surgical environments

Cost-Sensitive Metaheuristic Optimization-Based Neural Network with Ensemble Learning for Financial Distress Prediction

Robust Detection of Fake News Using LSTM and GloVe Embeddings

Obtaining Calibrated Probabilities with Personalized Ranking Models

Intelligent robust malware detection by implementing deep learning

Scene graph generation with award-punishment strategy