Abstract

OBJECTIVES: Autism Spectrum Disorder (ASD) is a complex range of neurodevelopmental conditions that affect individuals’ social behaviour and communication skills. ASD data, however, often contain far more controls than cases. This poses a serious challenge when building classification models, because the resulting models tend to favour controls when classifying individuals. The problem is known as class imbalance, and it can reduce the performance of classification models derived by machine learning (ML) techniques, since individuals with ASD may remain undetected. METHODS: ML appears to help with this distressing disorder by improving outcome quality and by speeding up access to early diagnosis and subsequent treatment. A screening dataset of over 1100 instances was used for an extensive quantitative analysis. We measure the effect of class imbalance on autism screening performance by applying different data resampling techniques with an ML classifier and evaluating the resulting models with respect to sensitivity, specificity, and F1-measure. The aim is to identify which resampling methods work well for balancing autism screening data. RESULTS: The results reveal that data resampling, and especially oversampling, improves the results derived by the considered ML classifier. More importantly, models derived by the Naive Bayes classifier achieved superior sensitivity and specificity when oversampling methods were used to pre-process the autism data considered. CONCLUSION: The reported results encourage further improvement of the design and implementation of ASD screening systems using intelligent technology.
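
The evaluation setup described above can be illustrated with a minimal sketch. It assumes plain random oversampling, which is only one simple instance of the oversampling family; the abstract does not name the specific resampling methods used, and this is not the authors' implementation. The function names `random_oversample` and `screening_metrics`, the label coding (1 = case, 0 = control), and the toy data are all hypothetical.

```python
import random


def random_oversample(rows, labels, minority_label=1, seed=0):
    """Duplicate randomly chosen minority rows until both classes are the same size.

    Hypothetical helper: plain random oversampling with replacement.
    """
    rng = random.Random(seed)
    minority = [i for i, y in enumerate(labels) if y == minority_label]
    majority = [i for i, y in enumerate(labels) if y != minority_label]
    if not minority:
        raise ValueError("no minority-class instances to oversample")
    # Draw as many extra minority indices as are needed to match the majority.
    # If the minority is already at least as large, nothing is added.
    extra = [rng.choice(minority) for _ in range(len(majority) - len(minority))]
    keep = majority + minority + extra
    return [rows[i] for i in keep], [labels[i] for i in keep]


def screening_metrics(y_true, y_pred, positive=1):
    """Return sensitivity, specificity, and F1 for a binary screening task."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    tn = sum(1 for t, p in pairs if t != positive and p != positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    sensitivity = tp / (tp + fn) if (tp + fn) else 0.0  # recall on cases
    specificity = tn / (tn + fp) if (tn + fp) else 0.0  # recall on controls
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    denom = precision + sensitivity
    f1 = 2 * precision * sensitivity / denom if denom else 0.0
    return {"sensitivity": sensitivity, "specificity": specificity, "f1": f1}


# Toy, made-up data: 8 controls (0) and 2 cases (1), one feature per row.
rows = [[v] for v in range(10)]
labels = [0] * 8 + [1] * 2
balanced_rows, balanced_labels = random_oversample(rows, labels)
print(balanced_labels.count(0), balanced_labels.count(1))

# Made-up predictions, only to show how the three metrics are computed.
print(screening_metrics([1, 1, 1, 0, 0, 0, 0, 0], [1, 1, 0, 0, 0, 0, 0, 1]))
```

In a sketch like this, resampling would normally be applied to the training portion only, so that the sensitivity and specificity reported on held-out data reflect the original class distribution.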
