Electromyography (EMG) signals are electrical signals generated by muscle activity and are widely used to assess the health of muscles and nerves. Class imbalance is common in EMG datasets, especially when patients present varied health conditions and data availability is limited. Imbalance is a major difficulty for machine learning models because it frequently biases predictions toward the dominant class at the expense of the minority classes. This study applies two oversampling-based data augmentation methods, the Synthetic Minority Over-sampling Technique (SMOTE) and Random Over-Sampling (ROS), to address the imbalance and improve the classification performance of an XGBoost model on underrepresented classes. SMOTE proves more effective than the competing methods; applying an appropriate oversampling technique allows the model to learn patterns from both the majority classes and the often neglected minority classes.
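To make the difference between the two oversampling methods concrete, the following is a minimal pure-Python sketch, not the study's implementation. ROS duplicates existing minority rows, whereas SMOTE creates new rows by interpolating between a minority row and one of its nearest same-class neighbours. The function names, the neighbour count `k`, the class labels, and the two-feature toy data are all assumptions made for illustration; they are not taken from the study.

```python
import random
from collections import Counter


def random_over_sample(X, y, seed=0):
    """ROS: duplicate randomly chosen rows until every class matches the largest one."""
    rng = random.Random(seed)
    counts = Counter(y)
    target = max(counts.values())
    X_out, y_out = [list(x) for x in X], list(y)
    for label, n in counts.items():
        rows = [x for x, lab in zip(X, y) if lab == label]
        for _ in range(target - n):
            X_out.append(list(rng.choice(rows)))  # exact copy of an existing row
            y_out.append(label)
    return X_out, y_out


def smote(X, y, k=5, seed=0):
    """SMOTE: add synthetic rows on the segment between a row and a near same-class neighbour."""
    rng = random.Random(seed)
    counts = Counter(y)
    target = max(counts.values())
    X_out, y_out = [list(x) for x in X], list(y)
    for label, n in counts.items():
        rows = [x for x, lab in zip(X, y) if lab == label]
        if n == target or len(rows) < 2:
            continue  # nothing to add, or no neighbour to interpolate toward
        for _ in range(target - n):
            base = rng.choice(rows)
            # Rank the other same-class rows by squared Euclidean distance to base.
            others = [r for r in rows if r is not base]
            others.sort(key=lambda r: sum((a - b) ** 2 for a, b in zip(base, r)))
            neighbour = rng.choice(others[:k])
            gap = rng.random()  # position along the segment base -> neighbour
            X_out.append([a + gap * (b - a) for a, b in zip(base, neighbour)])
            y_out.append(label)
    return X_out, y_out


# Hypothetical toy data: two made-up features per recording, 6 majority vs 2 minority rows.
X = [[0.10, 80.0], [0.12, 85.0], [0.11, 82.0], [0.13, 90.0], [0.09, 78.0], [0.14, 88.0],
     [0.40, 60.0], [0.45, 55.0]]
y = ["majority"] * 6 + ["minority"] * 2

X_ros, y_ros = random_over_sample(X, y)
X_sm, y_sm = smote(X, y)
print(Counter(y_ros), Counter(y_sm))
```

Both functions return a class-balanced dataset, but only SMOTE adds rows that were not in the original data. In practice, the `imbalanced-learn` library provides `SMOTE` and `RandomOverSampler` with a `fit_resample` method. Resampling is normally applied to the training split only, so that evaluation data keep their original class distribution.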