Since there have been more and more incidents of women being harassed in the recent past, girls need to think twice before going out of their houses. Sometimes, they are not even safe in their house or workplace. These circumstance does not change for children of all genders who stay alone because of their working parents or for other reasons. Also, there is no such organized procedure to ensure safety and take women and children out of such violence and harassment. To address this problem, the authors of this paper developed an Android-based automated system to detect danger for women and children using audio from the surroundings. As the Android phone is available to everyone nowadays, they focused on using this device rather than developing a system on some external hardware. Different signal processing methods with deep learning techniques are used for this work. This work also addresses noise from the environment for any chaos and nullifies them using different noise reduction techniques such as Reduce Energy Noise, Reduce Mel Frequency Cepstrum Coefficient (MFCC) up Noise, Reduce Median Noise, Reduce Centroid Noise, Audio DeNoise, Noisereduce by Sainburf et al. & Butterworth high pass filter. The Noisereduce by Sainburg et al. along with the InceptionV3 model architecture turns out to be the best to classify audio with 95.51% accuracy. A new model called AudioViT is introduced. It uses a Visual Transformer and Residual Network to identify the audio signal. The Android application also takes necessary action when any unfavorable situation is detected. Android device users can use this application without any cost, which will pave the way to ensure the safety of women and children.
Read full abstract