Abstract
Spam communications are organized attempts of falsified claims with the purpose of marketing, spreading false information and deceiving the end recipient. Phone spam is an international nuisance, with the U. S. among the most spammed countries in the world in 2020. In addition to the agitating nature of these calls, criminal scams are defrauding subscribers of billions of dollars every year. Therefore, it is necessary to develop automated systems for the identification of spam calls to minimize fraud and reduce the displeasure of receiving them. The call origin, call duration and other Call Detail Records can be used to assess whether a call is fraudulent or not, but the actual audio content is overlooked. This work focuses on extracting acoustic features from voicemail recordings containing speech, which are used to train Machine Learning models that identify spam calls. Both local and global feature descriptors are used, including Mel-Frequency Cepstral Coefficients and Log-Mel Spectrum, and their efficacy for distinguishing spam from non-spam calls is explored. We demonstrate that a spam voice call can be detected while relying only on the acoustic information of the call. A further analysis of the temporal and spectral features that are most informative for the task is also presented.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.