Abstract

Recently, audio data has increasingly becomes one of the prevalent source of information, especially after the exponential growth of using Internet, digital libraries systems and digital mobile devices. The currently massive amount of audio data stimulates working on developing custom audio retrieval tools to facilitate the audio retrieval tasks. The most familiar audio retrieval systems are based on searching using keyword, title or authors. This study presents the feasibility of using MEL Frequency Cepstral Coefficients (MFCCs) to extract features and Dynamic Time Warping (DTW) to compare the test patterns for Arabic audio news. The study proposes and implements architecture for content based audio retrieval system that is dedicated for the Arabic Audio News. The proposed architecture (ARANEWS) utilizes automatic speech recognition for isolated Arabic keyword speech mode; template based automatic speech recognition approach, MFCCs and DTW. ARANEWS presents a style of retrieval system that based on modeling signal waves and measuring the similarity between features that are extracted from spoken queries and spoken keywords. One of the major components that compose ARANEWS system is feature Database (ARANEWSDB). ARANEWSDB stores the extracted features (MFCCs) from the spoken keywords that are prepared to retrieve Arabic audio news. ARANEWS supports using Query by Humming (QBH) and Query by Example (QBE) instead of using query by text.

Highlights

  • There are many ideas may come to mind during designing of audio information retrieval system for audio news

  • The fields of study in content based audio retrieval: Many fields are classified under the umbrella of content based audio retrieval such as segmentation, Automatic Speech Recognition (ASR), Music Information Retrieval (MIR) and environmental sound retrieval (Mitrovic et al, 2010)

  • This study aims to investigate in viability of using MEL Frequency Cepstral Coefficients (MFCCs) as extracted feature to recognize Arabic speech

Read more

Summary

Introduction

There are many ideas may come to mind during designing of audio information retrieval system for audio news One of these ideas is using the lyrics of news. The keywords that reflect the nature of news are chosen from extracted lyrics and each audio news file is linked with its lyrics and with its extracted keywords. This kind of retrieval system supports the query by text; the type of data that are extracted and stored inside database is text data type. This method is called spoken document retrieval; in which written queries are used to search speech archives for relevant speech information (Fujii et al, 2002)

Objectives
Methods
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call