Abstract
Over the last few decades, signal feature analysis has been significantly used in a wide variety of fields. While several techniques have been proposed in the area of signal feature extraction and classification, all of these techniques are achieved by using modern computers, which rely on softwares, such as MATLAB. However, in real-time applications or portable devices, software implementation is not enough by itself, and a hardware-software co-design or fully hardware implementation needs to be considered. The selection of the right signal feature analysis tool for an application depends not only on the software performance, but also on the hardware efficiency of a method. However, there is not enough studies in existence to provide comparison of these signal feature extraction methods from the hardware implentation aspect. Therefore, the objective of this thesis is to investigate both the hardware and algorithmic perspectives of three commonly used signal feature extraction techniques: Autoregressive (AR), pole modeling, and Mel-frequency Cepstral coefficients (MFCCs). To fulfill this objective, first, the hardware analysis of AR, pole modeling, and MFCC feature extraction methods is performed by calculating the computational complexity of the mathematical equations of each method. Second the FPGA area usage of each feature extraction methods is estimated. Third, algorithmic evaluation of these three methods is performed for audio scene analysis. Once the results are obtained from the above stages, the overall performance of each feature extraction method is compared in terms of both the hardware analysis and algorithmic performances. Finally, based on the performed comparison, pole modeling feature extraction approach is proposed as the suitable method for the audio scene analysis application. The suitable method (pole modeling feature extraction) + linear discriminant analysis (LDA) classifier are implemented in Altera DE2 Board using Altera Nios II soft-core processor. The audio classification accuracy obtained using this implementation is achieved to be equal to the MATLAB implementation. The classification time for one audio sample is determined to be 0.1s, which is fast enough to be considered as a real-time system for audio scene analysis application.
Highlights
1.1 MotivationSIGNAL analysis has been a field of considerable interest and significant growth over the last century
AR modeling and Mel-frequency Cepstral coefficients (MFCCs) features have been previously proposed for audio signal classification; as highlighted in the diagram of Fig. 2.1, the pole features are used in audio signal classification for the first time
An audio classification is performed in MATLAB as follows: 1) First, all the 80 audio signals are transformed into AR, pole, and MFCC features
Summary
1.1 MotivationSIGNAL analysis has been a field of considerable interest and significant growth over the last century. The present Chapter evaluates and compares the algorithmic and hardware performances of AR, Pole, and MFCC feature analyses for environmental audio scene analysis application. This comparison is the first known work presented to the best of the author’s knowledge. The database used in the present evaluation is explained, followed by the properties of the employed classifier and feature extraction methods Both algorithmic and hardware performances of these three feature analysis methods are provided and compared. The present Chapter explains the implementation of the pole modeling feature extraction + LDA classifier using ALTERA DE2 development board and Niose II embedded system design.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.