Average Group Delay Research Articles

The basic goal of a voice conversion system is to modify the speaker-specific characteristics while keeping the message and the environmental information contained in the speech signal intact. Speaker characteristics are reflected in speech at different levels, such as the shape of the glottal pulse (excitation source characteristics), the shape of the vocal tract (vocal tract system characteristics) and the long-term features (suprasegmental or prosodic characteristics). In this paper, we propose neural network models for developing mapping functions at each level. The features used for developing the mapping functions are extracted using pitch synchronous analysis. Pitch synchronous analysis provides accurate estimates of the vocal tract parameters by analyzing the speech signal independently in each pitch period, without influence from the adjacent pitch cycles. In this work, the instants of significant excitation are used as pitch markers to perform the pitch synchronous analysis. The instants of significant excitation correspond to the instants of glottal closure (epochs) in the case of voiced speech, and to some random excitations, such as the onset of a burst, in the case of nonvoiced speech. Instants of significant excitation are computed from the linear prediction (LP) residual of speech signals by using the property of average group delay of minimum-phase signals. In this paper, line spectral frequencies (LSFs) are used for representing the vocal tract characteristics and for developing the associated mapping function. The LP residual of the speech signal is viewed as the excitation source, and the residual samples around the instant of glottal closure are used for mapping. Prosodic parameters at the syllable and phrase levels are used for deriving the prosodic mapping function. Source and system level mapping functions are derived pitch synchronously, and the incorporation of target prosodic parameters is performed pitch synchronously using instants of significant excitation.
The performance of the voice conversion system is evaluated using listening tests. The prediction accuracy of the mapping functions (neural network models) used at different levels in the proposed voice conversion system is further evaluated using objective measures such as deviation ($D_i$), root mean square error ($\mu_{\mathrm{RMSE}}$) and correlation coefficient ($\gamma_{X,Y}$). The proposed approach (i.e., mapping and modification of parameters using the pitch synchronous approach) is shown to perform better than the earlier method (mapping the vocal tract parameters using block processing) proposed by the author.
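
The abstract names root mean square error and a correlation coefficient as objective measures but does not give their formulas. The sketch below uses the standard textbook definitions (plain RMSE and the Pearson correlation), which is an assumption and may differ from the paper's exact normalization. The deviation measure is omitted because its definition is not stated here. The function names `rmse` and `correlation` are illustrative only.

```python
import math

def rmse(target, predicted):
    """Root mean square error between target and predicted values."""
    n = len(target)
    return math.sqrt(sum((t - p) ** 2 for t, p in zip(target, predicted)) / n)

def correlation(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)

# Hypothetical values, e.g. target vs. predicted pitch in Hz
target = [120.0, 135.0, 150.0, 142.0]
predicted = [118.0, 138.0, 147.0, 145.0]
print(rmse(target, predicted), correlation(target, predicted))
```

A low RMSE together with a correlation close to 1 would indicate that the mapped parameters track the target speaker's parameters closely.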

Many cells in the auditory brainstem ‘phase lock’ to tone stimuli. From the changing phase relationship between the stimulus and the neural response in phase-locking cells, the delay between them can be estimated. This delay, however, is consistently greater than the latency measured in response to click stimuli, an important discrepancy. In this paper the different measures of delay, namely phase delay, group delay and signal-front delay, are re-examined. An improved method for computing the average group delay is presented, which accounts for the cyclical nature of the phase data. Data were collected from units in successive processing sites of the auditory pathway: the auditory nerve, the cochlear nucleus, the trapezoid body and the medial nucleus of the trapezoid body. Low-characteristic-frequency (CF) units gave multimodal post-stimulus-time histograms in response to clicks, and showed stepwise decreases in latency with increasing intensity, caused by the appearance of earlier peaks in the response rather than by shifts in the timing of the peaks. The separation of the peaks corresponded to the inverse of the unit’s CF. High-CF units also showed a decline in click latency with intensity, but to a lesser degree than low-CF units. We present an analysis which explains the difference between click latency and delay, and which, in contrast to previous accounts, is experimentally testable. We demonstrate that this new framework accounts for the discrepancy between the two measures of delay, and in addition accounts for the observed stepwise shifts in click latency for low-CF units.
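
To illustrate why the cyclical nature of phase data matters, here is a minimal sketch of one generic way to estimate an average group delay from phase-versus-frequency measurements. It is not the improved method of the paper, which the abstract does not describe. It assumes phases are given in cycles and wrapped into [0, 1), that the true phase changes by less than half a cycle between adjacent frequencies, and that group delay is the negative slope of phase against frequency. The function names are illustrative.

```python
def unwrap_cycles(phases):
    """Undo wrap-around in phases given in cycles (values in [0, 1))."""
    unwrapped = [phases[0]]
    for prev, cur in zip(phases, phases[1:]):
        # Map each step into [-0.5, 0.5) so a jump across 0/1 is not
        # mistaken for a large phase change.
        step = (cur - prev + 0.5) % 1.0 - 0.5
        unwrapped.append(unwrapped[-1] + step)
    return unwrapped

def average_group_delay(freqs_hz, phases_cycles):
    """Least-squares slope of unwrapped phase vs. frequency, negated.

    With phase in cycles and frequency in Hz, the result is in seconds.
    """
    phi = unwrap_cycles(phases_cycles)
    n = len(freqs_hz)
    f_mean = sum(freqs_hz) / n
    p_mean = sum(phi) / n
    num = sum((f - f_mean) * (p - p_mean) for f, p in zip(freqs_hz, phi))
    den = sum((f - f_mean) ** 2 for f in freqs_hz)
    return -num / den

# Synthetic example: a pure delay of 4 ms produces a phase lag of f * tau cycles.
tau = 0.004
freqs = [100.0 + 50.0 * k for k in range(19)]   # 100 Hz to 1000 Hz
phases = [(-f * tau) % 1.0 for f in freqs]      # wrapped phase in cycles
print(average_group_delay(freqs, phases))       # close to 0.004
```

Fitting a line to the wrapped phases directly would give a meaningless slope, because each wrap introduces a one-cycle jump. Unwrapping first restores the linear trend that the delay produces.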

Articles published on Average Group Delay

Wideband coplanar waveguide to edges-even broadside-coupled stripline transition

All-Order Polarization-Mode-Dispersion (PMD) Compensation at 40 Gb/s via Hyperfine Resolution Optical Pulse Shaping

Middle ear function and cochlear input impedance in chinchilla

Voice conversion by mapping the speaker-specific features using pitch synchronous approach

Integrated Bluetooth and UWB Antenna

Distributed PMD Compensation Experiment Using Polarizers

Prosody modification using instants of significant excitation

Enhanced polarization mode dispersion tolerance by optimizing receiver bandwidth

Polarization-mode dispersion compensation in WDM systems

Delay analysis in the auditory brainstem of the rat: comparison with click latency

In-field comparison among polarization-mode-dispersion measurement techniques

Systematic errors in indirect estimates of basilar membrane travel times.
