Abstract

The synthesis of convincing acoustic drum sounds remains an open problem. In this paper, a method for analysing and synthesising pitch glide in drums is proposed, whereby the discrete cosine transform (DCT) of an unwindowed drum sound is modelled. This is an extension of the scheme initially proposed by Kirby and Sandler [(2020). Proceedings of the 23rd International Conference on Digital Audio Effects, Vienna, Austria, pp. 155-162], which was able to reproduce key components of drum sounds accurately enough that they could not be distinguished from the reference samples. Here, drum modes were analysed in greater detail for a tom-tom struck at 67 different intensities to investigate their evolution with strike velocity. A clear evolution was observed in the DCT features, and interpolation was used to synthesise the modes of intermediate velocity. These synthesised modes were evaluated objectively through null testing, which showed that a continuous blending of strike velocities could be achieved throughout the data set. An AB listening test was also performed, where 20 participants attempted to distinguish between pairs of real and synthesised sounds. Exactly 50% accuracy was achieved overall, which demonstrates that the synthesised samples were deemed to sound as realistic as genuine samples. These results demonstrate that the DCT representation is a valuable framework for analysis and synthesis of drum sounds. It is also likely that this approach could be applied to other instruments.

Highlights

  • In this paper, we reexamine modal behaviour in drums through a novel framework that incorporates the discrete cosine transform (DCT) and Hilbert transform

  • Drum modes were analysed in greater detail for a tom-tom struck at 67 different intensities to investigate their evolution with strike velocity

  • A clear evolution was observed in the DCT features, and interpolation was used to synthesise the modes of intermediate velocity

Read more

Summary

INTRODUCTION

We reexamine modal behaviour in drums through a novel framework that incorporates the discrete cosine transform (DCT) and Hilbert transform. This representation offers a new perspective on modal oscillation, allowing us to clearly track the evolution of modal oscillation with increasing strike velocity. This analysis is performed on the fundamental mode of a tom-tom. Interpolation based synthesis is used here to generate highly accurate simulations of the modal behaviour at intermediate strike velocities for the fundamental mode This interpolation based method is merely one example that the DCT representation can be used for dynamic synthesis. The synthesised modes are evaluated through objective and perceptual means to gauge the accuracy of the synthesised intermediate velocity behaviour

Modeling instruments
The DCT
Relationship to the inverse fast Fourier ftransform synthesis
THEORETICAL BASIS
Data set
IDCT synthesis of drum sounds
DCT representation of a resonant mode
Decomposing modal features via the Hilbert transform
ANALYSIS OF THE DATA SET
Evolution of the DCT representation with the strike velocity
Interpolation of the DCT representation
EVALUATION OF SYNTHESIS
Initial test
Method
Results
PERCEPTUAL EVALUATION OF THE SYNTHESIS
Systematic sampling
Synthesis method
Experimental design
RESULTS AND DISCUSSION
VIII. CONCLUSION
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call