Since the characteristics of the glottal sound source due to vibration of the vocal cords have a great effect on the quality of synthesized speech, there have been intensive studies on glottal waves. It is known experimentally that the waveform is a rounded asymmetrical triangular wave. Many voiced source models have been proposed in which the glottal waves are parametrically represented as possible approaches toward a more natural synthesized speech. There are many unsolved problems, however, since the characteristics of the glottal source must be known for various speech utterances in order to construct the source model. Method of estimating the glottal wave from the observed speech signal include the inverse filtering method, where a filter with the inverse characteristic to the transfer function of the vocal tract is used. In this method, however, there remains the problem of how the essential error due to the separate estimations of the vocal-tract transfer function and the glottal wave can be eliminated. This paper proposes a new estimation algorithm for glottal waves, where the characteristics of the glottal waves and the vocal tract are estimated simultaneously by considering the vocal-tract transfer function, including the characteristics of the glottal source. In the proposed method, the speech generation process is represented by a nonminimum-phase model including the characteristics of the glottal source, and the glottal wave is estimated by estimating the parameters of the transfer function. In the estimation of the glottal wave, the unknown driving input signal must be estimated in parallel to the estimation of the transfer function parameters. An approximate inverse system is introduced in the proposed method, since the inverse system for the transfer function of the nonminimum-phase model is unstable. Using the proposed model, the glottal wave can be directly estimated when the vocal-tract characteristic can be represented by an all-pole model. It is also possible to use the nonminimum-phase ARMA model in this method for the analysis/synthesis of speech that includes glottal waves. The glottal wave is estimated by simulation as well as by observation of actual vowels, and satisfactory results are obtained, indicating the usefulness of the proposed estimation algorithm. © 1998 Scripta Technica, Electron Comm Jpn Pt 3, 81(11): 56–66, 1998