GLOAT (GLOttal Analysis Toolbox) is a Matlab toolbox for glottal analysis from audio recordings of speech. It includes several scripts developed by Thomas Drugman at TCTS Lab, University of Mons (Belgium). These scripts achieve the following functions:
- F0 and voiced-unvoiced decision estimation:
The algorithm based on the Summation of Residual Harmonics (SRH) has been shown to provide a robust estimation of both F0 and the voicing decisions in noisy conditions. Its use in especially severe conditions is recommended. For more information, please see:
T.Drugman, A.Alwan, Joint Robust Voicing Detection and Pitch Estimation Based on Residual Harmonics, Interspeech11, Firenze, Italy, 2011.
- Speech polarity detection:
The Oscillating Moment-based Polarity Detection (OMPD) relies on the phase shift between two oscillating moments (differing from their statistics order) and has been shown to outperform other state-of-the-art techniques. For more information, please see:
T.Drugman, T.Dutoit, Oscillating Statistical Moments for Speech Polarity Detection, Non-Linear Speech Processing Workshop (NOLISP11), Las Palmas, Gran Canaria, Spain, 2011.
- Glottal Closure Instant (GCI) detection:
The Speech Event Detection using the Residual Excitation And a Mean-based Signal (SEDREAMS) is a technique allowing a reliable, accurate and robust estimation of the GCI locations. For more information, please see:
T. Drugman, M. Thomas, J. Gudnason, P. Naylor, T. Dutoit, Detection of Glottal Closure Instants from Speech Signals: a Quantitative Review, IEEE Transactions on Audio, Speech and Language Processing, Accepted for publication.
T.Drugman, T.Dutoit, Glottal Closure and Opening Instant Detection from Speech Signals, Interspeech09, Brighton, U.K, 2009.
- Glottal source estimation:
Based on the mixed-phase properties of speech, the complex cepstrum-based decomposition allows to isolate the maximum-phase component of speech, which is related to the open phase of the glottal flow. For more information, please see:
T.Drugman, B.Bozkurt, T.Dutoit, Causal-anticausal Decomposition of Speech using Complex Cepstrum for Glottal Source Estimation, Speech Communication, Volume 53, Issue 6, July 2011, Pages 855-866, 2011.
T.Drugman, B.Bozkurt, T.Dutoit, Complex Cepstrum-based Decomposition of Speech for Glottal Source Estimation, Interspeech09, Brighton, U.K, 2009.
The toolbox is available here.
|