DI-UMONS: Institutional Repository of the University of Mons

2008-08-25 - Conference paper in peer-reviewed proceedings - English - 5 page(s)

Drugman Thomas, Dubuisson Thomas, Moinet Alexis, D'alessandro Nicolas, Dutoit Thierry, "Voice Source Parameters Estimation by Fitting the Glottal Formant and the Inverse Filtering Open Phase" in 16th European Signal Processing Conference, Lausanne, Switzerland, 2008

  • CREF codes: Information and communication technologies (ICT) (DI4730), Low-current electrical engineering (DI2500)
  • UMONS research units: Circuit Theory and Signal Processing (F105)
  • UMONS institutes: Research Institute for Information Technology and Computer Science (InforTech)

Abstract(s):

(English) This paper presents two approaches to the problem of extracting the parameters of the LF source model directly from the speech waveform. The first approach relies on the glottal formant estimated from the anticausal contribution of speech, since the ZZT technique has recently been shown to deconvolve speech into its causal and anticausal components. The second method is based on the glottal open phase obtained by inverse filtering. The notion of unanalyzable frames, and the way to detect and correct them, are also presented. Once the source parameters are extracted, the coefficients of the ARX speech production model are estimated by spectral division. Decomposition of both synthetic and natural speech, as well as an analysis-synthesis test, confirms the accuracy of the proposed methods.
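
As a rough illustration of the causal/anticausal split that the abstract attributes to the ZZT technique, the sketch below assumes that ZZT refers to the zeros of the Z-transform of a speech frame and that zeros outside the unit circle are assigned to the anticausal component. It is not the authors' implementation: the function name `zzt_split` and the toy frame are hypothetical, and the windowing and frame-positioning steps that a real analysis would need are omitted.

```python
import numpy as np

def zzt_split(frame):
    """Return (zeros inside or on the unit circle, zeros outside it) for one frame."""
    x = np.asarray(frame, dtype=float)
    # X(z) = sum_n x[n] z^(-n) has the same nonzero roots as the polynomial
    # x[0] z^(N-1) + ... + x[N-1], which is the coefficient order np.roots expects.
    zeros = np.roots(x)
    modulus = np.abs(zeros)
    inside = zeros[modulus <= 1.0]   # assumed to belong to the causal part
    outside = zeros[modulus > 1.0]   # assumed to belong to the anticausal part
    return inside, outside

# Toy frame (hypothetical): a factor with a zero at 0.5 convolved
# with a factor with a zero at 2.0.
frame = np.convolve([1.0, -0.5], [1.0, -2.0])
inside, outside = zzt_split(frame)
```

In this toy case the two factors are recovered as one zero on each side of the unit circle. On real speech, the spectrum rebuilt from the outside zeros would be where one looks for the glottal formant mentioned in the abstract.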