DI-UMONS : Dépôt institutionnel de l’université de Mons

Recherche transversale
Rechercher
(titres de publication, de périodique et noms de colloque inclus)
2009-04-19 - Colloque/Article dans les actes avec comité de lecture - Anglais - 4 page(s)

Drugman Thomas , Wilfart G., Moinet Alexis , Dutoit Thierry , "Using a pitch-synchronous residual codebook for hybrid HMM/frame selection speech synthesis" in ICASSP 2009 - International Conference on Acoustics, Speech and Signal Processing, pp. 3793-3796 , Taipei, Taiwan, 2009

  • Codes CREF : Electricité courants faibles (DI2500)
  • Unités de recherche UMONS : Théorie des circuits et traitement du signal (F105)
  • Instituts UMONS : Institut de Recherche en Technologies de l’Information et Sciences de l’Informatique (InforTech)
Texte intégral :

Abstract(s) :

(Anglais) This paper proposes a method to improve the quality delivered by statistical parametric speech synthesizers. For this, we use a codebook of pitch-synchronous residual frames, so as to construct a more realistic source signal. First a limited codebook of typical excitations is built from some training database. During the synthesis part, HMMs are used to generate filter and source coefficients. The latter coefficients contain both the pitch and a compact representation of target residual frames. The source signal is obtained by concatenating excitation frames picked up from the codebook, based on a selection criterion and taking target residual coefficients as input. Subjective results show a relevant improvement compared to the basic technique.

Identifiants :
  • DOI : 10.1109/ICASSP.2009.4960453