DI-UMONS : Dépôt institutionnel de l’université de Mons

Recherche transversale
Rechercher
(titres de publication, de périodique et noms de colloque inclus)
2013-04-29 - Article/Dans un journal avec peer-review - Anglais - 40 page(s)

Picart Benjamin , Drugman Thomas , Dutoit Thierry , "Analysis and HMM-based synthesis of hypo and hyperarticulated speech" in Computer Speech & Language, Volume 28, Issue 2, Pages 687 - 707, DOI: 10.1016/j.csl.2013.04.008

  • Edition : Academic Press
  • Codes CREF : Sciences de l'ingénieur (DI2000), Electricité courants faibles (DI2500)
  • Unités de recherche UMONS : Théorie des circuits et Traitement du signal (F105)
  • Instituts UMONS : Institut NUMEDIART pour les Technologies des Arts Numériques (Numédiart)
Texte intégral :

Abstract(s) :

(Anglais) Hypo and hyperarticulation refer to the production of speech with respectively a reduction and an increase of the articulatory efforts compared to the neutral style. Produced consciously or not, these variations of articulatory efforts depend upon the surrounding environment, the communication context and the motivation of the speaker with regard to the listener. The goal of this work is to integrate hypo and hyperarticulation into speech synthesizers, such that they are more realistic by automatically adapting their way of speaking to the contextual situation, like humans do. Based on our preliminary work, this paper provides a thorough and detailed study on the analysis and synthesis of hypo and hyperarticulated speech. It is divided into three parts. In the first one, we focus on both acoustic and phonetic modifications due to articulatory effort changes. The second part aims at developing a HMM-based speech synthesizer allowing a continuous control of the degree of articulation. This requires to first tackle the issue of speaking style adaptation to derive hypo and hyperarticulated speech from the neutral synthesizer. Once this is done, an interpolation and extrapolation of the resulting models enables to finely tune the voice so that it is generated with the desired articulatory efforts. Finally the third and last part focuses on a perceptual study of speech with a variable articulation degree, where it is analyzed how intelligibility and various other voice dimensions are affected.

Identifiants :
  • DOI : 10.1016/j.csl.2013.04.008

Mots-clés :
  • (Anglais) HTS
  • (Anglais) Speaking Style Adaptation
  • (Anglais) Speech Intelligibility
  • (Anglais) Speech Synthesis
  • (Anglais) Voice Quality
  • (Anglais) Expressive Speech
  • (Anglais) Speech Analysis