2013-05-24 - Conference/Article in peer-reviewed proceedings - English - 5 page(s)

Brognaux Sandrine, Picart Benjamin, Drugman Thomas, "A New Prosody Annotation Protocol for Live Sports Commentaries" in Interspeech 2013, 1554-1558, Lyon, France, 2013

  • CREF codes: Engineering sciences (DI2000), Low-current electricity (DI2500)
  • UMONS research units: Circuit Theory and Signal Processing (F105)
  • UMONS institutes: Institut NUMEDIART pour les Technologies des Arts Numériques (Numédiart)

Abstract(s):

(English) This paper proposes a new prosody annotation protocol specific to live sports commentaries. Two levels of annotation are defined with HMM-based speech synthesis in view. Local labels are assigned to all syllables and refer to accentual phenomena. Global labels classify sequences of words into five distinct subgenres, defined in terms of valence and arousal. The objective of the study is to provide a set of labels both related to a specific function and characterized by a distinct acoustic realization. The consideration of these constraints should allow for an automatic prediction of the labels either from the text or from the speech signal. Reasonable inter-annotator scores are achieved for both annotation levels. A prosodic analysis of all labels also shows that they can usually be distinguished by specific acoustic realizations. The integration of this new annotation protocol within HMM-based speech synthesis shows promising results.

Keywords:
  • (English) Expressive speech synthesis
  • (English) Prosody
  • (English) Sports Commentaries