Towards helpful, customer-specific Text-To-Speech synthesis

pôle documentaire

Vous constatez une erreur ?

informations

set: Ateliers du Forum
évènements: Ateliers du Forum 2021
Type: Ensemble de conférences, symposium, congrès
Lieu de représentation: Ircam, Salle Igor-Stravinsky (Paris)
durée: 29 min
date: 19 mars 2021

The subject of automatic speech synthesis began to be popularised as early as the 1990s. Each of us has already had to deal with automatic answering machine voices that made us all suffer in the beginning. Today, however, the progress made both in terms of language comprehension and the acoustic quality of speech synthesis approaches have helped us make giant leaps forward, and new vocal services are currently seeing their quality and capabilities improve significantly with increasingly more human-sounding and expressive voices.In this presentation, I will briefly review recent advances in speech synthesis. After this introduction, I will discuss topics related to the customization of synthetic voices to the customer's needs; and this on several levels. First, at the level of the main components of oral expression: language, speech style, language register and gender for example. Then, I will address issues at the level of the utterance; prosodic for the most part (pitch and flow manipulation). Finally, I will finish by discussing the subsidiary elements to be taken into consideration in order to best meet the needs of customers and end-users of synthetic voices in our constantly changing world.

intervenants

Towards helpful, customer-specific Text-To-Speech synthesis

informations

intervenants

Les médias liés à cet évènement

From psychoacoustics to deep learning: learning low-level processing of sound with neural networks

Deep Learning for Voice processing

Tools for creative AI and noise

Xtextures - Convolutional neural networks for texture synthesis and cross synthesis

Round Table IA : Questions/discussions

Melodic Scale and Virtual Choir, Max ISiS

Greg Beller, David Guennec, Nicolas Obin, Axel Roebel, Hugues Vinet. Table ronde

Session IA - An overview of AI for Music and Audio Generation

Interaction with musical generative agents

partager

IRCAM

heures d'ouverture

accès en transports