Automatic speech synthesis began to be popularised as early as the 1990s. All of us have had to deal with the voices of automated answering systems, which made us suffer in the early days. Today, however, progress in both language understanding and the acoustic quality of speech synthesis has enabled giant leaps forward, and new voice services are improving significantly in quality and capability, with increasingly human-sounding and expressive voices.

In this presentation, I will briefly review recent advances in speech synthesis. After this introduction, I will discuss how synthetic voices can be customized to the customer's needs, on several levels. First, at the level of the main components of oral expression: language, speech style, language register and gender, for example. Then, at the level of the utterance, where the issues are mostly prosodic (manipulation of pitch and speech rate). Finally, I will discuss the subsidiary elements to be taken into consideration in order to best meet the needs of customers and end users of synthetic voices in our constantly changing world.
Mel-filterbanks are fixed, engineered audio features which emulate human perception and have been used through the history of audio understanding up to today. However, their undeniable qualities are counterbalanced by the fundamental limita
19 March 2021, 18 min
Deep Neural Networks are increasingly dominating the research activities in the Analysis/Synthesis team and elsewhere. The session will present some of the recent results of the research activities related to voice processing with deep neur
19 March 2021, 32 min
We will present the latest creative tools developed by the RepMus team (ACIDS project), enabling real-time audio synthesis as well as music generation and production and synthesizer control, all in open-source code, as well as Max4Live and
19 March 2021, 20 min
Neural style transfer applied to images has received considerable interest and has triggered many research activities aiming to use the underlying strategies for manipulation of music or sound. While the many fundamental differences between
19 March 2021, 20 min
19 March 2021, 30 min
In this presentation, Greg Beller will present recent developments in the field of voice processing. Melodic Scale is a Max For Live device that automatically modifies a melodic line in real time, by changing its …
19 March 2021, 26 min
19 March 2021, 20 min
An overview of AI for Music and Audio Generation. I'll discuss recent advances in AI for music creation, focusing on Machine Learning (ML) and Human-Computer Interaction (HCI) coming from our Magenta project (g.co/magenta). I'll argue tha
19 March 2021, 47 min
The Musical Representations team explores the paradigm of computational creativity using devices inspired by artificial intelligence, particularly in the sense of new symbolic musician-machine interactions. The presentation will focus in pa
19 March 2021, 21 min
1, place Igor-Stravinsky
75004 Paris
+33 1 44 78 48 43
Monday to Friday, 9:30 a.m. to 7 p.m.
Closed Saturday and Sunday
Hôtel de Ville, Rambuteau, Châtelet, Les Halles
Institut de Recherche et de Coordination Acoustique/Musique
Copyright © 2022 Ircam. All rights reserved.