19 mars 2021 32 min
19 mars 2021 29 min
19 mars 2021 20 min
19 mars 2021 20 min
19 mars 2021 30 min
19 mars 2021 26 min
19 mars 2021 20 min
19 mars 2021 47 min
19 mars 2021 21 min
29 novembre 2006 20 min
29 novembre 2006 01 h 07 min
29 novembre 2006 59 min
29 novembre 2006 12 min
29 novembre 2006 50 min
29 novembre 2006 47 min
29 novembre 2006 18 min
29 novembre 2006 51 min
0:00/0:00
Mel-filterbanks are fixed, engineered audio features which emulate human perception and have been used through the history of audio understanding up to today. However, their undeniable qualities are counterbalanced by the fundamental limitations of handmade representations. In this talk, I will present LEAF, a new, lightweight, fully learnable neural network that can be used as a drop-in replacement of mel-filterbanks. LEAF learns all operations of audio features extraction, from filtering to pooling, compression and normalization, and can be integrated into any neural network at a negligible parameter cost, to adapt to the task at hand. I will show how LEAF outperforms mel-filterbanks on a wide range of audio signals, including speech, music, audio events and animal sounds, providing a general-purpose learned frontend for audio classification.
1, place Igor-Stravinsky
75004 Paris
+33 1 44 78 48 43
Du lundi au vendredi de 9h30 à 19h
Fermé le samedi et le dimanche
Hôtel de Ville, Rambuteau, Châtelet, Les Halles
Institut de Recherche et de Coordination Acoustique/Musique
Copyright © 2022 Ircam. All rights reserved.