AUD-23: Detection and Classification of Acoustic Scenes and Events 4: Datasets and metrics |
| Session Type: Poster |
| Time: Thursday, 10 June, 15:30 - 16:15 |
| Location: Gather.Town |
| Session Chair: Scott Wisdom, Google |
| AUD-23.1: A CURATED DATASET OF URBAN SCENES FOR AUDIO-VISUAL SCENE ANALYSIS |
| Shanshan Wang; Tampere University |
| Annamaria Mesaros; Tampere University |
| Toni Heittola; Tampere University |
| Tuomas Virtanen; Tampere University |
| AUD-23.2: IMPROVING SOUND EVENT DETECTION METRICS: INSIGHTS FROM DCASE 2020 |
| Giacomo Ferroni; Audio Analytic |
| Nicolas Turpault; INRIA |
| Juan Azcarreta; Audio Analytic |
| Francesco Tuveri; Audio Analytic |
| Romain Serizel; LORIA |
| Cagdas Bilen; Audio Analytic |
| Sacha Krstulovic; Audio Analytic |
| AUD-23.3: ARTIFICIALLY SYNTHESISING DATA FOR AUDIO CLASSIFICATION AND SEGMENTATION TO IMPROVE SPEECH AND MUSIC DETECTION IN RADIO BROADCAST |
| Satvik Venkatesh; University of Plymouth |
| David Moffat; University of Plymouth |
| Alexis Kirke; University of Plymouth |
| Gözel Shakeri; University of Glasgow |
| Stephen Brewster; University of Glasgow |
| Jörg Fachner; Anglia Ruskin University |
| Helen Odell-Miller; Anglia Ruskin University |
| Alex Street; Anglia Ruskin University |
| Nicolas Farina; Brighton and Sussex Medical School |
| Sube Banerjee; University of Plymouth |
| Eduardo Reck Miranda; University of Plymouth |
| AUD-23.4: LSSED: A LARGE-SCALE DATASET AND BENCHMARK FOR SPEECH EMOTION RECOGNITION |
| Weiquan Fan; South China University of Technology |
| Xiangmin Xu; South China University of Technology |
| Xiaofen Xing; South China University of Technology |
| Weidong Chen; South China University of Technology |
| Dongyan Huang; UBTECH Robotics Corp |
| AUD-23.5: ENHANCING AUDIO AUGMENTATION METHODS WITH CONSISTENCY LEARNING |
| Turab Iqbal; University of Surrey |
| Karim Helwani; Amazon Web Services |
| Arvindh Krishnaswamy; Amazon Web Services |
| Wenwu Wang; University of Surrey |
| AUD-23.6: FAST THRESHOLD OPTIMIZATION FOR MULTI-LABEL AUDIO TAGGING USING SURROGATE GRADIENT LEARNING |
| Thomas Pellegrini; Université de Toulouse III; IRIT |
| Timothée Masquelier; CERCO UMR 5549, CNRS; Université de Toulouse III