MLSP-24: Applications in Audio and Speech Processing |
| Session Type: Poster |
| Time: Wednesday, 9 June, 16:30 - 17:15 |
| Location: Gather.Town |
| Virtual Session: View on Virtual Platform |
| Session Chair: Sven Shepstone, Bang & Olufsen |
| MLSP-24.1: WASSERSTEIN BARYCENTER TRANSPORT FOR ACOUSTIC ADAPTATION |
| Eduardo Fernandes Montesuma; Universidade Federal do Ceará |
| Fred-Maurice Ngolè Mboula; Université Paris-Saclay |
| MLSP-24.2: EFFICIENT ADVERSARIAL AUDIO SYNTHESIS VIA PROGRESSIVE UPSAMPLING |
| Youngwoo Cho; Korea Advanced Institute of Science and Technology (KAIST) |
| Minwook Chang; NCSOFT |
| Sanghyeon Lee; Korea Advanced Institute of Science and Technology (KAIST) |
| Hyoungwoo Lee; Korea University |
| Gerard Jounghyun Kim; Korea University |
| Jaegul Choo; Korea Advanced Institute of Science and Technology (KAIST) |
| MLSP-24.3: MULTI-CHANNEL SPEECH ENHANCEMENT USING GRAPH NEURAL NETWORKS |
| Panagiotis Tzirakis; Facebook |
| Anurag Kumar; Facebook |
| Jacob Donley; Facebook |
| MLSP-24.4: MULTI-DECODER DPRNN: SOURCE SEPARATION FOR VARIABLE NUMBER OF SPEAKERS |
| Junzhe Zhu; University of Illinois at Urbana-Champaign |
| Raymond Yeh; University of Illinois at Urbana-Champaign |
| Mark Hasegawa-Johnson; University of Illinois at Urbana-Champaign |
| MLSP-24.5: DATA-EFFICIENT FRAMEWORK FOR REAL-WORLD MULTIPLE SOUND SOURCE 2D LOCALIZATION |
| Guillaume Le Moing; Inria, Ecole normale superieure, CNRS, PSL Research University |
| Phongtharin Vinayavekhin; IBM Research |
| Don Joven Agravante; IBM Research |
| Tadanobu Inoue; IBM Research |
| Jayakorn Vongkulbhisal; IBM Research |
| Asim Munawar; IBM Research |
| Ryuki Tachibana; IBM Research |
| MLSP-24.6: FUSING INFORMATION STREAMS IN END-TO-END AUDIO-VISUAL SPEECH RECOGNITION |
| Wentao Yu; Ruhr University Bochum |
| Steffen Zeiler; Ruhr University Bochum |
| Dorothea Kolossa; Ruhr University Bochum |