AUD-30: Detection and Classification of Acoustic Scenes and Events 5: Scenes |
| Session Type: Poster |
| Time: Friday, 11 June, 13:00 - 13:45 |
| Location: Gather.Town |
| Session Chair: Mark Cartwright, New York University |
| |
| AUD-30.1: CROSS-MODAL SPECTRUM TRANSFORMATION NETWORK FOR ACOUSTIC SCENE CLASSIFICATION |
| Yang Liu; University of Surrey |
| Alexandros Neophytou; Microsoft |
| Sunando Sengupta; Microsoft |
| Eric Sommerlade; Microsoft |
| |
| AUD-30.2: DOMESTIC ACTIVITIES CLUSTERING FROM AUDIO RECORDINGS USING CONVOLUTIONAL CAPSULE AUTOENCODER NETWORK |
| Ziheng Lin; School of Electronic and Information Engineering, South China University of Technology, Guangzhou |
| Yanxiong Li; School of Electronic and Information Engineering, South China University of Technology, Guangzhou |
| Zhangjin Huang; School of Electronic and Information Engineering, South China University of Technology, Guangzhou |
| Wenhao Zhang; School of Electronic and Information Engineering, South China University of Technology, Guangzhou |
| Yufeng Tan; School of Electronic and Information Engineering, South China University of Technology, Guangzhou |
| Yichun Chen; School of Electronic and Information Engineering, South China University of Technology, Guangzhou |
| Qianhua He; School of Electronic and Information Engineering, South China University of Technology, Guangzhou |
| |
| AUD-30.3: SOUND EVENT DETECTION AND SEPARATION: A BENCHMARK ON DESED SYNTHETIC SOUNDSCAPES |
| Nicolas Turpault; Université de Lorraine, CNRS, Inria, Loria |
| Romain Serizel; Université de Lorraine, CNRS, Inria, Loria |
| Scott Wisdom; Google Research |
| Hakan Erdogan; Google Research |
| John R. Hershey; Google Research |
| Eduardo Fonseca; Universitat Pompeu Fabra |
| Prem Seetharaman; Descript, Inc. |
| Justin Salamon; Adobe Research |
| |
| AUD-30.4: A TWO-STAGE APPROACH TO DEVICE-ROBUST ACOUSTIC SCENE CLASSIFICATION |
| Hu Hu; Georgia Institute of Technology |
| Chao-Han Yang; Georgia Institute of Technology |
| Xianjun Xia; Tencent Media Lab |
| Xue Bai; University of Science and Technology of China |
| Xin Tang; University of Science and Technology of China |
| Yajian Wang; University of Science and Technology of China |
| Shutong Niu; University of Science and Technology of China |
| Li Chai; University of Science and Technology of China |
| Juanjuan Li; Tencent Media Lab |
| Hongning Zhu; Tencent Media Lab |
| Feng Bao; Tencent Media Lab |
| Yuanjun Zhao; Tencent Media Lab |
| Sabato Marco Siniscalchi; University of Enna Kore |
| Yannan Wang; Tencent Media Lab |
| Jun Du; University of Science and Technology of China |
| Chin-Hui Lee; Georgia Institute of Technology |
| |
| AUD-30.5: SUBSPECTRAL NORMALIZATION FOR NEURAL AUDIO DATA PROCESSING |
| Simyung Chang; Qualcomm AI Research |
| Hyoungwoo Park; Qualcomm AI Research |
| Janghoon Cho; Qualcomm AI Research |
| Hyunsin Park; Qualcomm AI Research |
| Sungrack Yun; Qualcomm AI Research |
| Kyuwoong Hwang; Qualcomm AI Research |
| |
| AUD-30.6: SLOW-FAST AUDITORY STREAMS FOR AUDIO RECOGNITION |
| Evangelos Kazakos; University of Bristol |
| Arsha Nagrani; University of Oxford |
| Andrew Zisserman; University of Oxford |
| Dima Damen; University of Bristol |
| |