AUD-32: Audio for Multimedia and Audio Processing Systems |
| Session Type: Poster |
| Time: Friday, 11 June, 14:00 - 14:45 |
| Location: Gather.Town |
| Virtual Session: View on Virtual Platform |
| Session Chair: Daniele Giacobello, Sonos Inc. |
| AUD-32.1: LIGHTWEIGHT AND INTERPRETABLE NEURAL MODELING OF AN AUDIO DISTORTION EFFECT USING HYPERCONDITIONED DIFFERENTIABLE BIQUADS |
| Shahan Nercessian; iZotope, Inc. |
| Andy Sarroff; iZotope, Inc. |
| Kurt James Werner; iZotope, Inc. |
| AUD-32.3: ATTACKING AND DEFENDING BEHIND A PSYCHOACOUSTICS-BASED CAPTCHA |
| Chih-Hsiang Huang; National Tsing Hua University |
| Po-Hao Wu; National Tsing Hua University |
| Yi-Wen Liu; National Tsing Hua University |
| Shan-Hung Wu; National Tsing Hua University |
| AUD-32.4: DOUBLE-DCCCAE: ESTIMATION OF BODY GESTURES FROM SPEECH WAVEFORM |
| JinHong Lu; University of Edinburgh |
| TianHang Liu; University of Edinburgh |
| Shuzhuang Xu; University of Edinburgh |
| Hiroshi Shimodaira; University of Edinburgh |
| AUD-32.5: AUDIO REPLAY SPOOF ATTACK DETECTION BY JOINT SEGMENT-BASED LINEAR FILTER BANK FEATURE EXTRACTION AND ATTENTION-ENHANCED DENSENET-BILSTM NETWORK |
| Lian Huang; University of Macau |
| Chi-Man Pun; University of Macau |
| AUD-32.6: INVESTIGATING LOCAL AND GLOBAL INFORMATION FOR AUTOMATED AUDIO CAPTIONING WITH TRANSFER LEARNING |
| Xuenan Xu; Shanghai Jiao Tong University |
| Heinrich Dinkel; Shanghai Jiao Tong University |
| Mengyue Wu; Shanghai Jiao Tong University |
| Zeyu Xie; Shanghai Jiao Tong University |
| Kai Yu; Shanghai Jiao Tong University |