| AUD-33: Topics in Deep Learning for Speech and Audio | 
| Session Type: Poster | 
| Time: Friday, 11 June, 14:00 - 14:45 | 
| Location: Gather.Town | 
| Virtual Session: View on Virtual Platform | 
| Session Chair: Hirokazu Kameoka, Nippon Telegraph and Telephone Corporation | 
| AUD-33.1: UNIDIRECTIONAL MEMORY-SELF-ATTENTION TRANSDUCER FOR ONLINE SPEECH RECOGNITION | 
| Jian Luo; Ping An Technology (Shenzhen) Co., Ltd. | 
| Jianzong Wang; Ping An Technology (Shenzhen) Co., Ltd. | 
| Ning Cheng; Ping An Technology (Shenzhen) Co., Ltd. | 
| Jing Xiao; Ping An Technology (Shenzhen) Co., Ltd. | 
| AUD-33.2: ACCDOA: ACTIVITY-COUPLED CARTESIAN DIRECTION OF ARRIVAL REPRESENTATION FOR SOUND EVENT LOCALIZATION AND DETECTION | 
| Kazuki Shimada; Sony Corporation | 
| Yuichiro Koyama; Sony Corporation | 
| Naoya Takahashi; Sony Corporation | 
| Shusuke Takahashi; Sony Corporation | 
| Yuki Mitsufuji; Sony Corporation | 
| AUD-33.3: SEEN AND UNSEEN EMOTIONAL STYLE TRANSFER FOR VOICE CONVERSION WITH A NEW EMOTIONAL SPEECH DATASET | 
| Kun Zhou; National University of Singapore | 
| Berrak Sisman; Singapore University of Technology and Design | 
| Rui Liu; Singapore University of Technology and Design | 
| Haizhou Li; National University of Singapore | 
| AUD-33.4: U-CONVOLUTION BASED RESIDUAL ECHO SUPPRESSION WITH MULTIPLE ENCODERS | 
| Eesung Kim; Kakao Enterprise | 
| Jae-Jin Jeon; Kakao Enterprise | 
| Hyeji Seo; Kakao Enterprise | 
| AUD-33.5: A MULTI-CHANNEL TEMPORAL ATTENTION CONVOLUTIONAL NEURAL NETWORK MODEL FOR ENVIRONMENTAL SOUND CLASSIFICATION | 
| You Wang; Georgia Institute of Technology | 
| Chuyao Feng; Georgia Institute of Technology | 
| David Anderson; Georgia Institute of Technology | 
| AUD-33.6: A GENERAL NETWORK ARCHITECTURE FOR SOUND EVENT LOCALIZATION AND DETECTION USING TRANSFER LEARNING AND RECURRENT NEURAL NETWORK | 
| Thi Ngoc Tho Nguyen; Nanyang Technological University | 
| Ngoc Khanh Nguyen; Motional | 
| Huy Phan; Queen Mary University of London | 
| Lam Pham; Austrian Institute of Technology | 
| Kenneth Ooi; Nanyang Technological University | 
| Douglas L. Jones; University of Illinois at Urbana-Champaign | 
| Woon-Seng Gan; Nanyang Technological University |