2021 IEEE International Conference on Acoustics, Speech and Signal Processing

6-11 June 2021 • Toronto, Ontario, Canada

Extracting Knowledge from Information

2021 IEEE International Conference on Acoustics, Speech and Signal Processing

6-11 June 2021 • Toronto, Ontario, Canada

Extracting Knowledge from Information

Technical Program

Click on the icon to view the manuscript on IEEE XPlore in the IEEE ICASSP 2021 Open Preview.

SPE-24: Speech Emotion 2: Neural Networks for Speech Emotion Recognition

Session Type: Poster
Time: Wednesday, 9 June, 15:30 - 16:15
Location: Gather.Town
Virtual Session: View on Virtual Platform
Session Chair: Carlos Busso, The University of Texas at Dallas
 
 SPE-24.1: A NOVEL END-TO-END SPEECH EMOTION RECOGNITION NETWORK WITH STACKED TRANSFORMER LAYERS
         Xianfeng Wang; Artificial Intelligence Application Research Center, Huawei Technologies
         Min Wang; Artificial Intelligence Application Research Center, Huawei Technologies
         Wenbo Qi; Artificial Intelligence Application Research Center, Huawei Technologies
         Wanqi Su; Artificial Intelligence Application Research Center, Huawei Technologies
         Xiangqian Wang; Artificial Intelligence Application Research Center, Huawei Technologies
         Huan Zhou; Artificial Intelligence Application Research Center, Huawei Technologies
 
 SPE-24.2: A NOVEL ATTENTION-BASED GATED RECURRENT UNIT AND ITS EFFICACY IN SPEECH EMOTION RECOGNITION
         Srividya Tirunellai Rajamani; University of Augsburg
         Kumar T. Rajamani; University of Lübeck
         Adria Mallol-Ragolta; University of Augsburg
         Shuo Liu; University of Augsburg
         Björn Schuller; University of Augsburg
 
 SPE-24.3: MAEC: MULTI-INSTANCE LEARNING WITH AN ADVERSARIAL AUTO-ENCODER-BASED CLASSIFIER FOR SPEECH EMOTION RECOGNITION
         Changzeng Fu; Osaka University
         Chaoran Liu; Advanced Telecommunications Research Institute International
         Carlos Toshinori Ishi; Advanced Telecommunications Research Institute International
         Hiroshi Ishiguro; Osaka University
 
 SPE-24.4: REPRESENTATION LEARNING WITH SPECTRO-TEMPORAL-CHANNEL ATTENTION FOR SPEECH EMOTION RECOGNITION
         Lili Guo; Tianjin University
         Longbiao Wang; Tianjin University
         Chenglin Xu; National University of Singapore
         Jianwu Dang; Tianjin University
         Eng Siong Chng; Nanyang Technological University
         Haizhou Li; National University of Singapore
 
 SPE-24.5: SPEECH EMOTION RECOGNITION USING QUATERNION CONVOLUTIONAL NEURAL NETWORKS
         Aneesh Muppidi; Stony Brook University
         Martin Radfar; Stony Brook University
 
 SPE-24.6: DOMAIN-ADVERSARIAL AUTOENCODER WITH ATTENTION BASED FEATURE LEVEL FUSION FOR SPEECH EMOTION RECOGNITION
         Yuan Gao; Tianjin University
         Jiaxing Liu; Tianjin University
         Longbiao Wang; Tianjin University
         Jianwu Dang; Japan Advanced Institute of Science and Technology