2021 IEEE International Conference on Acoustics, Speech and Signal Processing

6-11 June 2021 • Toronto, Ontario, Canada

Extracting Knowledge from Information

SPE-44: Speech Recognition 16: Robust Speech Recognition 2

Session Type: Poster
Time: Thursday, 10 June, 16:30 - 17:15
Location: Gather.Town
Session Chair: Abdelrahman Mohamed, Facebook AI Research (FAIR)
 
   SPE-44.1: AN INVESTIGATION OF END-TO-END MODELS FOR ROBUST SPEECH RECOGNITION
         Archiki Prasad; Indian Institute of Technology, Bombay
         Preethi Jyothi; Indian Institute of Technology, Bombay
         Rajbabu Velmurugan; Indian Institute of Technology, Bombay
 
   SPE-44.2: END-TO-END DEREVERBERATION, BEAMFORMING, AND SPEECH RECOGNITION WITH IMPROVED NUMERICAL STABILITY AND ADVANCED FRONTEND
         Wangyou Zhang; Shanghai Jiao Tong University
         Christoph Boeddeker; Paderborn University
         Shinji Watanabe; Johns Hopkins University
         Tomohiro Nakatani; NTT Corporation
         Marc Delcroix; NTT Corporation
         Keisuke Kinoshita; NTT Corporation
         Tsubasa Ochiai; NTT Corporation
         Naoyuki Kamo; NTT Corporation
         Reinhold Haeb-Umbach; Paderborn University
         Yanmin Qian; Shanghai Jiao Tong University
 
   SPE-44.3: STREAMING MULTI-SPEAKER ASR WITH RNN-T
         Ilya Sklyar; Amazon
         Anna Piunova; Amazon
         Yulan Liu; Amazon
 
   SPE-44.4: IMPROVING RNN TRANSDUCER WITH TARGET SPEAKER EXTRACTION AND NEURAL UNCERTAINTY ESTIMATION
         Jiatong Shi; The Johns Hopkins University
         Chunlei Zhang; Tencent AI Lab
         Chao Weng; Tencent AI Lab
         Shinji Watanabe; The Johns Hopkins University
         Meng Yu; Tencent AI Lab
         Dong Yu; Tencent AI Lab
 
   SPE-44.5: A PROGRESSIVE LEARNING APPROACH TO ADAPTIVE NOISE AND SPEECH ESTIMATION FOR SPEECH ENHANCEMENT AND NOISY SPEECH RECOGNITION
         Zhaoxu Nian; University of Science and Technology of China
         Yan-Hui Tu; University of Science and Technology of China
         Jun Du; University of Science and Technology of China
         Chin-Hui Lee; Georgia Institute of Technology
 
   SPE-44.6: THE ACCENTED ENGLISH SPEECH RECOGNITION CHALLENGE 2020: OPEN DATASETS, TRACKS, BASELINES, RESULTS AND METHODS
         Xian Shi; Audio, Speech and Language Processing Group (ASLP@NPU), School of Computer Science, Northwestern Polytechnical University
         Fan Yu; Audio, Speech and Language Processing Group (ASLP@NPU), School of Computer Science, Northwestern Polytechnical University
         Yizhou Lu; SpeechLab, Department of Computer Science and Engineering, Shanghai Jiao Tong University
         Yuhao Liang; Audio, Speech and Language Processing Group (ASLP@NPU), School of Computer Science, Northwestern Polytechnical University
         Qiangze Feng; Datatang (Beijing) Technology Co., LTD
         Daliang Wang; Datatang (Beijing) Technology Co., LTD
         Yanmin Qian; SpeechLab, Department of Computer Science and Engineering, Shanghai Jiao Tong University
         Lei Xie; Audio, Speech and Language Processing Group (ASLP@NPU), School of Computer Science, Northwestern Polytechnical University