SPE-9: Speech Recognition 3: Transformer Models 1 |
| Session Type: Poster |
| Time: Tuesday, 8 June, 16:30 - 17:15 |
| Location: Gather.Town |
| Virtual Session: View on Virtual Platform |
| Session Chair: Yangyang Shi, Facebook AI |
| SPE-9.1: TRANSFORMER-TRANSDUCERS FOR CODE-SWITCHED SPEECH RECOGNITION |
| Siddharth Dalmia; Carnegie Mellon University |
| Yuzong Liu; Amazon |
| Srikanth Ronanki; Amazon |
| Katrin Kirchhoff; Amazon |
| SPE-9.2: WAKE WORD DETECTION WITH STREAMING TRANSFORMERS |
| Yiming Wang; Johns Hopkins University |
| Hang Lv; Northwestern Polytechnical University |
| Daniel Povey; Xiaomi Corporation |
| Lei Xie; Northwestern Polytechnical University |
| Sanjeev Khudanpur; Johns Hopkins University |
| SPE-9.3: CAPTURING MULTI-RESOLUTION CONTEXT BY DILATED SELF-ATTENTION |
| Niko Moritz; Mitsubishi Electric Research Laboratories (MERL) |
| Takaaki Hori; Mitsubishi Electric Research Laboratories (MERL) |
| Jonathan Le Roux; Mitsubishi Electric Research Laboratories (MERL) |
| SPE-9.4: RECENT DEVELOPMENTS ON ESPNET TOOLKIT BOOSTED BY CONFORMER |
| Pengcheng Guo; Northwestern Polytechnical University; Johns Hopkins University |
| Florian Boyer; LaBRI, University of Bordeaux; Airudit |
| Xuankai Chang; Johns Hopkins University |
| Tomoki Hayashi; Nagoya University; Human Dataware Lab. Co., Ltd. |
| Yosuke Higuchi; Waseda University |
| Hirofumi Inaguma; Kyoto University |
| Naoyuki Kamo; NTT Corporation |
| Chenda Li; Shanghai Jiao Tong University |
| Daniel Garcia-Romero; Johns Hopkins University |
| Jiatong Shi; Johns Hopkins University |
| Jing Shi; Institute of Automation, Chinese Academy of Sciences, China and Johns Hopkins University |
| Shinji Watanabe; Johns Hopkins University, |
| Kun Wei; Northwestern Polytechnical University |
| Wangyou Zhang; Shanghai Jiao Tong University |
| Yuekai Zhang; Johns Hopkins University |
| SPE-9.5: HIERARCHICAL TRANSFORMER-BASED LARGE-CONTEXT END-TO-END ASR WITH LARGE-CONTEXT KNOWLEDGE DISTILLATION |
| Ryo Masumura; NTT Corporation |
| Naoki Makishima; NTT Corporation |
| Mana Ihori; NTT Corporation |
| Akihiko Takashima; NTT Corporation |
| Tomohiro Tanaka; NTT Corporation |
| Shota Orihashi; NTT Corporation |
| SPE-9.6: END-TO-END MULTI-CHANNEL TRANSFORMER FOR SPEECH RECOGNITION |
| Feng-Ju Chang; Amazon |
| Martin Radfar; Amazon |
| Athanasios Mouchtaris; Amazon |
| Brian King; Amazon |
| Siegfried Kunzmann; Amazon |