SPE-54: End-to-End Speaker Diarization and Recognition |
| Session Type: Poster |
| Time: Friday, 11 June, 13:00 - 13:45 |
| Location: Gather.Town |
| Virtual Session: View on Virtual Platform |
| Session Chair: Man-Wai Mak, The Hong Kong Polytechnic University |
| SPE-54.1: END-TO-END DIARIZATION FOR VARIABLE NUMBER OF SPEAKERS WITH LOCAL-GLOBAL NETWORKS AND DISCRIMINATIVE SPEAKER EMBEDDINGS |
| Soumi Maiti; CUNY |
| Hakan Erdogan; Google |
| Kevin Wilson; Google |
| Scott Wisdom; Google |
| Shinji Watanabe; Johns Hopkins University |
| John R. Hershey; Google |
| SPE-54.2: END-TO-END SPEAKER DIARIZATION AS POST-PROCESSING |
| Shota Horiguchi; Hitachi, Ltd. |
| Paola Garcia; Johns Hopkins University |
| Yusuke Fujita; Hitachi, Ltd. |
| Shinji Watanabe; Johns Hopkins University |
| Kenji Nagamatsu; Hitachi, Ltd. |
| SPE-54.3: BW-EDA-EEND: STREAMING END-TO-END NEURAL SPEAKER DIARIZATION FOR A VARIABLE NUMBER OF SPEAKERS |
| Eunjung Han; Amazon |
| Chul Lee; Amazon |
| Andreas Stocke; Amazon |
| SPE-54.4: INTEGRATING END-TO-END NEURAL AND CLUSTERING-BASED DIARIZATION: GETTING THE BEST OF BOTH WORLDS |
| Keisuke Kinoshita; NTT Corporation |
| Marc Delcroix; NTT Corporation |
| Naohiro Tawara; NTT Corporation |
| SPE-54.5: SIAMESE CAPSULE NETWORK FOR END-TO-END SPEAKER RECOGNITION IN THE WILD |
| Amirhossein Hajavi; Queen's University |
| Ali Etemad; Queen's University |
| SPE-54.6: A REAL-TIME SPEAKER DIARIZATION SYSTEM BASED ON SPATIAL SPECTRUM |
| Siqi Zheng; Alibaba Group |
| Weilong Huang; Alibaba Group |
| Xianliang Wang; Alibaba Group |
| Hongbin Suo; Alibaba Group |
| Jinwei Feng; Alibaba Group |
| Zhijie Yan; Alibaba Group |