SPE-4: Speech Synthesis 2: Controllability |
| Session Type: Poster |
| Time: Tuesday, 8 June, 13:00 - 13:45 |
| Location: Gather.Town |
| Session Chair: Yu Zhang, Google
|
| |
| SPE-4.1: PARALLEL TACOTRON: NON-AUTOREGRESSIVE AND CONTROLLABLE TTS |
| Isaac Elias; Google |
| Heiga Zen; Google |
| Jonathan Shen; Google |
| Yu Zhang; Google |
| Ye Jia; Google |
| Ron Weiss; Google |
| Yonghui Wu; Google |
| |
| SPE-4.2: FCL-TACO2: TOWARDS FAST, CONTROLLABLE AND LIGHTWEIGHT TEXT-TO-SPEECH SYNTHESIS |
| Disong Wang; The Chinese University of Hong Kong |
| Liqun Deng; Huawei Noah's Ark Lab |
| Yang Zhang; Huawei Noah's Ark Lab |
| Nianzu Zheng; Huawei Noah's Ark Lab |
| Yu Ting Yeung; Huawei Noah's Ark Lab |
| Xiao Chen; Huawei Noah's Ark Lab |
| Xunying Liu; The Chinese University of Hong Kong |
| Helen Meng; The Chinese University of Hong Kong |
| |
| SPE-4.3: PROSODIC CLUSTERING FOR PHONEME-LEVEL PROSODY CONTROL IN END-TO-END SPEECH SYNTHESIS |
| Alexandra Vioni; Innoetics, Samsung Electronics |
| Myrsini Christidou; Innoetics, Samsung Electronics |
| Nikolaos Ellinas; Innoetics, Samsung Electronics |
| Georgios Vamvoukakis; Innoetics, Samsung Electronics |
| Panos Kakoulidis; Innoetics, Samsung Electronics |
| Taehoon Kim; Mobile Communications Business, Samsung Electronics |
| June Sig Sung; Mobile Communications Business, Samsung Electronics |
| Hyoungmin Park; Mobile Communications Business, Samsung Electronics |
| Aimilios Chalamandaris; Innoetics, Samsung Electronics |
| Pirros Tsiakoulis; Innoetics, Samsung Electronics |
| |
| SPE-4.4: IMPROVING NATURALNESS AND CONTROLLABILITY OF SEQUENCE-TO-SEQUENCE SPEECH SYNTHESIS BY LEARNING LOCAL PROSODY REPRESENTATIONS |
| Cheng Gong; Tianjin University |
| Longbiao Wang; Tianjin University |
| Zhenhua Ling; University of Science and Technology of China |
| Shaotong Guo; Tianjin University |
| Ju Zhang; Huiyan Technology (Tianjin) Co., Ltd |
| Jianwu Dang; Japan Advanced Institute of Science and Technology |
| |
| SPE-4.5: MULTI-SPEAKER EMOTIONAL SPEECH SYNTHESIS WITH FINE-GRAINED PROSODY MODELING |
| Chunhui Lu; Samsung Research China-Beijing |
| Xue Wen; Samsung Research China-Beijing |
| Ruolan Liu; Samsung Research China-Beijing |
| Xiao Chen; Samsung Research China-Beijing |
| |
| SPE-4.6: EMOTION CONTROLLABLE SPEECH SYNTHESIS USING EMOTION-UNLABELED DATASET WITH THE ASSISTANCE OF CROSS-DOMAIN SPEECH EMOTION RECOGNITION |
| Xiong Cai; Tsinghua University |
| Dongyang Dai; Tsinghua University |
| Zhiyong Wu; Tsinghua University |
| Xiang Li; Tsinghua University |
| Jingbei Li; Tsinghua University |
| Helen Meng; Chinese University of Hong Kong |
| |