SPE-11: Voice Conversion 1: Non-parallel Conversion |
| Session Type: Poster |
| Time: Tuesday, 8 June, 16:30 - 17:15 |
| Location: Gather.Town |
| Session Chair: Tomoki Toda, Nagoya University |
| SPE-11.1: MASKCYCLEGAN-VC: LEARNING NON-PARALLEL VOICE CONVERSION WITH FILLING IN FRAMES |
| Takuhiro Kaneko; NTT Corporation |
| Hirokazu Kameoka; NTT Corporation |
| Kou Tanaka; NTT Corporation |
| Nobukatsu Hojo; NTT Corporation |
| SPE-11.2: NON-PARALLEL MANY-TO-MANY VOICE CONVERSION BY KNOWLEDGE TRANSFER FROM A TEXT-TO-SPEECH MODEL |
| Xinyuan Yu; The Hong Kong University of Science and Technology |
| Brian Mak; The Hong Kong University of Science and Technology |
| SPE-11.3: NON-PARALLEL MANY-TO-MANY VOICE CONVERSION USING LOCAL LINGUISTIC TOKENS |
| Chao Wang; Soochow University |
| Yibiao Yu; Soochow University |
| SPE-11.4: CRANK: AN OPEN-SOURCE SOFTWARE FOR NONPARALLEL VOICE CONVERSION BASED ON VECTOR-QUANTIZED VARIATIONAL AUTOENCODER |
| Kazuhiro Kobayashi; Nagoya University |
| Wen-Chin Huang; Nagoya University |
| Yi-Chiao Wu; Nagoya University |
| Patrick Lumban Tobing; Nagoya University |
| Tomoki Hayashi; Nagoya University |
| Tomoki Toda; Nagoya University |
| SPE-11.5: FRAGMENTVC: ANY-TO-ANY VOICE CONVERSION BY END-TO-END EXTRACTING AND FUSING FINE-GRAINED VOICE FRAGMENTS WITH ATTENTION |
| Yist Y. Lin; National Taiwan University |
| Chung-Ming Chien; National Taiwan University |
| Jheng-Hao Lin; National Taiwan University |
| Hung-yi Lee; National Taiwan University |
| Lin-shan Lee; National Taiwan University |
| SPE-11.6: ANY-TO-ONE SEQUENCE-TO-SEQUENCE VOICE CONVERSION USING SELF-SUPERVISED DISCRETE SPEECH REPRESENTATIONS |
| Wen-Chin Huang; Nagoya University |
| Yi-Chiao Wu; Nagoya University |
| Tomoki Hayashi; Nagoya University |
| Tomoki Toda; Nagoya University |