Follow
Jason Fong
Title
Cited by
Cited by
Year
Where do the improvements come from in sequence-to-sequence neural TTS?
O Watts, GE Henter, J Fong, C Valentini-Botinhao
2019 ISCA Speech Synthesis Workshop (SSW) 10, 217-222, 2019
342019
A comparison between letters and phones as input to sequence-to-sequence models for speech synthesis
J Fong, J Taylor, K Richmond, S King
10th ISCA Speech Synthesis Workshop, 223-227, 2019
302019
Multilingual text-to-speech training using cross language voice conversion and self-supervised learning of speech representations
J Wu, A Polyak, Y Taigman, J Fong, P Agrawal, Q He
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
132022
Investigating the Robustness of Sequence-to-Sequence Text-to-Speech Models to Imperfectly-Transcribed Training Data.
J Fong, PO Gallegos, Z Hodari, S King
INTERSPEECH, 1546-1550, 2019
112019
Exploring disentanglement with multilingual and monolingual vq-vae
J Williams, J Fong, E Cooper, J Yamagishi
Proc. 11th ISCA Speech Synthesis Workshop (SSW 11), 2021
82021
Testing the Limits of Representation Mixing for Pronunciation Correction in End-to-End Speech Synthesis.
J Fong, J Taylor, S King
INTERSPEECH, 4019-4023, 2020
52020
Analysing Temporal Sensitivity of VQ-VAE Sub-Phone Codebooks
J Fong, J Williams, S King
Proc. 11th ISCA Speech Synthesis Workshop (SSW 11), 227-231, 2021
22021
Improving Polyglot Speech Synthesis through Multi-task and Adversarial Learning
J Fong, J Wu, P Agrawal, A Gibiansky, T Koehler, Q He
Proc. 11th ISCA Speech Synthesis Workshop (SSW 11), 172-176, 2021
22021
Speech Audio Corrector: using speech from non-target speakers for one-off correction of mispronunciations in grapheme-input text-to-speech
J Fong, D Lyth, GE Henter, H Tang, S King
Proc. Interspeech 2022, 1213-1217, 2022
12022
Spell4TTS: Acoustically-informed spellings for improving text-to-speech pronunciations
J Fong, H Tang, S King
12th Speech Synthesis Workshop (SSW) 2023, 2023
2023
Towards zero-shot Text-based voice editing using acoustic context conditioning, utterance embeddings, and reference encoders
J Fong, Y Wang, P Agrawal, V Manohar, J Wu, T Köhler, Q He
arXiv preprint arXiv:2210.16045, 2022
2022
Listening-test materials for" Where do the improvements come from in sequence-to-sequence neural TTS?"
O Watts, GE Henter, J Fong, C Valentini-Botinhao
纸飞机 tv 版软件官方下载 of Edinburgh. School of Informatics. Centre for …, 2020
2020
The system can't perform the operation now. Try again later.
Articles 1–12