Jason Fong

Cited by

	All	Since 2019
Citations	106	106
h-index	5	5
i10-index	4	4

Citations per year: 2019: 3, 2020: 15, 2021: 24, 2022: 35, 2023: 22, 2024: 7

Public access

4 articles available, 0 articles not available (based on funding mandates)

Co-authors

Simon King, Professor of Speech Processing, University of Edinburgh (verified email at ed.ac.uk)
Gustav Eje Henter, KTH Royal Institute of Technology, Stockholm, Sweden (verified email at kth.se)
Jason Taylor, School of Informatics, University of Edinburgh (verified email at ed.ac.uk)
Cassia Valentini-Botinhao, University of Edinburgh (verified email at inf.ed.ac.uk)
Korin Richmond, Centre for Speech Technology Research, University of Edinburgh (verified email at cstr.ed.ac.uk)
Jennifer Williams, Assistant Professor at University of Southampton (UK) (verified email at soton.ac.uk)
Zack Hodari, Research Engineer, Papercup (verified email at papercup.com)

Jason Fong

PhD Student, The University of Edinburgh

Verified email at ed.ac.uk

Speech Synthesis, Natural Language Processing, Machine Learning


Title	Cited by	Year
Where do the improvements come from in sequence-to-sequence neural TTS? O Watts, GE Henter, J Fong, C Valentini-Botinhao. 2019 ISCA Speech Synthesis Workshop (SSW) 10, 217-222, 2019	34	2019
A comparison between letters and phones as input to sequence-to-sequence models for speech synthesis. J Fong, J Taylor, K Richmond, S King. 10th ISCA Speech Synthesis Workshop, 223-227, 2019	30	2019
Multilingual text-to-speech training using cross language voice conversion and self-supervised learning of speech representations. J Wu, A Polyak, Y Taigman, J Fong, P Agrawal, Q He. ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and …, 2022	13	2022
Investigating the Robustness of Sequence-to-Sequence Text-to-Speech Models to Imperfectly-Transcribed Training Data. J Fong, PO Gallegos, Z Hodari, S King. INTERSPEECH, 1546-1550, 2019	11	2019
Exploring disentanglement with multilingual and monolingual VQ-VAE. J Williams, J Fong, E Cooper, J Yamagishi. Proc. 11th ISCA Speech Synthesis Workshop (SSW 11), 2021	8	2021
Testing the Limits of Representation Mixing for Pronunciation Correction in End-to-End Speech Synthesis. J Fong, J Taylor, S King. INTERSPEECH, 4019-4023, 2020	5	2020
Analysing Temporal Sensitivity of VQ-VAE Sub-Phone Codebooks. J Fong, J Williams, S King. Proc. 11th ISCA Speech Synthesis Workshop (SSW 11), 227-231, 2021	2	2021
Improving Polyglot Speech Synthesis through Multi-task and Adversarial Learning. J Fong, J Wu, P Agrawal, A Gibiansky, T Koehler, Q He. Proc. 11th ISCA Speech Synthesis Workshop (SSW 11), 172-176, 2021	2	2021
Speech Audio Corrector: using speech from non-target speakers for one-off correction of mispronunciations in grapheme-input text-to-speech. J Fong, D Lyth, GE Henter, H Tang, S King. Proc. Interspeech 2022, 1213-1217, 2022	1	2022
Spell4TTS: Acoustically-informed spellings for improving text-to-speech pronunciations. J Fong, H Tang, S King. 12th Speech Synthesis Workshop (SSW) 2023, 2023		2023
Towards zero-shot Text-based voice editing using acoustic context conditioning, utterance embeddings, and reference encoders. J Fong, Y Wang, P Agrawal, V Manohar, J Wu, T Köhler, Q He. arXiv preprint arXiv:2210.16045, 2022		2022
Listening-test materials for "Where do the improvements come from in sequence-to-sequence neural TTS?" O Watts, GE Henter, J Fong, C Valentini-Botinhao. University of Edinburgh. School of Informatics. Centre for …, 2020		2020
