Quan Wang
Senior Staff Software Engineer @ Google; Instructor @ Udemy; Textbook Author; IEEE Senior Member
Verified email at google.com
Title / Cited by / Year
Generalized end-to-end loss for speaker verification
L Wan, Q Wang, A Papir, IL Moreno
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
Cited by: 1054; Year: 2018
Transfer learning from speaker verification to multispeaker text-to-speech synthesis
Y Jia, Y Zhang, R Weiss, Q Wang, J Shen, F Ren, P Nguyen, R Pang, ...
Advances in neural information processing systems, 4480-4490, 2018
Cited by: 968; Year: 2018
VoiceFilter: Targeted Voice Separation by Speaker-Conditioned Spectrogram Masking
Q Wang, H Muckenhirn, K Wilson, P Sridhar, Z Wu, J Hershey, ...
Proc. Interspeech 2019, 2728-2732, 2019
Cited by: 431; Year: 2019
Speaker diarization with LSTM
Q Wang, C Downey, L Wan, PA Mansfield, IL Moreno
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
Cited by: 413; Year: 2018
ASVspoof 2019: A large-scale public database of synthesized, converted and replayed speech
X Wang, J Yamagishi, M Todisco, H Delgado, A Nautsch, N Evans, ...
Computer Speech & Language 64, 101114, 2020
Cited by: 370; Year: 2020
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context
M Reid, N Savinov, D Teplyashin, D Lepikhin, T Lillicrap, J Alayrac, ...
arXiv preprint arXiv:2403.05530, 2024
Cited by: 358; Year: 2024
Kernel principal component analysis and its applications in face recognition and active shape models
Q Wang
arXiv preprint arXiv:1207.3538, 2012
Cited by: 269; Year: 2012
Fully supervised speaker diarization
A Zhang, Q Wang, Z Zhu, J Paisley, C Wang
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
Cited by: 267; Year: 2019
Attention-based models for text-dependent speaker verification
FAR Rahman Chowdhury, Q Wang, IL Moreno, L Wan
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
Cited by: 204; Year: 2018
Wavenet based low rate speech coding
WB Kleijn, FSC Lim, A Luebs, J Skoglund, F Stimberg, Q Wang, ...
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
Cited by: 183; Year: 2018
Sample Efficient Adaptive Text-to-Speech
Y Chen, Y Assael, B Shillingford, D Budden, S Reed, H Zen, Q Wang, ...
International Conference on Learning Representations (ICLR), 2019
Cited by: 159; Year: 2019
VoiceFilter-Lite: Streaming Targeted Voice Separation for On-Device Speech Recognition
Q Wang, IL Moreno, M Saglam, K Wilson, A Chiao, R Liu, Y He, W Li, ...
Proc. Interspeech 2020, 2677-2681, 2020
Cited by: 101; Year: 2020
Personal VAD: Speaker-Conditioned Voice Activity Detection
S Ding, Q Wang, S Chang, L Wan, IL Moreno
Proc. Odyssey 2020 The Speaker and Language Recognition Workshop, 433-439, 2020
Cited by: 90; Year: 2020
HMRF-EM-image: implementation of the hidden markov random field model and its expectation-maximization algorithm
Q Wang
arXiv preprint arXiv:1207.3510, 2012
Cited by: 83; Year: 2012
Turn-to-diarize: Online speaker diarization constrained by transformer transducer speaker turn detection
W Xia, H Lu, Q Wang, A Tripathi, Y Huang, IL Moreno, H Sak
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
Cited by: 66; Year: 2022
CVSS Corpus and Massively Multilingual Speech-to-Speech Translation
Y Jia, MT Ramanovich, Q Wang, H Zen
arXiv preprint arXiv:2201.03713, 2022
Cited by: 64; Year: 2022
Speaker verification
IL Moreno, L Wan, Q Wang
US Patent App. 15/211,317, 2018
Cited by: 46; Year: 2018
GMM-Based Hidden Markov Random Field for Color Image and 3D Volume Segmentation
Q Wang
arXiv preprint arXiv:1212.4527, 2012
Cited by: 39; Year: 2012
The Active Geometric Shape Model: A New Robust Deformable Shape Model and its Applications
Q Wang, KL Boyer
Computer Vision and Image Understanding, 2012
Cited by: 35; Year: 2012
Semantic Context Forests for Learning-Based Knee Cartilage Segmentation in 3D MR Images
Q Wang, D Wu, L Lu, M Liu, KL Boyer, SK Zhou
Medical Computer Vision. Large Data in Medical Imaging, 105-115, 2013
Cited by: 34; Year: 2013