Generalized end-to-end loss for speaker verification L Wan, Q Wang, A Papir, IL Moreno 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018 | 1054 | 2018 |
Transfer learning from speaker verification to multispeaker text-to-speech synthesis Y Jia, Y Zhang, R Weiss, Q Wang, J Shen, F Ren, P Nguyen, R Pang, ... Advances in neural information processing systems, 4480-4490, 2018 | 968 | 2018 |
VoiceFilter: Targeted Voice Separation by Speaker-Conditioned Spectrogram Masking Q Wang, H Muckenhirn, K Wilson, P Sridhar, Z Wu, J Hershey, ... Proc. Interspeech 2019, 2728-2732, 2019 | 431 | 2019 |
Speaker diarization with LSTM Q Wang, C Downey, L Wan, PA Mansfield, IL Moreno 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018 | 413 | 2018 |
ASVspoof 2019: A large-scale public database of synthesized, converted and replayed speech X Wang, J Yamagishi, M Todisco, H Delgado, A Nautsch, N Evans, ... Computer Speech & Language 64, 101114, 2020 | 370 | 2020 |
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context M Reid, N Savinov, D Teplyashin, D Lepikhin, T Lillicrap, J Alayrac, ... arXiv preprint arXiv:2403.05530, 2024 | 358 | 2024 |
Kernel principal component analysis and its applications in face recognition and active shape models Q Wang arXiv preprint arXiv:1207.3538, 2012 | 269 | 2012 |
Fully supervised speaker diarization A Zhang, Q Wang, Z Zhu, J Paisley, C Wang ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 267 | 2019 |
Attention-based models for text-dependent speaker verification FAR rahman Chowdhury, Q Wang, IL Moreno, L Wan 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018 | 204 | 2018 |
Wavenet based low rate speech coding WB Kleijn, FSC Lim, A Luebs, J Skoglund, F Stimberg, Q Wang, ... 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018 | 183 | 2018 |
Sample Efficient Adaptive Text-to-Speech Y Chen, Y Assael, B Shillingford, D Budden, S Reed, H Zen, Q Wang, ... International Conference on Learning Representations (ICLR), 2019 | 159 | 2019 |
VoiceFilter-Lite: Streaming Targeted Voice Separation for On-Device Speech Recognition Q Wang, IL Moreno, M Saglam, K Wilson, A Chiao, R Liu, Y He, W Li, ... Proc. Interspeech 2020, 2677-2681, 2020 | 101 | 2020 |
Personal VAD: Speaker-Conditioned Voice Activity Detection S Ding, Q Wang, S Chang, L Wan, IL Moreno Proc. Odyssey 2020 The Speaker and Language Recognition Workshop, 433-439, 2020 | 90 | 2020 |
HMRF-EM-image: implementation of the hidden markov random field model and its expectation-maximization algorithm Q Wang arXiv preprint arXiv:1207.3510, 2012 | 83 | 2012 |
Turn-to-diarize: Online speaker diarization constrained by transformer transducer speaker turn detection W Xia, H Lu, Q Wang, A Tripathi, Y Huang, IL Moreno, H Sak ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 66 | 2022 |
CVSS Corpus and Massively Multilingual Speech-to-Speech Translation Y Jia, MT Ramanovich, Q Wang, H Zen arXiv preprint arXiv:2201.03713, 2022 | 64 | 2022 |
Speaker verification IL Moreno, L Wan, Q Wang US Patent App. 15/211,317, 2018 | 46 | 2018 |
GMM-Based Hidden Markov Random Field for Color Image and 3D Volume Segmentation Q Wang arXiv preprint arXiv:1212.4527, 2012 | 39 | 2012 |
The Active Geometric Shape Model: A New Robust Deformable Shape Model and its Applications Q Wang, KL Boyer Computer Vision and Image Understanding, 2012 | 35 | 2012 |
Semantic Context Forests for Learning-Based Knee Cartilage Segmentation in 3D MR Images Q Wang, D Wu, L Lu, M Liu, KL Boyer, SK Zhou Medical Computer Vision. Large Data in Medical Imaging, 105-115, 2013 | 34 | 2013 |