Chao Weng

Cited by

	All	Since 2019
Citations	2458	2169
h-index	28	26
i10-index	48	45

Citations per year

Year	Citations
2014	17
2015	60
2016	60
2017	67
2018	80
2019	138
2020	184
2021	334
2022	482
2023	703
2024	317

Public access

9 articles available

0 articles not available

Based on funding mandates

Co-authors

Dong Yu (俞栋)	Distinguished Scientist @ Tencent AI Lab, ACM/IEEE/ISCA Fellow	Verified email at global.tencent.com
Shinji Watanabe	Carnegie Mellon University	Verified email at cmu.edu
Jasha Droppo	Amazon	Verified email at amazon.com
Mike Seltzer	Facebook	Verified email at fb.com
Zhen Huang	Apple Inc.	Verified email at apple.com
Daniel Povey	Chief Speech Scientist, Xiaomi Corp.	Verified email at xiaomi.com
Jinyu Li	Partner Applied Science Manager, Microsoft	Verified email at microsoft.com
Kehuang Li	Georgia Institute of Technology	Verified email at gatech.edu
patrick g haffner	Amazon AWS	Verified email at amazon.com

Chao Weng

Independent R&D

No verified email


Title	Authors	Venue	Cited by	Year
Gigaspeech: An evolving, multi-domain asr corpus with 10,000 hours of transcribed audio	G Chen, S Chai, G Wang, J Du, WQ Zhang, C Weng, D Su, D Povey, ...	arXiv preprint arXiv:2106.06909, 2021	159	2021
Recurrent deep neural networks for robust speech recognition	C Weng, D Yu, S Watanabe, BHF Juang	2014 IEEE International Conference on Acoustics, Speech and Signal …, 2014	156	2014
Diffsound: Discrete diffusion model for text-to-sound generation	D Yang, J Yu, H Wang, W Wang, C Weng, Y Zou, D Yu	IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2023	154	2023
Replay and synthetic speech detection with res2net architecture	X Li, N Li, C Weng, X Liu, D Su, D Yu, H Meng	ICASSP 2021-2021 IEEE international conference on acoustics, speech and …, 2021	134	2021
Deep neural networks for single-channel multi-talker speech recognition	C Weng, D Yu, ML Seltzer, J Droppo	IEEE/ACM Transactions on Audio, Speech, and Language Processing 23 (10 …, 2015	112	2015
DurIAN: Duration Informed Attention Network for Speech Synthesis.	C Yu, H Lu, N Hu, M Yu, C Weng, K Xu, P Liu, D Tuo, S Kang, G Lei, D Su, ...	Interspeech, 2027-2031, 2020	100	2020
Durian: Duration informed attention network for multimodal synthesis	C Yu, H Lu, N Hu, M Yu, C Weng, K Xu, P Liu, D Tuo, S Kang, G Lei, D Su, ...	arXiv preprint arXiv:1909.01700, 2019	99	2019
Component fusion: Learning replaceable language model component for end-to-end speech recognition system	C Shan, C Weng, G Wang, D Su, M Luo, D Yu, L Xie	ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019	97	2019
Past review, current progress, and challenges ahead on the cocktail party problem	Y Qian, C Weng, X Chang, S Wang, D Yu	Frontiers of Information Technology & Electronic Engineering 19, 40-63, 2018	90	2018
Investigating end-to-end speech recognition for mandarin-english code-switching	C Shan, C Weng, G Wang, D Su, M Luo, D Yu, L Xie	ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019	78	2019
Self-supervised text-independent speaker verification using prototypical momentum contrastive learning	W Xia, C Zhang, C Weng, M Yu, D Yu	ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021	65	2021
Deep learning based multi-source localization with source splitting and its effectiveness in multi-talker speech recognition	AS Subramanian, C Weng, S Watanabe, M Yu, D Yu	Computer Speech & Language 75, 101360, 2022	64	2022
Improving Attention Based Sequence-to-Sequence Models for End-to-End English Conversational Speech Recognition.	C Weng, J Cui, G Wang, J Wang, C Yu, D Su, D Yu	Interspeech, 761-765, 2018	61	2018
Mixed speech recognition	D Yu, C Weng, ML Seltzer, J Droppo	US Patent 9,390,712, 2016	59	2016
Single-channel mixed speech recognition using deep neural networks	C Weng, D Yu, ML Seltzer, J Droppo	2014 IEEE International Conference on Acoustics, Speech and Signal …, 2014	47	2014
Far-field location guided target speech extraction using end-to-end speech recognition objectives	AS Subramanian, C Weng, M Yu, SX Zhang, Y Xu, S Watanabe, D Yu	ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020	43	2020
Pitchnet: Unsupervised singing voice conversion with pitch adversarial network	C Deng, C Yu, H Lu, C Weng, D Yu	ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020	41	2020
Joint training of complex ratio mask based beamformer and acoustic model for noise robust asr	Y Xu, C Weng, L Hui, J Liu, M Yu, D Su, D Yu	ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019	41	2019
Simple attention module based speaker verification with iterative noisy label detection	X Qin, N Li, C Weng, D Su, M Li	ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022	40	2022
Videocrafter1: Open diffusion models for high-quality video generation	H Chen, M Xia, Y He, Y Zhang, X Cun, S Yang, J Xing, Y Liu, Q Chen, ...	arXiv preprint arXiv:2310.19512, 2023	39	2023

