The llama 3 herd of models A Dubey, A Jauhri, A Pandey, A Kadian, A Al-Dahle, A Letman, A Mathur, ... arXiv preprint arXiv:2407.21783, 2024 | 1222 | 2024 |
Conversational speech transcription using context-dependent deep neural networks. F Seide, G Li, D Yu Interspeech, 437-440, 2011 | 1166 | 2011 |
1-bit stochastic gradient descent and its application to data-parallel distributed training of speech DNNs. F Seide, H Fu, J Droppo, G Li, D Yu Interspeech 2014, 1058-1062, 2014 | 1142 | 2014 |
Recent advances in deep learning for speech research at Microsoft L Deng, J Li, JT Huang, K Yao, D Yu, F Seide, M Seltzer, G Zweig, X He, ... 2013 IEEE international conference on acoustics, speech and signal …, 2013 | 1064 | 2013 |
Marian: Fast neural machine translation in C++ M Junczys-Dowmunt, R Grundkiewicz, T Dwojak, H Hoang, K Heafield, ... arXiv preprint arXiv:1804.00344, 2018 | 815 | 2018 |
Feature engineering in context-dependent deep neural networks for conversational speech transcription F Seide, G Li, X Chen, D Yu 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 24-29, 2011 | 787 | 2011 |
Achieving human parity on automatic chinese to english news translation H Hassan, A Aue, C Chen, V Chowdhary, J Clark, C Federmann, X Huang, ... arXiv preprint arXiv:1803.05567, 2018 | 733 | 2018 |
Achieving human parity in conversational speech recognition W Xiong, J Droppo, X Huang, F Seide, M Seltzer, A Stolcke, D Yu, ... arXiv preprint arXiv:1610.05256, 2016 | 727 | 2016 |
CNTK: Microsoft's open-source deep-learning toolkit F Seide, A Agarwal Proceedings of the 22nd ACM SIGKDD international conference on knowledge …, 2016 | 651 | 2016 |
KL-divergence regularized deep neural network adaptation for improved large vocabulary speech recognition D Yu, K Yao, H Su, G Li, F Seide 2013 IEEE International Conference on Acoustics, Speech and Signal …, 2013 | 525 | 2013 |
An introduction to computational networks and the computational network toolkit D Yu, A Eversole, M Seltzer, K Yao, Z Huang, B Guenter, O Kuchaiev, ... Microsoft Technical Report MSR-TR-2014–112, 2014 | 476 | 2014 |
The Philips automatic train timetable information system H Aust, M Oerder, F Seide, V Steinbiss Speech Communication 17 (3-4), 249-262, 1995 | 347 | 1995 |
Feature learning in deep neural networks-studies on speech recognition tasks D Yu, ML Seltzer, J Li, JT Huang, F Seide arXiv preprint arXiv:1301.3605, 2013 | 324 | 2013 |
Toward human parity in conversational speech recognition W Xiong, J Droppo, X Huang, F Seide, ML Seltzer, A Stolcke, D Yu, ... IEEE/ACM Transactions on Audio, Speech, and Language Processing 25 (12 …, 2017 | 259 | 2017 |
Internet search-based television F Seide, L Lu, N Moraveji, R Yu, WY Ma US Patent App. 11/405,369, 2007 | 258 | 2007 |
Adaptation of context-dependent deep neural networks for automatic speech recognition K Yao, D Yu, F Seide, H Su, L Deng, Y Gong 2012 IEEE Spoken Language Technology Workshop (SLT), 366-369, 2012 | 257 | 2012 |
Exploiting sparseness in deep neural networks for large vocabulary speech recognition D Yu, F Seide, G Li, L Deng 2012 IEEE International conference on acoustics, speech and signal …, 2012 | 192 | 2012 |
Error back propagation for sequence training of context-dependent deep networks for conversational speech transcription H Su, G Li, D Yu, F Seide 2013 IEEE International Conference on Acoustics, Speech and Signal …, 2013 | 182 | 2013 |
System and method for utilizing the content of audio/video files to select advertising content for display Y Li, L Li, T Najm, H Gao, B Zhang, X Wang, F Seide, R Yu, HJ Zeng, ... US Patent App. 11/084,616, 2006 | 176 | 2006 |
Deep neural networks training for speech and pattern recognition FTB Seide, G Li, D Yu, AC Eversole, X Chen US Patent 9,477,925, 2016 | 157 | 2016 |