Self-critical sequence training for image captioning SJ Rennie, E Marcheret, Y Mroueh, J Ross, V Goel Proceedings of the IEEE conference on computer vision and pattern …, 2017 | 2271 | 2017 |
Deep multimodal learning for audio-visual speech recognition Y Mroueh, E Marcheret, V Goel 2015 IEEE International Conference on Acoustics, Speech and Signal …, 2015 | 281 | 2015 |
Speech recognition in noisy environments E Epstein, B Lewis, E Marcheret US Patent 6,850,887, 2005 | 170 | 2005 |
Incremental on-line feature space MLLR adaptation for telephony speech recognition. Y Li, H Erdogan, Y Gao, E Marcheret INTERSPEECH, 1417-1420, 2002 | 79 | 2002 |
Methods and apparatus for speech recognition using visual information E Marcheret, J Vopicka, V Goel US Patent 10,109,277, 2018 | 59 | 2018 |
Audio-visual speech recognition with scattering operators E Marcheret, J Vopicka, V Goel US Patent 9,697,833, 2017 | 41 | 2017 |
Techniques for evaluation, building and/or retraining of a classification model E Marcheret US Patent 9,031,897, 2015 | 38 | 2015 |
Detecting audio-visual synchrony using deep neural networks. E Marcheret, G Potamianos, J Vopicka, V Goel INTERSPEECH 2, 4, 2015 | 36 | 2015 |
The IBM RT07 evaluation systems for speaker diarization on lecture meetings J Huang, E Marcheret, K Visweswariah, G Potamianos International Evaluation Workshop on Rich Transcription, 497-508, 2007 | 32 | 2007 |
Dynamic stream weight modeling for audio-visual speech recognition E Marcheret, V Libal, G Potamianos 2007 IEEE International Conference on Acoustics, Speech and Signal …, 2007 | 29 | 2007 |
Audio and visual modality combination in speech processing applications G Potamianos, E Marcheret, Y Mroueh, V Goel, A Koumbaroulis, ... The Handbook of Multimodal-Multisensor Interfaces: Foundations, User …, 2017 | 28 | 2017 |
Method for likelihood computation in multi-stream HMM based speech recognition SM Chu, V Goel, E Marcheret, G Potamianos US Patent 7,480,617, 2009 | 27 | 2009 |
Towards practical deployment of audio-visual speech recognition G Potamianos, C Neti, J Huang, JH Connell, S Chu, V Libal, E Marcheret, ... 2004 IEEE International Conference on Acoustics, Speech, and Signal …, 2004 | 25 | 2004 |
Rapid feature space speaker adaptation for multi-stream HMM-based audio-visual speech recognition J Huang, E Marcheret, K Visweswariah 2005 IEEE International Conference on Multimedia and Expo, 338-341, 2005 | 23 | 2005 |
Automatic speech recognition and speech activity detection in the CHIL smart room SM Chu, E Marcheret, G Potamianos Machine Learning for Multimodal Interaction: Second International Workshop …, 2006 | 22 | 2006 |
Speech recognition system adaptation based on non-acoustic attributes and face selection based on mouth motion using pixel intensities IIJH Connell, E Marcheret US Patent 9,899,025, 2018 | 21 | 2018 |
Audio-visual speech synchronization detection using a bimodal linear prediction model K Kumar, J Navratil, E Marcheret, V Libal, G Ramaswamy, G Potamianos 2009 IEEE Computer Society Conference on Computer Vision and Pattern …, 2009 | 21 | 2009 |
A real-time prototype for small-vocabulary audio-visual ASR JH Connell, N Haas, E Marcheret, C Neti, G Potamianos, S Velipasalar 2003 International Conference on Multimedia and Expo. ICME'03. Proceedings …, 2003 | 20 | 2003 |
An extensible language interfacefor robot manipulation J Connell, E Marcheret, S Pankanti, M Kudoh, R Nishiyama Artificial General Intelligence: 5th International Conference, AGI 2012 …, 2012 | 19 | 2012 |
Audio-only backoff in audio-visual speech recognition system JH Connell, N Haas, E Marcheret, CV Neti, G Potamianos US Patent 7,251,603, 2007 | 19 | 2007 |