Saurabh Sahu - Google Scholar

Get my own profile

Cited by

	All	Since 2019
Citations	373	350
h-index	9	9
i10-index	9	8

0

80

40

2016201720182019202020212022202320241 3 17 57 71 73 78 57 14

Saurabh Sahu

Saurabh Sahu

University of Maryland

Verified email at umd.edu

Emotion recognition multi-modal analysis


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Adversarial auto-encoders for speech based emotion recognition S Sahu, R Gupta, G Sivaraman, W AbdAlmageed, C Espy-Wilson arXiv preprint arXiv:1806.02146, 2018	101	2018
On enhancing speech emotion recognition using generative adversarial networks S Sahu, R Gupta, C Espy-Wilson arXiv preprint arXiv:1806.06626, 2018	70	2018
Multi-Modal Learning for Speech Emotion Recognition: An Analysis and Comparison of ASR Outputs with Ground Truth Transcription. S Sahu, V Mitra, N Seneviratne, CY Espy-Wilson Interspeech, 3302-3306, 2019	39	2019
Semi-supervised and transfer learning approaches for low resource sentiment classification R Gupta, S Sahu, C Espy-Wilson, S Narayanan 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018	37	2018
SCL-UMD at the Medico Task-MediaEval 2017: Transfer Learning based Classification of Medical Images. T Agrawal, R Gupta, S Sahu, CY Espy-Wilson MediaEval, 2017	37	2017
Speech Features for Depression Detection. S Sahu, CY Espy-Wilson INTERSPEECH, 1928-1932, 2016	16	2016
Smoothing model predictions using adversarial training procedures for speech based emotion recognition S Sahu, R Gupta, G Sivaraman, C Espy-Wilson 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018	15	2018
Modeling feature representations for affective speech using generative adversarial networks S Sahu, R Gupta, C Espy-Wilson IEEE Transactions on Affective Computing, 2020	11	2020
An Affect Prediction Approach Through Depression Severity Parameter Incorporation in Neural Networks. R Gupta, S Sahu, CY Espy-Wilson, SS Narayanan INTERSPEECH, 3122-3126, 2017	10	2017
Cross-modal Learning for Multi-modal Video Categorization P Goyal, S Sahu, S Ghosh, C Lee arXiv preprint arXiv:2003.03501, 2020	9	2020
Effects of depression on speech S Sahu, C Espy-Wilson The Journal of the Acoustical Society of America 136 (4), 2312-2312, 2014	8	2014
Effect of depression on syllabic rate of speech S Sahu, C Espy-Wilson J Acoustical Society of America 138, 1781, 2015	7	2015
Cross-modal Non-linear Guided Attention and Temporal Coherence in Multi-modal Deep Video Models S Sahu, P Goyal, S Ghosh, C Lee Proceedings of the 28th ACM International Conference on Multimedia, 313-321, 2020	5	2020
Enhancing Transformer for Video Understanding Using Gated Multi-Level Attention and Temporal Adversarial Training S Sahu, P Goyal arXiv preprint arXiv:2103.10043, 2021	3	2021
Towards Building Generalizable Speech Emotion Recognition Models S Sahu	2	2019
Leveraging Local Temporal Information for Multimodal Scene Classification S Sahu, P Goyal ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022	1	2022
Can't Fool Me: Adversarially Robust Transformer for Video Understanding D Choudhary, P Goyal, S Sahu arXiv preprint arXiv:2110.13950, 2021	1	2021
Exploiting Temporal Coherence for Multi-modal Video Categorization P Goyal, S Sahu, S Ghosh, C Lee arXiv preprint arXiv:2002.03844, 2020	1	2020

The system can't perform the operation now. Try again later.

Articles 1–18