Follow
Saurabh Sahu
Title
Cited by
Cited by
Year
Adversarial auto-encoders for speech based emotion recognition
S Sahu, R Gupta, G Sivaraman, W AbdAlmageed, C Espy-Wilson
arXiv preprint arXiv:1806.02146, 2018
1012018
On enhancing speech emotion recognition using generative adversarial networks
S Sahu, R Gupta, C Espy-Wilson
arXiv preprint arXiv:1806.06626, 2018
702018
Multi-Modal Learning for Speech Emotion Recognition: An Analysis and Comparison of ASR Outputs with Ground Truth Transcription.
S Sahu, V Mitra, N Seneviratne, CY Espy-Wilson
Interspeech, 3302-3306, 2019
392019
Semi-supervised and transfer learning approaches for low resource sentiment classification
R Gupta, S Sahu, C Espy-Wilson, S Narayanan
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
372018
SCL-UMD at the Medico Task-MediaEval 2017: Transfer Learning based Classification of Medical Images.
T Agrawal, R Gupta, S Sahu, CY Espy-Wilson
MediaEval, 2017
372017
Speech Features for Depression Detection.
S Sahu, CY Espy-Wilson
INTERSPEECH, 1928-1932, 2016
162016
Smoothing model predictions using adversarial training procedures for speech based emotion recognition
S Sahu, R Gupta, G Sivaraman, C Espy-Wilson
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
152018
Modeling feature representations for affective speech using generative adversarial networks
S Sahu, R Gupta, C Espy-Wilson
IEEE Transactions on Affective Computing, 2020
112020
An Affect Prediction Approach Through Depression Severity Parameter Incorporation in Neural Networks.
R Gupta, S Sahu, CY Espy-Wilson, SS Narayanan
INTERSPEECH, 3122-3126, 2017
102017
Cross-modal Learning for Multi-modal Video Categorization
P Goyal, S Sahu, S Ghosh, C Lee
arXiv preprint arXiv:2003.03501, 2020
92020
Effects of depression on speech
S Sahu, C Espy-Wilson
The Journal of the Acoustical Society of America 136 (4), 2312-2312, 2014
82014
Effect of depression on syllabic rate of speech
S Sahu, C Espy-Wilson
J Acoustical Society of America 138, 1781, 2015
72015
Cross-modal Non-linear Guided Attention and Temporal Coherence in Multi-modal Deep Video Models
S Sahu, P Goyal, S Ghosh, C Lee
Proceedings of the 28th ACM International Conference on Multimedia, 313-321, 2020
52020
Enhancing Transformer for Video Understanding Using Gated Multi-Level Attention and Temporal Adversarial Training
S Sahu, P Goyal
arXiv preprint arXiv:2103.10043, 2021
32021
Towards Building Generalizable Speech Emotion Recognition Models
S Sahu
22019
Leveraging Local Temporal Information for Multimodal Scene Classification
S Sahu, P Goyal
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
12022
Can't Fool Me: Adversarially Robust Transformer for Video Understanding
D Choudhary, P Goyal, S Sahu
arXiv preprint arXiv:2110.13950, 2021
12021
Exploiting Temporal Coherence for Multi-modal Video Categorization
P Goyal, S Sahu, S Ghosh, C Lee
arXiv preprint arXiv:2002.03844, 2020
12020
The system can't perform the operation now. Try again later.
Articles 1–18