Title, authors, venue | Cited by | Year |
Adversarial auto-encoders for speech based emotion recognition S Sahu, R Gupta, G Sivaraman, W AbdAlmageed, C Espy-Wilson arXiv preprint arXiv:1806.02146, 2018 | 116 | 2018 |
On enhancing speech emotion recognition using generative adversarial networks S Sahu, R Gupta, C Espy-Wilson arXiv preprint arXiv:1806.06626, 2018 | 80 | 2018 |
Multi-Modal Learning for Speech Emotion Recognition: An Analysis and Comparison of ASR Outputs with Ground Truth Transcription. S Sahu, V Mitra, N Seneviratne, CY Espy-Wilson Interspeech, 3302-3306, 2019 | 45 | 2019 |
SCL-UMD at the Medico Task-MediaEval 2017: Transfer Learning based Classification of Medical Images. T Agrawal, R Gupta, S Sahu, CY Espy-Wilson MediaEval, 2017 | 45 | 2017 |
Semi-supervised and transfer learning approaches for low resource sentiment classification R Gupta, S Sahu, C Espy-Wilson, S Narayanan 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018 | 40 | 2018 |
Speech Features for Depression Detection. S Sahu, CY Espy-Wilson INTERSPEECH, 1928-1932, 2016 | 20 | 2016 |
Smoothing model predictions using adversarial training procedures for speech based emotion recognition S Sahu, R Gupta, G Sivaraman, C Espy-Wilson 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018 | 17 | 2018 |
Modeling feature representations for affective speech using generative adversarial networks S Sahu, R Gupta, C Espy-Wilson IEEE Transactions on Affective Computing, 2020 | 16 | 2020 |
Cross-modal Learning for Multi-modal Video Categorization P Goyal, S Sahu, S Ghosh, C Lee arXiv preprint arXiv:2003.03501, 2020 | 12 | 2020 |
An Affect Prediction Approach Through Depression Severity Parameter Incorporation in Neural Networks. R Gupta, S Sahu, CY Espy-Wilson, SS Narayanan INTERSPEECH, 3122-3126, 2017 | 11 | 2017 |
Effects of depression on speech S Sahu, C Espy-Wilson The Journal of the Acoustical Society of America 136 (4), 2312-2312, 2014 | 11 | 2014 |
Effect of depression on syllabic rate of speech S Sahu, C Espy-Wilson The Journal of the Acoustical Society of America 138, 1781, 2015 | 8 | 2015 |
Cross-modal Non-linear Guided Attention and Temporal Coherence in Multi-modal Deep Video Models S Sahu, P Goyal, S Ghosh, C Lee Proceedings of the 28th ACM International Conference on Multimedia, 313-321, 2020 | 5 | 2020 |
Enhancing Transformer for Video Understanding Using Gated Multi-Level Attention and Temporal Adversarial Training S Sahu, P Goyal arXiv preprint arXiv:2103.10043, 2021 | 3 | 2021 |
Can't Fool Me: Adversarially Robust Transformer for Video Understanding D Choudhary, P Goyal, S Sahu arXiv preprint arXiv:2110.13950, 2021 | 2 | 2021 |
Towards Building Generalizable Speech Emotion Recognition Models S Sahu | 2 | 2019 |
Leveraging Local Temporal Information for Multimodal Scene Classification S Sahu, P Goyal ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 1 | 2022 |
Exploiting Temporal Coherence for Multi-modal Video Categorization P Goyal, S Sahu, S Ghosh, C Lee arXiv preprint arXiv:2002.03844, 2020 | 1 | 2020 |