Automatic speech emotion recognition using recurrent neural networks with local attention S Mirsamadi, E Barsoum, C Zhang ICASSP 2017, 2227-2231, 2017 | 856 | 2017 |
Causal Speech Enhancement Combining Data-Driven Learning and Suppression Rule Estimation S Mirsamadi, I Tashev Interspeech 2016, 2870-2874, 2016 | 37 | 2016 |
DNN-based causal voice activity detector I Tashev, S Mirsamadi Information Theory and Applications Workshop, 2016 | 28 | 2016 |
Multi-domain adversarial training of neural network acoustic models for distant speech recognition S Mirsamadi, JHL Hansen Speech Communication 106, 21-30, 2019 | 26 | 2019 |
A study on deep neural network acoustic model adaptation for robust far-field speech recognition S Mirsamadi, JHL Hansen Interspeech 2015, 2430-2434, 2015 | 26 | 2015 |
On multi-domain training and adaptation of end-to-end RNN acoustic models for distant speech recognition S Mirsamadi, JHL Hansen Interspeech 2017, 2017 | 23 | 2017 |
Multichannel speech dereverberation based on convolutive nonnegative tensor factorization for ASR applications S Mirsamadi, JHL Hansen Interspeech 2014, 2828-2832, 2014 | 20 | 2014 |
A generalized nonnegative tensor factorization approach for distant speech recognition with distributed microphones S Mirsamadi, JHL Hansen IEEE/ACM Transactions on Audio, Speech and Language Processing (TASLP) 24 …, 2016 | 17 | 2016 |
Multichannel feature enhancement in distributed microphone arrays for robust distant speech recognition in smart rooms S Mirsamadi, JHL Hansen 2014 IEEE Spoken Language Technology Workshop (SLT), 507-512, 2014 | 12 | 2014 |
A multimodal approach to device-directed speech detection with large language models D Wagner, A Churchill, S Sigtia, P Georgiou, M Mirsamadi, A Mishra, ... ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 9 | 2024 |
Efficient frequency domain implementation of noncausal multichannel blind deconvolution for convolutive mixtures of speech S Mirsamadi, S Ghaffarzadegan, H Sheikhzadeh, SM Ahadi, AH Rezaie IEEE transactions on audio, speech, and language processing 20 (8), 2365-2377, 2012 | 8 | 2012 |
Multimodal data and resource efficient device-directed speech detection with large foundation models D Wagner, A Churchill, S Sigtia, P Georgiou, M Mirsamadi, A Mishra, ... arXiv preprint arXiv:2312.03632, 2023 | 4 | 2023 |
Robust acoustic modeling and front-end design for distant speech recognition S Mirsamadi The University of Texas at Dallas, 2017 | 1 | 2017 |