Folgen
Matt Mirsamadi
Matt Mirsamadi
Sonstige NamenSeyedmahdad Mirsamadi
Machine Learning Research Engineer at Apple Siri
Bestätigte E-Mail-Adresse bei apple.com
Titel
Zitiert von
Zitiert von
Jahr
Automatic speech emotion recognition using recurrent neural networks with local attention
S Mirsamadi, E Barsoum, C Zhang
ICASSP 2017, 2227-2231, 2017
8562017
Causal Speech Enhancement Combining Data-Driven Learning and Suppression Rule Estimation
S Mirsamadi, I Tashev
Interspeech 2016, 2870-2874, 2016
372016
DNN-based causal voice activity detector
I Tashev, S Mirsamadi
Information Theory and Applications Workshop, 2016
282016
Multi-domain adversarial training of neural network acoustic models for distant speech recognition
S Mirsamadi, JHL Hansen
Speech Communication 106, 21-30, 2019
262019
A study on deep neural network acoustic model adaptation for robust far-field speech recognition
S Mirsamadi, JHL Hansen
Interspeech 2015, 2430-2434, 2015
262015
On multi-domain training and adaptation of end-to-end RNN acoustic models for distant speech recognition
S Mirsamadi, JHL Hansen
Interspeech 2017, 2017
232017
Multichannel speech dereverberation based on convolutive nonnegative tensor factorization for ASR applications
S Mirsamadi, JHL Hansen
Interspeech 2014, 2828-2832, 2014
202014
A generalized nonnegative tensor factorization approach for distant speech recognition with distributed microphones
S Mirsamadi, JHL Hansen
IEEE/ACM Transactions on Audio, Speech and Language Processing (TASLP) 24 …, 2016
172016
Multichannel feature enhancement in distributed microphone arrays for robust distant speech recognition in smart rooms
S Mirsamadi, JHL Hansen
2014 IEEE Spoken Language Technology Workshop (SLT), 507-512, 2014
122014
A multimodal approach to device-directed speech detection with large language models
D Wagner, A Churchill, S Sigtia, P Georgiou, M Mirsamadi, A Mishra, ...
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
92024
Efficient frequency domain implementation of noncausal multichannel blind deconvolution for convolutive mixtures of speech
S Mirsamadi, S Ghaffarzadegan, H Sheikhzadeh, SM Ahadi, AH Rezaie
IEEE transactions on audio, speech, and language processing 20 (8), 2365-2377, 2012
82012
Multimodal data and resource efficient device-directed speech detection with large foundation models
D Wagner, A Churchill, S Sigtia, P Georgiou, M Mirsamadi, A Mishra, ...
arXiv preprint arXiv:2312.03632, 2023
42023
Robust acoustic modeling and front-end design for distant speech recognition
S Mirsamadi
The University of Texas at Dallas, 2017
12017
Das System kann den Vorgang jetzt nicht ausführen. Versuchen Sie es später erneut.
Artikel 1–13