Creating speaker independent ASR system through prosody modification based data augmentation S Shahnawazuddin, N Adiga, HK Kathania, BT Sai Pattern Recognition Letters 131, 213-218, 2020 | 54 | 2020 |
Study of formant modification for children ASR HK Kathania, SR Kadiri, P Alku, M Kurimo ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 47 | 2020 |
Effect of prosody modification on children's ASR S Shahnawazuddin, N Adiga, HK Kathania IEEE Signal Processing Letters 24 (11), 1749-1753, 2017 | 38 | 2017 |
A Formant Modification Method for Improved ASR of Children’s Speech H Kathania, S Kadiri, P Alku, M Kurimo Speech Communication, 2022 | 25 | 2022 |
Developing speaker independent ASR system using limited data through prosody modification based on fuzzy classification of spectral bins S Shahnawazuddin, N Adiga, BT Sai, W Ahmad, HK Kathania Digital Signal Processing 93, 34-42, 2019 | 21 | 2019 |
Role of prosodic features on children's speech recognition HK Kathania, S Shahnawazuddin, N Adiga, W Ahmad 2018 IEEE international conference on acoustics, speech and signal …, 2018 | 21 | 2018 |
Explicit pitch mapping for improved children’s speech recognition HK Kathania, W Ahmad, S Shahnawazuddin, AB Samaddar Circuits, Systems, and Signal Processing 37, 2021-2044, 2018 | 17 | 2018 |
Data augmentation using prosody and false starts to recognize non-native children's speech H Kathania, M Singh, T Grósz, M Kurimo arXiv preprint arXiv:2008.12914, 2020 | 16 | 2020 |
Enhancing the recognition of children's speech on acoustically mismatched ASR system S Shahnawazuddin, HK Kathania, R Sinha TENCON 2015-2015 IEEE Region 10 Conference, 1-5, 2015 | 16 | 2015 |
Role of linear, mel and inverse-mel filterbanks in automatic recognition of speech from high-pitched speakers HK Kathania, S Shahnawazuddin, W Ahmad, N Adiga Circuits, Systems, and Signal Processing 38, 4667-4682, 2019 | 15 | 2019 |
Improving Children's Speech Recognition Through Explicit Pitch Scaling Based on Iterative Spectrogram Inversion. W Ahmad, S Shahnawazuddin, HK Kathania, G Pradhan, AB Samaddar Interspeech, 2391-2395, 2017 | 15 | 2017 |
Studying the role of pitch-adaptive spectral estimation and speaking-rate normalization in automatic speech recognition S Shahnawazuddin, N Adiga, HK Kathania, G Pradhan, R Sinha Digital Signal Processing 79, 142-151, 2018 | 14 | 2018 |
Exploring HLDA based transformation for reducing acoustic mismatch in context of children speech recognition HK Kathania, S Shahnawazuddin, R Sinha 2014 International Conference on Signal Processing and Communications (SPCOM …, 2014 | 13 | 2014 |
Synthesis speech based data augmentation for low resource children ASR V Kadyan, H Kathania, P Govil, M Kurimo Speech and Computer: 23rd International Conference, SPECOM 2021, St …, 2021 | 11 | 2021 |
Using data augmentation and time-scale modification to improve asr of children’s speech in noisy environments HK Kathania, SR Kadiri, P Alku, M Kurimo Applied Sciences 11 (18), 8420, 2021 | 10 | 2021 |
Improving Children's Speech Recognition Through Time Scale Modification Based Speaking Rate Adaptation HK Kathania, S Shahnawazuddin, W Ahmad, N Adiga, SK Jana, ... 2018 International Conference on Signal Processing and Communications (SPCOM …, 2018 | 10 | 2018 |
Data augmentation using spectral warping for low resource children asr HK Kathania, V Kadyan, SR Kadiri, M Kurimo Journal of Signal Processing Systems 94 (12), 1507-1513, 2022 | 8 | 2022 |
On the role of linear, mel and inverse-mel filterbank in the context of automatic speech recognition HK Kathania, S Shahnawazuddin, W Ahmad, N Adiga 2019 National Conference on Communications (NCC), 1-5, 2019 | 7 | 2019 |
An experimental study on the significance of variable frame-length and overlap in the context of children’s speech recognition S Shahnawazuddin, C Singh, HK Kathania, W Ahmad, G Pradhan Circuits, Systems, and Signal Processing 37 (12), 5540-5553, 2018 | 6 | 2018 |
Improving children’s mismatched ASR using structured low-rank feature projection S Shahnawazuddin, HK Kathania, A Dey, R Sinha Speech Communication 105, 103-113, 2018 | 5 | 2018 |