Deep neural network based learning and transferring mid-level audio features for acoustic scene classification | S Mun, S Shon, W Kim, DK Han, H Ko | 2017 IEEE international conference on acoustics, speech and signal …, 2017 | 74 | 2017 |
Automatic voice onset time detection for unvoiced stops (/p/,/t/,/k/) with application to accent classification | JHL Hansen, SS Gray, W Kim | Speech Communication 52 (10), 777-789, 2010 | 61 | 2010 |
Feature compensation in the cepstral domain employing model combination | W Kim, JHL Hansen | Speech Communication 51 (2), 83-96, 2009 | 61 | 2009 |
Autoencoder based domain adaptation for speaker recognition under insufficient channel information | S Shon, S Mun, W Kim, H Ko | arXiv preprint arXiv:1708.01227, 2017 | 41 | 2017 |
Noise variance estimation for Kalman filtering of noisy speech | W Kim, H Ko | IEICE TRANSACTIONS on Information and Systems 84 (1), 155-160, 2001 | 40 | 2001 |
Robust emotional stressed speech detection using weighted frequency subbands | JHL Hansen, W Kim, M Rahurkar, E Ruzanski, J Meyerhoff | EURASIP Journal on Advances in Signal Processing 2011, 1-10, 2011 | 38 | 2011 |
Spectral subtraction based on phonetic dependency and masking effects | W Kim, S Kang, H Ko | IEE Proceedings-Vision, Image and Signal Processing 147 (5), 423-427, 2000 | 35 | 2000 |
Mask classification for missing-feature reconstruction for robust speech recognition in unknown background noise | W Kim, RM Stern | Speech Communication 53 (1), 1-11, 2011 | 33 | 2011 |
Band-independent mask estimation for missing-feature reconstruction in the presence of unknown background noise | W Kim, RM Stern | 2006 IEEE International Conference on Acoustics Speech and Signal Processing …, 2006 | 33 | 2006 |
Time–Frequency Correlation-Based Missing-Feature Reconstruction for Robust Speech Recognition in Band-Restricted Conditions | W Kim, JHL Hansen | IEEE Transactions on Audio, Speech, and Language Processing 17 (7), 1292-1304, 2009 | 24 | 2009 |
Deep Neural Network Bottleneck Features for Acoustic Event Recognition | S Mun, S Shon, W Kim, H Ko | Interspeech, 2954-2957, 2016 | 23 | 2016 |
Voice Activity Detection in Noisy Environments Based on Double‐Combined Fourier Transform and Line Fitting | J Park, W Kim, DK Han, H Ko | The Scientific World Journal 2014 (1), 146040, 2014 | 22 | 2014 |
Missing-feature reconstruction by leveraging temporal spectral correlation for robust speech recognition in background noise conditions | W Kim, JHL Hansen | IEEE transactions on audio, speech, and language processing 18 (8), 2111-2120, 2010 | 17 | 2010 |
Angry emotion detection from real-life conversational speech by leveraging content structure | W Kim, JHL Hansen | 2010 IEEE International Conference on Acoustics, Speech and Signal …, 2010 | 17 | 2010 |
A novel mask estimation method employing posterior-based representative mean estimate for missing-feature speech recognition | W Kim, JHL Hansen | IEEE Transactions on Audio, Speech, and Language Processing 19 (5), 1434-1443, 2010 | 16 | 2010 |
A novel discriminative feature extraction for acoustic scene classification using RNN based source separation | S Mun, S Shon, W Kim, DK Han, H Ko | IEICE TRANSACTIONS on Information and Systems 100 (12), 3041-3044, 2017 | 15 | 2017 |
DNN transfer learning based non-linear feature extraction for acoustic event classification | S Mun, M Shin, S Shon, W Kim, DK Han, H Ko | IEICE TRANSACTIONS on Information and Systems 100 (9), 2249-2252, 2017 | 15 | 2017 |
Feature compensation scheme based on parallel combined mixture model | W Kim, S Ahn, H Ko | INTERSPEECH, 677-680, 2003 | 14 | 2003 |
Speech under stress and Lombard effect: impact and solutions for forensic speaker recognition | JHL Hansen, A Sangwan, W Kim | Forensic speaker recognition: law enforcement and counter-terrorism, 103-123, 2012 | 11 | 2012 |
Phonetic distance based confidence measure | W Kim, JHL Hansen | IEEE Signal Processing Letters 17 (2), 121-124, 2009 | 11 | 2009 |