Taejin Park
Title
Cited by
Year
A review of speaker diarization: Recent advances with deep learning
TJ Park, N Kanda, D Dimitriadis, KJ Han, S Watanabe, S Narayanan
Computer Speech & Language 72, 101317, 2022
Cited by: 391, Year: 2022
Auto-tuning spectral clustering for speaker diarization using normalized maximum eigengap
TJ Park, KJ Han, M Kumar, S Narayanan
IEEE Signal Processing Letters 27, 381-385, 2019
Cited by: 140, Year: 2019
Titanet: Neural model for speaker representation with 1d depth-wise separable convolutions and global context
NR Koluguri, T Park, B Ginsburg
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
Cited by: 111, Year: 2022
Binaural rendering method and apparatus for decoding multi channel audio
YJ Lee, JI Seo, JH Yoo, SK Beack, JM Sung, TJ Lee, KO Kang, JW Kim, ...
US Patent 9,319,819, 2016
Cited by: 50, Year: 2016
Musical instrument sound classification with deep convolutional neural network using feature fusion approach
T Park, T Lee
arXiv preprint arXiv:1512.07370, 2015
Cited by: 49, Year: 2015
Multimodal speaker segmentation and diarization using lexical and acoustic cues via sequence to sequence neural networks
TJ Park, P Georgiou
arXiv preprint arXiv:1805.10731, 2018
Cited by: 45, Year: 2018
Speaker diarization with lexical information
TJ Park, KJ Han, J Huang, X He, B Zhou, P Georgiou, S Narayanan
arXiv preprint arXiv:2004.06756, 2020
Cited by: 39, Year: 2020
Multi-scale speaker diarization with dynamic scale weighting
TJ Park, NR Koluguri, J Balam, B Ginsburg
arXiv preprint arXiv:2203.15974, 2022
Cited by: 28, Year: 2022
Speaker diarization using latent space clustering in generative adversarial network
M Pal, M Kumar, R Peri, TJ Park, SH Kim, C Lord, S Bishop, S Narayanan
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
Cited by: 26, Year: 2020
Meta-learning with latent space clustering in generative adversarial network for speaker diarization
M Pal, M Kumar, R Peri, TJ Park, SH Kim, C Lord, S Bishop, S Narayanan
IEEE/ACM transactions on audio, speech, and language processing 29, 1204-1219, 2021
Cited by: 24, Year: 2021
Tackling dynamics in federated incremental learning with variational embedding rehearsal
TJ Park, K Kumatani, D Dimitriadis
arXiv preprint arXiv:2110.09695, 2021
Cited by: 20, Year: 2021
Multi-scale speaker diarization with neural affinity score fusion
TJ Park, M Kumar, S Narayanan
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
Cited by: 19, Year: 2021
Enhancing speaker diarization with large language models: A contextual beam search approach
TJ Park, K Dhawan, N Koluguri, J Balam
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
Cited by: 15, Year: 2024
Automatic prediction of suicidal risk in military couples using multimodal interaction cues from couples conversations
SN Chakravarthula, M Nasir, SY Tseng, H Li, TJ Park, B Baucom, ...
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
Cited by: 15, Year: 2020
Multi-Task Discriminative Training of Hybrid DNN-TVM Model for Speaker Verification with Noisy and Far-Field Speech.
A Jati, R Peri, M Pal, TJ Park, N Kumar, R Travadi, PG Georgiou, ...
Interspeech, 2463-2467, 2019
Cited by: 15, Year: 2019
Encoding/decoding apparatus for processing channel signal and method therefor
JI Seo, SK Beack, DY Jang, KO Kang, TJ Park, YJ Lee, KW Choi, JW Kim
US Patent 10,068,579, 2018
Cited by: 13, Year: 2018
The Second DIHARD Challenge: System Description for USC-SAIL Team.
TJ Park, M Kumar, N Flemotomos, M Pal, R Peri, R Lahiri, PG Georgiou, ...
INTERSPEECH, 998-1002, 2019
Cited by: 11, Year: 2019
Robust multi-channel speech recognition using frequency aligned network
T Park, K Kumatani, M Wu, S Sundaram
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
Cited by: 8, Year: 2020
Binaural rendering method and apparatus for decoding multi channel audio
YJ Lee, JI Seo, JH Yoo, SK Beack, JM Sung, TJ Lee, KO Kang, JW Kim, ...
US Patent 10,199,045, 2019
Cited by: 8, Year: 2019
Apparatus for processing audio signal for sound bar and method therefor
JI Seo, DY Jang, TJ Park, KW Choi, KO Kang, JW Kim
US Patent App. 14/760,770, 2015
Cited by: 8, Year: 2015