Follow
Shigeki Karita
Shigeki Karita
Verified email at google.com - Homepage
Title
Cited by
Cited by
Year
ESPnet: End-to-end speech processing toolkit
S Watanabe, T Hori, S Karita, T Hayashi, J Nishitoba, Y Unno, NEY Soplin, ...
arXiv preprint arXiv:1804.00015, 2018
16762018
A comparative study on transformer vs rnn in speech applications
S Karita, N Chen, T Hayashi, T Hori, H Inaguma, Z Jiang, M Someki, ...
2019 IEEE automatic speech recognition and understanding workshop (ASRU …, 2019
8592019
Improving Transformer-based End-to-End Speech Recognition with Connectionist Temporal Classification and Language Model Integration
S Karita, NEY Soplin, S Watanabe, M Delcroix, A Ogawa, T Nakatani
Proc. Interspeech 2019, 1408-1412, 2019
2702019
ESPnet-ST: All-in-one speech translation toolkit
H Inaguma, S Kiyono, K Duh, S Karita, NEY Soplin, T Hayashi, ...
arXiv preprint arXiv:2004.10234, 2020
1742020
Semi-Supervised End-to-End Speech Recognition
S Karita, S Watanabe, T Iwata, A Ogawa, M Delcroix
INTERSPEECH, 2-6, 2018
792018
Frame-by-frame closed-form update for mask-based adaptive MVDR beamforming
T Higuchi, K Kinoshita, N Ito, S Karita, T Nakatani
IEEE International Conference on Acoustics, Speech and Signal Processing, 2018
642018
The 2020 espnet update: new features, broadened applications, performance improvements, and future plans
S Watanabe, F Boyer, X Chang, P Guo, T Hayashi, Y Higuchi, T Hori, ...
2021 IEEE Data Science and Learning Workshop (DSLW), 1-6, 2021
562021
Semi-Supervised End-to-End Speech Recognition Using Text-to-Speech and Autoencoders
S Karita, S Watanabe, T Iwata, M Delcroix, A Ogawa, T Nakatani
IEEE International Conference on Acoustics, Speech, and Signal Processing, 2019
532019
DF-Conformer: Integrated architecture of Conv-TasNet and Conformer using linear complexity self-attention for speech enhancement
Y Koizumi, S Karita, S Wisdom, H Erdogan, JR Hershey, L Jones, ...
2021 IEEE Workshop on Applications of Signal Processing to Audio and …, 2021
512021
Auxiliary feature based adaptation of end-to-end ASR systems
M Delcroix, S Watanabe, A Ogawa, S Karita, T Nakatani
INTERSPEECH, 2018
472018
Far-field speech recognition using CNN-DNN-HMM with convolution in time
T Yoshioka, S Karita, T Nakatani
2015 IEEE international conference on acoustics, speech and signal …, 2015
392015
Libritts-r: A restored multi-speaker text-to-speech corpus
Y Koizumi, H Zen, S Karita, Y Ding, K Yatabe, N Morioka, M Bacchiani, ...
arXiv preprint arXiv:2305.18802, 2023
352023
Rescoring n-best speech recognition list based on one-on-one hypothesis comparison using encoder-classifier model
A Ogawa, M Delcroix, S Karita, T Nakatani
IEEE International Conference on Acoustics, Speech and Signal Processing, 2018
302018
Sequence training of encoder-decoder model using policy gradient for end-to-end speech recognition
S Karita, A Ogawa, M Delcroix, T Nakatani
IEEE International Conference on Acoustics, Speech and Signal Processing, 2018
302018
End-to-End SpeakerBeam for Single Channel Target Speech Recognition.
M Delcroix, S Watanabe, T Ochiai, K Kinoshita, S Karita, A Ogawa, ...
Interspeech, 451-455, 2019
282019
Knowledge transfer from large-scale pretrained language models to end-to-end speech recognizers
Y Kubo, S Karita, M Bacchiani
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
272022
Self-Distillation for Improving CTC-Transformer-Based ASR Systems.
T Moriya, T Ochiai, S Karita, H Sato, T Tanaka, T Ashihara, R Masumura, ...
INTERSPEECH, 546-550, 2020
242020
Espnet: End-to-end speech processing toolkit. arXiv 2018
S Watanabe, T Hori, S Karita, T Hayashi, J Nishitoba, Y Unno, NEY Soplin, ...
arXiv preprint arXiv:1804.00015, 2018
202018
Online meeting recognition in noisy environments with time-frequency mask based MVDR beamforming
S Araki, N Ito, M Delcroix, A Ogawa, K Kinoshita, T Higuchi, T Yoshioka, ...
2017 Hands-free Speech Communications and Microphone Arrays (HSCMA), 16-20, 2017
202017
Learning device, learning method, and learning program
A Ogawa, M Delcroix, S Karita, T Nakatani
US Patent App. 16/966,056, 2020
192020
The system can't perform the operation now. Try again later.
Articles 1–20