Shigeki Karita

Cited by

	All	Since 2019
Citations	3697	3638
h-index	19	19
i10-index	25	25

860

430

215

645

2017201820192020202120222023202410 37 188 442 778 749 852 622

Co-authors

Shinji WatanabeCarnegie Mellon UniversityVerified email at cmu.edu
Tomohiro NakataniNTT Communication Science LaboratoriesVerified email at ieee.org
Marc DelcroixNTT Communication Science LaboratoriesVerified email at ieee.org
Tomoki HayashiHuman Dataware Lab. Co., Ltd., Nagoya UniversityVerified email at g.sp.m.is.nagoya-u.ac.jp
Atsunori OgawaNTT Communication Science LaboratoriesVerified email at ieee.org
Takaaki HoriAppleVerified email at apple.com
Hirofumi InagumaFundamental AI Research (FAIR) at MetaVerified email at meta.com
Michiel BacchianiGoogle Inc.Verified email at google.com
Nanxin ChenMember of Technical StaffVerified email at openai.com
Jiro NishitobaRetrieva, Inc.Verified email at retrieva.jp
Wangyou ZhangPh.D. candidate, Department of Computer Science and Engineering, Shanghai Jiao Tong UniversityVerified email at sjtu.edu.cn
Yuma KoizumiGoogle DeepMindVerified email at google.com
Jahn HeymannApplied Scientist @ AmazonVerified email at amazon.com
Ryuichi YamamotoLY CorporationVerified email at lycorp.co.jp
Xiaofei WangMicrosoftVerified email at jhu.edu
Ziyan JiangAmazon AGIVerified email at amazon.com
Keisuke KinoshitaResearch Scientist at GoogleVerified email at ieee.org
Tomoharu IwataNTTVerified email at hco.ntt.co.jp
Yotaro KuboGoogle SpeechVerified email at ieee.org
Nobutaka ItoUniversity of Tokyo, Japan (formerly NTT)Verified email at ieee.org

Shigeki Karita

Google

Verified email at google.com - Homepage

Machine Learning Speech Recognition


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
ESPnet: End-to-end speech processing toolkit S Watanabe, T Hori, S Karita, T Hayashi, J Nishitoba, Y Unno, NEY Soplin, ... arXiv preprint arXiv:1804.00015, 2018	1676	2018
A comparative study on transformer vs rnn in speech applications S Karita, N Chen, T Hayashi, T Hori, H Inaguma, Z Jiang, M Someki, ... 2019 IEEE automatic speech recognition and understanding workshop (ASRU …, 2019	859	2019
Improving Transformer-based End-to-End Speech Recognition with Connectionist Temporal Classification and Language Model Integration S Karita, NEY Soplin, S Watanabe, M Delcroix, A Ogawa, T Nakatani Proc. Interspeech 2019, 1408-1412, 2019	270	2019
ESPnet-ST: All-in-one speech translation toolkit H Inaguma, S Kiyono, K Duh, S Karita, NEY Soplin, T Hayashi, ... arXiv preprint arXiv:2004.10234, 2020	174	2020
Semi-Supervised End-to-End Speech Recognition S Karita, S Watanabe, T Iwata, A Ogawa, M Delcroix INTERSPEECH, 2-6, 2018	79	2018
Frame-by-frame closed-form update for mask-based adaptive MVDR beamforming T Higuchi, K Kinoshita, N Ito, S Karita, T Nakatani IEEE International Conference on Acoustics, Speech and Signal Processing, 2018	64	2018
The 2020 espnet update: new features, broadened applications, performance improvements, and future plans S Watanabe, F Boyer, X Chang, P Guo, T Hayashi, Y Higuchi, T Hori, ... 2021 IEEE Data Science and Learning Workshop (DSLW), 1-6, 2021	56	2021
Semi-Supervised End-to-End Speech Recognition Using Text-to-Speech and Autoencoders S Karita, S Watanabe, T Iwata, M Delcroix, A Ogawa, T Nakatani IEEE International Conference on Acoustics, Speech, and Signal Processing, 2019	53	2019
DF-Conformer: Integrated architecture of Conv-TasNet and Conformer using linear complexity self-attention for speech enhancement Y Koizumi, S Karita, S Wisdom, H Erdogan, JR Hershey, L Jones, ... 2021 IEEE Workshop on Applications of Signal Processing to Audio and …, 2021	51	2021
Auxiliary feature based adaptation of end-to-end ASR systems M Delcroix, S Watanabe, A Ogawa, S Karita, T Nakatani INTERSPEECH, 2018	47	2018
Far-field speech recognition using CNN-DNN-HMM with convolution in time T Yoshioka, S Karita, T Nakatani 2015 IEEE international conference on acoustics, speech and signal …, 2015	39	2015
Libritts-r: A restored multi-speaker text-to-speech corpus Y Koizumi, H Zen, S Karita, Y Ding, K Yatabe, N Morioka, M Bacchiani, ... arXiv preprint arXiv:2305.18802, 2023	35	2023
Rescoring n-best speech recognition list based on one-on-one hypothesis comparison using encoder-classifier model A Ogawa, M Delcroix, S Karita, T Nakatani IEEE International Conference on Acoustics, Speech and Signal Processing, 2018	30	2018
Sequence training of encoder-decoder model using policy gradient for end-to-end speech recognition S Karita, A Ogawa, M Delcroix, T Nakatani IEEE International Conference on Acoustics, Speech and Signal Processing, 2018	30	2018
End-to-End SpeakerBeam for Single Channel Target Speech Recognition. M Delcroix, S Watanabe, T Ochiai, K Kinoshita, S Karita, A Ogawa, ... Interspeech, 451-455, 2019	28	2019
Knowledge transfer from large-scale pretrained language models to end-to-end speech recognizers Y Kubo, S Karita, M Bacchiani ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022	27	2022
Self-Distillation for Improving CTC-Transformer-Based ASR Systems. T Moriya, T Ochiai, S Karita, H Sato, T Tanaka, T Ashihara, R Masumura, ... INTERSPEECH, 546-550, 2020	24	2020
Espnet: End-to-end speech processing toolkit. arXiv 2018 S Watanabe, T Hori, S Karita, T Hayashi, J Nishitoba, Y Unno, NEY Soplin, ... arXiv preprint arXiv:1804.00015, 2018	20	2018
Online meeting recognition in noisy environments with time-frequency mask based MVDR beamforming S Araki, N Ito, M Delcroix, A Ogawa, K Kinoshita, T Higuchi, T Yoshioka, ... 2017 Hands-free Speech Communications and Microphone Arrays (HSCMA), 16-20, 2017	20	2017
Learning device, learning method, and learning program A Ogawa, M Delcroix, S Karita, T Nakatani US Patent App. 16/966,056, 2020	19	2020

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors