Articles with public access mandates - Shinji Watanabe
Not available anywhere: 6
The first multimodal information based speech processing (misp) challenge: Data, tasks, baselines and results
H Chen, H Zhou, J Du, CH Lee, J Chen, S Watanabe, SM Siniscalchi, ...
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
Mandates: National Natural Science Foundation of China
Improving end-to-end single-channel multi-talker speech recognition
W Zhang, X Chang, Y Qian, S Watanabe
IEEE/ACM Transactions on Audio, Speech, and Language Processing 28, 1385-1394, 2020
Mandates: National Natural Science Foundation of China
Findadaptnet: Find and insert adapters by learned layer importance
J Huang, K Ganesan, S Maiti, YM Kim, X Chang, P Liang, S Watanabe
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
Mandates: US National Science Foundation
Espnet-summ: Introducing a novel large dataset, toolkit, and a cross-corpora evaluation of speech summarization systems
R Sharma, W Chen, T Kano, R Sharma, S Arora, S Watanabe, A Ogawa, ...
2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023
Mandates: US National Science Foundation
Summary on the multimodal information based speech processing (MISP) 2022 challenge
H Chen, S Wu, Y Dai, Z Wang, J Du, CH Lee, J Chen, S Watanabe, ...
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
Mandates: National Natural Science Foundation of China
E-branchformer-based e2e slu toward stop on-device challenge
Y Kashiwagi, S Arora, H Futami, J Huynh, SL Wu, Y Peng, B Yan, ...
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
Mandates: US National Science Foundation
Available somewhere: 86
Self-supervised speech representation learning: A review
A Mohamed, H Lee, L Borgholt, JD Havtorn, J Edin, C Igel, K Kirchhoff, ...
IEEE Journal of Selected Topics in Signal Processing 16 (6), 1179-1210, 2022
Mandates: Innovation Fund Denmark
Improved MVDR beamforming using single-channel mask prediction networks.
H Erdogan, JR Hershey, S Watanabe, MI Mandel, J Le Roux
Interspeech, 1981-1985, 2016
Mandates: US National Science Foundation
Deep beamforming networks for multi-channel speech recognition
X Xiao, S Watanabe, H Erdogan, L Lu, J Hershey, ML Seltzer, G Chen, ...
2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016
Mandates: US National Science Foundation
Torchaudio: Building blocks for audio and speech processing
YY Yang, M Hira, Z Ni, A Astafurov, C Chen, C Puhrsch, D Pollack, ...
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
Mandates: Department of Science & Technology, India
Branchformer: Parallel mlp-attention architectures to capture local and global context for speech recognition and understanding
Y Peng, S Dalmia, I Lane, S Watanabe
International Conference on Machine Learning, 17627-17643, 2022
Mandates: US National Science Foundation
MIMO-Speech: End-to-end multi-channel multi-speaker speech recognition
X Chang, W Zhang, Y Qian, J Le Roux, S Watanabe
2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019
Mandates: National Natural Science Foundation of China
End-to-end multi-speaker speech recognition with transformer
X Chang, W Zhang, Y Qian, J Le Roux, S Watanabe
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
Mandates: National Natural Science Foundation of China
TF-GridNet: Integrating full-and sub-band modeling for speech separation
ZQ Wang, S Cornell, S Choi, Y Lee, BY Kim, S Watanabe
IEEE/ACM Transactions on Audio, Speech, and Language Processing 31, 3221-3236, 2023
Mandates: US National Science Foundation
Findings of the IWSLT 2022 Evaluation Campaign.
A Anastasopoulos, L Barrault, L Bentivogli, MZ Boito, O Bojar, R Cattoni, ...
Proceedings of the 19th international conference on spoken language …, 2022
Mandates: European Commission
Cycle-consistency training for end-to-end speech recognition
T Hori, R Astudillo, T Hayashi, Y Zhang, S Watanabe, J Le Roux
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
Mandates: Fundação para a Ciência e a Tecnologia, Portugal
ESPnet-SE: End-to-end speech enhancement and separation toolkit designed for ASR integration
C Li, J Shi, W Zhang, AS Subramanian, X Chang, N Kamo, M Hira, ...
2021 IEEE Spoken Language Technology Workshop (SLT), 785-792, 2021
Mandates: National Natural Science Foundation of China
End-to-end monaural multi-speaker ASR system without pretraining
X Chang, Y Qian, K Yu, S Watanabe
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
Mandates: National Natural Science Foundation of China
An exploration of self-supervised pretrained representations for end-to-end speech recognition
X Chang, T Maekaku, P Guo, J Shi, YJ Lu, AS Subramanian, T Wang, ...
2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021
Mandates: US National Science Foundation
Phasebook and friends: Leveraging discrete representations for source separation
J Le Roux, G Wichern, S Watanabe, A Sarroff, JR Hershey
IEEE Journal of Selected Topics in Signal Processing 13 (2), 370-382, 2019
Mandates: US National Science Foundation
Publication and funding information is determined automatically by a computer program