Articles with open access mandates - Shinji Watanabe
Not available: 3
FindAdaptNet: Find and insert adapters by learned layer importance
J Huang, K Ganesan, S Maiti, YM Kim, X Chang, P Liang, S Watanabe
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
Mandate: US National Science Foundation
Summary on the Multimodal Information Based Speech Processing (MISP) 2022 Challenge
H Chen, S Wu, Y Dai, Z Wang, J Du, CH Lee, J Chen, S Watanabe, ...
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
Mandate: National Natural Science Foundation of China
E-Branchformer-Based E2E SLU Toward STOP On-Device Challenge
Y Kashiwagi, S Arora, H Futami, J Huynh, SL Wu, Y Peng, B Yan, ...
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
Mandate: US National Science Foundation
Available: 78
Improved MVDR beamforming using single-channel mask prediction networks
H Erdogan, JR Hershey, S Watanabe, MI Mandel, J Le Roux
Interspeech, 1981-1985, 2016
Mandate: US National Science Foundation
Self-supervised speech representation learning: A review
A Mohamed, H Lee, L Borgholt, JD Havtorn, J Edin, C Igel, K Kirchhoff, ...
IEEE Journal of Selected Topics in Signal Processing 16 (6), 1179-1210, 2022
Mandate: Innovation Fund Denmark
Deep beamforming networks for multi-channel speech recognition
X Xiao, S Watanabe, H Erdogan, L Lu, J Hershey, ML Seltzer, G Chen, ...
2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016
Mandate: US National Science Foundation
Torchaudio: Building blocks for audio and speech processing
YY Yang, M Hira, Z Ni, A Astafurov, C Chen, C Puhrsch, D Pollack, ...
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
Mandate: Department of Science & Technology, India
Branchformer: Parallel MLP-attention architectures to capture local and global context for speech recognition and understanding
Y Peng, S Dalmia, I Lane, S Watanabe
International Conference on Machine Learning, 17627-17643, 2022
Mandate: US National Science Foundation
MIMO-Speech: End-to-end multi-channel multi-speaker speech recognition
X Chang, W Zhang, Y Qian, J Le Roux, S Watanabe
2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019
Mandate: National Natural Science Foundation of China
End-to-end multi-speaker speech recognition with transformer
X Chang, W Zhang, Y Qian, J Le Roux, S Watanabe
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
Mandate: National Natural Science Foundation of China
Findings of the IWSLT 2022 Evaluation Campaign
A Anastasopoulos, L Barrault, L Bentivogli, MZ Boito, O Bojar, R Cattoni, ...
Proceedings of the 19th International Conference on Spoken Language …, 2022
Mandate: European Commission
Cycle-consistency training for end-to-end speech recognition
T Hori, R Astudillo, T Hayashi, Y Zhang, S Watanabe, J Le Roux
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
Mandate: Fundação para a Ciência e a Tecnologia, Portugal
TF-GridNet: Integrating full- and sub-band modeling for speech separation
ZQ Wang, S Cornell, S Choi, Y Lee, BY Kim, S Watanabe
IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2023
Mandate: US National Science Foundation
Espresso: A fast end-to-end neural speech recognition toolkit
Y Wang, T Chen, H Xu, S Ding, H Lv, Y Shao, N Peng, L Xie, S Watanabe, ...
2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019
Mandate: US Office of the Director of National Intelligence
ESPnet-SE: End-to-end speech enhancement and separation toolkit designed for ASR integration
C Li, J Shi, W Zhang, AS Subramanian, X Chang, N Kamo, M Hira, ...
2021 IEEE Spoken Language Technology Workshop (SLT), 785-792, 2021
Mandate: National Natural Science Foundation of China
End-to-end monaural multi-speaker ASR system without pretraining
X Chang, Y Qian, K Yu, S Watanabe
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
Mandate: National Natural Science Foundation of China
Phasebook and friends: Leveraging discrete representations for source separation
J Le Roux, G Wichern, S Watanabe, A Sarroff, JR Hershey
IEEE Journal of Selected Topics in Signal Processing 13 (2), 370-382, 2019
Mandate: US National Science Foundation
An exploration of self-supervised pretrained representations for end-to-end speech recognition
X Chang, T Maekaku, P Guo, J Shi, YJ Lu, AS Subramanian, T Wang, ...
2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021
Mandate: US National Science Foundation
ESPnet-SLU: Advancing spoken language understanding through ESPnet
S Arora, S Dalmia, P Denisov, X Chang, Y Ueda, Y Peng, Y Zhang, ...
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
Mandate: US National Science Foundation
Sequence summarizing neural network for speaker adaptation
K Veselý, S Watanabe, K Žmolíková, M Karafiát, L Burget, JH Černocký
2016 IEEE international conference on acoustics, speech and signal …, 2016
Mandate: US National Science Foundation
Publication and funding information is determined automatically by a computer program