Javni dostop

Članki z zahtevami za javni dostop - Shinji WatanabeVeč o tem

Ni na voljo nikjer: 6

Pregled

The first multimodal information based speech processing (misp) challenge: Data, tasks, baselines and results

H Chen, H Zhou, J Du, CH Lee, J Chen, S Watanabe, SM Siniscalchi, ...

ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022

Zahteve: National Natural Science Foundation of China

Pregled

Improving end-to-end single-channel multi-talker speech recognition

W Zhang, X Chang, Y Qian, S Watanabe

IEEE/ACM Transactions on Audio, Speech, and Language Processing 28, 1385-1394, 2020

Zahteve: National Natural Science Foundation of China

Pregled

Findadaptnet: Find and insert adapters by learned layer importance

J Huang, K Ganesan, S Maiti, YM Kim, X Chang, P Liang, S Watanabe

ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023

Zahteve: US National Science Foundation

Pregled

Espnet-summ: Introducing a novel large dataset, toolkit, and a cross-corpora evaluation of speech summarization systems

R Sharma, W Chen, T Kano, R Sharma, S Arora, S Watanabe, A Ogawa, ...

2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023

Zahteve: US National Science Foundation

Pregled

Summary on the multimodal information based speech processing (MISP) 2022 challenge

H Chen, S Wu, Y Dai, Z Wang, J Du, CH Lee, J Chen, S Watanabe, ...

ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023

Zahteve: National Natural Science Foundation of China

Pregled

E-branchformer-based e2e slu toward stop on-device challenge

Y Kashiwagi, S Arora, H Futami, J Huynh, SL Wu, Y Peng, B Yan, ...

ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023

Zahteve: US National Science Foundation

Na voljo nekje: 86

[PDF] arxiv.org

Pregled

Self-supervised speech representation learning: A review

A Mohamed, H Lee, L Borgholt, JD Havtorn, J Edin, C Igel, K Kirchhoff, ...

IEEE Journal of Selected Topics in Signal Processing 16 (6), 1179-1210, 2022

Zahteve: Innovation Fund Denmark

[PDF] isca-archive.org

Pregled

Improved MVDR beamforming using single-channel mask prediction networks.

H Erdogan, JR Hershey, S Watanabe, MI Mandel, J Le Roux

Interspeech, 1981-1985, 2016

Zahteve: US National Science Foundation

[PDF] ed.ac.uk

Pregled

Deep beamforming networks for multi-channel speech recognition

X Xiao, S Watanabe, H Erdogan, L Lu, J Hershey, ML Seltzer, G Chen, ...

2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016

Zahteve: US National Science Foundation

[PDF] arxiv.org

Pregled

Torchaudio: Building blocks for audio and speech processing

YY Yang, M Hira, Z Ni, A Astafurov, C Chen, C Puhrsch, D Pollack, ...

ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022

Zahteve: Department of Science & Technology, India

[PDF] mlr.press

Pregled

Branchformer: Parallel mlp-attention architectures to capture local and global context for speech recognition and understanding

Y Peng, S Dalmia, I Lane, S Watanabe

International Conference on Machine Learning, 17627-17643, 2022

Zahteve: US National Science Foundation

[PDF] arxiv.org

Pregled

MIMO-Speech: End-to-end multi-channel multi-speaker speech recognition

X Chang, W Zhang, Y Qian, J Le Roux, S Watanabe

2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019

Zahteve: National Natural Science Foundation of China

[PDF] arxiv.org

Pregled

End-to-end multi-speaker speech recognition with transformer

X Chang, W Zhang, Y Qian, J Le Roux, S Watanabe

ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020

Zahteve: National Natural Science Foundation of China

[PDF] arxiv.org

Pregled

TF-GridNet: Integrating full-and sub-band modeling for speech separation

ZQ Wang, S Cornell, S Choi, Y Lee, BY Kim, S Watanabe

IEEE/ACM Transactions on Audio, Speech, and Language Processing 31, 3221-3236, 2023

Zahteve: US National Science Foundation

[PDF] fbk.eu

Pregled

Findings of the IWSLT 2022 Evaluation Campaign.

A Anastasopoulos, L Barrault, L Bentivogli, MZ Boito, O Bojar, R Cattoni, ...

Proceedings of the 19th international conference on spoken language …, 2022

Zahteve: European Commission

[PDF] arxiv.org

Pregled

Cycle-consistency training for end-to-end speech recognition

T Hori, R Astudillo, T Hayashi, Y Zhang, S Watanabe, J Le Roux

ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019

Zahteve: Fundação para a Ciência e a Tecnologia, Portugal

[PDF] arxiv.org

Pregled

ESPnet-SE: End-to-end speech enhancement and separation toolkit designed for ASR integration

C Li, J Shi, W Zhang, AS Subramanian, X Chang, N Kamo, M Hira, ...

2021 IEEE Spoken Language Technology Workshop (SLT), 785-792, 2021

Zahteve: National Natural Science Foundation of China

[PDF] arxiv.org

Pregled

End-to-end monaural multi-speaker ASR system without pretraining

X Chang, Y Qian, K Yu, S Watanabe

ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019

Zahteve: National Natural Science Foundation of China

[PDF] arxiv.org

Pregled

An exploration of self-supervised pretrained representations for end-to-end speech recognition

X Chang, T Maekaku, P Guo, J Shi, YJ Lu, AS Subramanian, T Wang, ...

2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021

Zahteve: US National Science Foundation

[PDF] arxiv.org

Pregled

Phasebook and friends: Leveraging discrete representations for source separation

J Le Roux, G Wichern, S Watanabe, A Sarroff, JR Hershey

IEEE Journal of Selected Topics in Signal Processing 13 (2), 370-382, 2019

Zahteve: US National Science Foundation

Podatke o objavi in financiranju samodejno določi računalniški program