ShiLiang Zhang

Cited by

	All	Since 2020
Citations	2711	2300
h-index	27	24
i10-index	54	44

Citations per year

Year	Citations
2015	30
2016	60
2017	69
2018	111
2019	126
2020	166
2021	214
2022	245
2023	355
2024	951
2025	369

Public access

18 articles available
6 articles not available
Based on funding mandates

Co-authors

Zhi-Jie Yan, iDST, Alibaba Inc. (verified email at alibaba-inc.com)
Zhihao Du, Alibaba (verified email at alibaba-inc.com)
gao zhifu, Tongyi Lab, Alibaba Group (verified email at alibaba-inc.com)
Qian Chen (陈谦), Alibaba Group (verified email at alibaba-inc.com)
Fan Yu, Northwestern Polytechnical University (verified email at mail.nwpu.edu.cn)
Hui Jiang, Professor of Electrical Engineering and Computer Science, York University (verified email at cse.yorku.ca)
Lei Xie, Northwestern Polytechnical University (verified email at nwpu.edu.cn)
Ian McLoughlin, Professor, Singapore Institute of Technology (Singapore) and USTC (China) (verified email at singaporetech.edu.sg)
Jianshu Zhang, iFLYTEK Research (verified email at iflytek.com)
Jun Du, Professor, NERC-SLIP, USTC (verified email at ustc.edu.cn)
Bin Ma, Alibaba (verified email at alibaba-inc.com)

ShiLiang Zhang

SpeechLab, Alibaba

Verified email at mail.ustc.edu.cn

Deep Learning ASR, TTS, LLM


Title	Authors	Venue	Cited by	Year
Qwen-audio: Advancing universal audio understanding via unified large-scale audio-language models	Y Chu, J Xu, X Zhou, Q Yang, S Zhang, Z Yan, C Zhou, J Zhou	arXiv preprint arXiv:2311.07919, 2023	262	2023
Watch, attend and parse: An end-to-end neural network based approach to handwritten mathematical expression recognition	J Zhang, J Du, S Zhang, D Liu, Y Hu, J Hu, S Wei, L Dai	Pattern Recognition 71, 196-206, 2017	259	2017
Deep-FSMN for large vocabulary continuous speech recognition	S Zhang, M Lei, Z Yan, L Dai	2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018	141	2018
Paraformer: Fast and accurate parallel transformer for non-autoregressive end-to-end speech recognition	Z Gao, S Zhang, I McLoughlin, Z Yan	arXiv preprint arXiv:2206.08317, 2022	109	2022
M2MeT: The ICASSP 2022 multi-channel multi-party meeting transcription challenge	F Yu, S Zhang, Y Fu, L Xie, S Zheng, Z Du, W Huang, P Guo, Z Yan, B Ma, ...	ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022	105	2022
emotion2vec: Self-supervised pre-training for speech emotion representation	Z Ma, Z Zheng, J Ye, J Li, Z Gao, S Zhang, X Chen	arXiv preprint arXiv:2312.15185, 2023	97	2023
Feedforward sequential memory networks: A new structure to learn long-term dependency	S Zhang, C Liu, H Jiang, S Wei, L Dai, Y Hu	arXiv preprint arXiv:1512.08301, 2015	91	2015
The Fixed-Size Ordinally-Forgetting Encoding Method for Neural Network Language Models	S Zhang, H Jiang, M Xu, J Hou, L Dai	ACL2015, 495, 2015	88	2015
Cosyvoice: A scalable multilingual zero-shot text-to-speech synthesizer based on supervised semantic tokens	Z Du, Q Chen, S Zhang, K Hu, H Lu, Y Yang, H Hu, S Zheng, Y Gu, Z Ma, ...	arXiv preprint arXiv:2407.05407, 2024	79	2024
Lauragpt: Listen, attend, understand, and regenerate audio with gpt	Z Du, J Wang, Q Chen, Y Chu, Z Gao, Z Li, K Hu, X Zhou, J Xu, Z Ma, ...	arXiv preprint arXiv:2310.04673, 2023	73	2023
Summary On The ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription Grand Challenge	HB Fan Yu, Shiliang Zhang, Pengcheng Guo, Yihui Fu, Zhihao Du, Siqi Zheng ...	ICASSP, 2022	63*	2022
Investigation of Transformer Based Spelling Correction Model for CTC-Based End-to-End Mandarin Speech Recognition	S Zhang, M Lei, Z Yan	Interspeech, 2180-2184, 2019	63*	2019
MDERank: A masked document embedding rank approach for unsupervised keyphrase extraction	L Zhang, Q Chen, W Wang, C Deng, SL Zhang, B Li, W Wang, X Cao	arXiv preprint arXiv:2110.06651, 2021	62	2021
FunASR: A fundamental end-to-end speech recognition toolkit	Z Gao, Z Li, J Wang, H Luo, X Shi, M Chen, Y Li, L Zuo, Z Du, Z Xiao, ...	Proc. INTERSPEECH, 2023	61	2023
Funcodec: A fundamental, reproducible and integrable open-source toolkit for neural speech codec	Z Du, S Zhang, K Hu, S Zheng	ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024	59	2024
Simplified self-attention for transformer-based end-to-end speech recognition	H Luo, S Zhang, M Lei, L Xie	2021 IEEE Spoken Language Technology Workshop (SLT), 75-81, 2021	56	2021
Prosospeech: Enhancing prosody with quantized vector pre-training in text-to-speech	Y Ren, M Lei, Z Huang, S Zhang, Q Chen, Z Yan, Z Zhao	ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022	45	2022
Robust audio-visual speech recognition using bimodal DFSMN with multi-condition training and dropout regularization	S Zhang, M Lei, B Ma, L Xie	ICASSP 2019-2019 IEEE international conference on acoustics, speech and …, 2019	44	2019
Gaussian Prediction Based Attention for Online End-to-End Speech Recognition	J Hou, S Zhang, LR Dai	Interspeech, 3692-3696, 2017	40	2017
Improving deep neural networks for LVCSR using dropout and shrinking structure	S Zhang, Y Bao, P Zhou, H Jiang, L Dai	2014 IEEE International Conference on Acoustics, Speech and Signal …, 2014	40	2014
