Songxiang Liu

Cited by

	All	Since 2020
Citations	1519	1489
h-index	19	19
i10-index	31	30

Citations per year

Year	Citations
2018	4
2019	24
2020	98
2021	152
2022	238
2023	294
2024	544
2025	158

Public access

22 articles available
0 articles not available

Based on funding mandates

Meituan, PhD (The Chinese University of Hong Kong)

Verified email at meituan.com

Multi-Modal LLM, Audio foundation model, Speech synthesis


Title	Authors	Venue	Cited by	Year
Speech emotion recognition using capsule networks	X Wu, S Liu, Y Cao, X Li, J Yu, D Dai, X Ma, S Hu, Z Wu, X Liu, H Meng	ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and …, 2019	140	2019
HiFi-Codec: Group-residual vector quantization for high fidelity audio codec	D Yang, S Liu, R Huang, J Tian, C Weng, Y Zou	arXiv preprint arXiv:2305.02765, 2023	124	2023
UniAudio: An audio foundation model toward universal audio generation	D Yang, J Tian, X Tan, R Huang, S Liu, X Chang, J Shi, S Zhao, J Bian, ...	arXiv preprint arXiv:2310.00704, 2023	121	2023
Any-to-Many Voice Conversion with Location-Relative Sequence-to-Sequence Modeling	S Liu, Y Cao, D Wang, X Wu, X Liu, H Meng	IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2020	112	2020
InstructTTS: Modelling expressive TTS in discrete latent space with natural language style prompt	D Yang, S Liu, R Huang, C Weng, H Meng	IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024	79	2024
Adversarial attacks on spoofing countermeasures of automatic speaker verification	S Liu, H Wu, H Lee, H Meng	2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019	78	2019
DiffGAN-TTS: High-Fidelity and Efficient Text-to-Speech with Denoising Diffusion GANs	S Liu, D Su, D Yu	ICML 2022 Workshop on Machine Learning for Audio Synthesis, 2022	74	2022
Defense against adversarial attacks on spoofing countermeasures of ASV	H Wu, S Liu, H Meng, H Lee	ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and …, 2020	71	2020
Voice Conversion Across Arbitrary Speakers Based on a Single Target-Speaker Utterance	S Liu, J Zhong, L Sun, X Wu, X Liu, H Meng	Interspeech, 496-500, 2018	69	2018
DiffSVC: A diffusion probabilistic model for singing voice conversion	S Liu, Y Cao, D Su, H Meng	2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021	64	2021
The singing voice conversion challenge 2023	WC Huang, LP Violeta, S Liu, J Shi, T Toda	2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023	59	2023
FastSVC: Fast cross-domain singing voice conversion with feature-wise linear modulation	S Liu, Y Cao, N Hu, D Su, H Meng	2021 IEEE International Conference on Multimedia and Expo (ICME), 1-6, 2021	51	2021
End-to-end code-switched TTS with mix of monolingual recordings	Y Cao, X Wu, S Liu, J Yu, X Li, Z Wu, X Liu, H Meng	ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and …, 2019	50	2019
End-to-end voice conversion via cross-modal knowledge distillation for dysarthric speech reconstruction	D Wang, J Yu, X Wu, S Liu, L Sun, X Liu, H Meng	ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and …, 2020	48	2020
End-to-end accent conversion without using native utterances	S Liu, D Wang, Y Cao, L Sun, X Wu, S Kang, Z Wu, X Liu, D Su, D Yu, ...	ICASSP 2020, 2020	48	2020
Speech emotion recognition using sequential capsule networks	X Wu, Y Cao, H Lu, S Liu, D Wang, Z Wu, X Liu, H Meng	IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 3280-3291, 2021	36	2021
VARA-TTS: Non-autoregressive text-to-speech synthesis based on very deep VAE with residual attention	P Liu, Y Cao, S Liu, N Hu, G Li, C Weng, D Su	arXiv preprint arXiv:2102.06431, 2021	36	2021
Code-switched speech synthesis using bilingual phonetic posteriorgram with only monolingual corpora	Y Cao, S Liu, X Wu, S Kang, P Liu, Z Wu, X Liu, D Su, D Yu, H Meng	ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and …, 2020	29	2020
Transferring source style in non-parallel voice conversion	S Liu, Y Cao, S Kang, N Hu, X Liu, D Su, D Yu, H Meng	INTERSPEECH 2020, 2020	25	2020
ASR-GLUE: A new multi-task benchmark for ASR-robust natural language understanding	L Feng, J Yu, D Cai, S Liu, H Zheng, Y Wang	arXiv preprint arXiv:2108.13048, 2021	19	2021
