Obserwuj
Songxiang Liu
Songxiang Liu
Meituan, PhD (The Chinese University of Hong Kong)
Zweryfikowany adres z meituan.com - Strona główna
Tytuł
Cytowane przez
Cytowane przez
Rok
Speech emotion recognition using capsule networks
X Wu, S Liu, Y Cao, X Li, J Yu, D Dai, X Ma, S Hu, Z Wu, X Liu, H Meng
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
1402019
Hifi-codec: Group-residual vector quantization for high fidelity audio codec
D Yang, S Liu, R Huang, J Tian, C Weng, Y Zou
arXiv preprint arXiv:2305.02765, 2023
1242023
Uniaudio: An audio foundation model toward universal audio generation
D Yang, J Tian, X Tan, R Huang, S Liu, X Chang, J Shi, S Zhao, J Bian, ...
arXiv preprint arXiv:2310.00704, 2023
1212023
Any-to-Many Voice Conversion with Location-Relative Sequence-to-Sequence Modeling
S Liu, Y Cao, D Wang, X Wu, X Liu, H Meng
IEEE/ACM Transactions on Audio Speech and Language Processing, 2020
1122020
Instructtts: Modelling expressive tts in discrete latent space with natural language style prompt
D Yang, S Liu, R Huang, C Weng, H Meng
IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024
792024
Adversarial attacks on spoofing countermeasures of automatic speaker verification
S Liu, H Wu, H Lee, H Meng
2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019
782019
DiffGAN-TTS: High-Fidelity and Efficient Text-to-Speech with Denoising Diffusion GANs
S Liu, D Su, D Yu
ICML 2022 Workshop on Machine Learning for Audio Synthesis, 2022
742022
Defense against adversarial attacks on spoofing countermeasures of ASV
H Wu, S Liu, H Meng, H Lee
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
712020
Voice Conversion Across Arbitrary Speakers Based on a Single Target-Speaker Utterance.
S Liu, J Zhong, L Sun, X Wu, X Liu, H Meng
Interspeech, 496-500, 2018
692018
Diffsvc: A diffusion probabilistic model for singing voice conversion
S Liu, Y Cao, D Su, H Meng
2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021
642021
The singing voice conversion challenge 2023
WC Huang, LP Violeta, S Liu, J Shi, T Toda
2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023
592023
Fastsvc: Fast cross-domain singing voice conversion with feature-wise linear modulation
S Liu, Y Cao, N Hu, D Su, H Meng
2021 ieee international conference on multimedia and expo (icme), 1-6, 2021
512021
End-to-end code-switched tts with mix of monolingual recordings
Y Cao, X Wu, S Liu, J Yu, X Li, Z Wu, X Liu, H Meng
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
502019
End-to-end voice conversion via cross-modal knowledge distillation for dysarthric speech reconstruction
D Wang, J Yu, X Wu, S Liu, L Sun, X Liu, H Meng
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
482020
End-to-end accent conversion without using native utterances
S Liu, D Wang, Y Cao, L Sun, X Wu, S Kang, Z Wu, X Liu, D Su, D Yu, ...
ICASSP 2020, 2020
482020
Speech emotion recognition using sequential capsule networks
X Wu, Y Cao, H Lu, S Liu, D Wang, Z Wu, X Liu, H Meng
IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 3280-3291, 2021
362021
Vara-tts: Non-autoregressive text-to-speech synthesis based on very deep vae with residual attention
P Liu, Y Cao, S Liu, N Hu, G Li, C Weng, D Su
arXiv preprint arXiv:2102.06431, 2021
362021
Code-switched speech synthesis using bilingual phonetic posteriorgram with only monolingual corpora
Y Cao, S Liu, X Wu, S Kang, P Liu, Z Wu, X Liu, D Su, D Yu, H Meng
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
292020
Transferring source style in non-parallel voice conversion
S Liu, Y Cao, S Kang, N Hu, X Liu, D Su, D Yu, H Meng
INTERSPEECH 2020, 2020
252020
ASR-GLUE: A new multi-task benchmark for asr-robust natural language understanding
L Feng, J Yu, D Cai, S Liu, H Zheng, Y Wang
arXiv preprint arXiv:2108.13048, 2021
192021
Nie można teraz wykonać tej operacji. Spróbuj ponownie później.
Prace 1–20