| Seed-TTS: A family of high-quality versatile speech generation models | P Anastassiou, J Chen, J Chen, Y Chen, Z Chen, Z Chen, J Cong, L Deng, ... | arXiv preprint arXiv:2406.02430, 2024 | 61 | 2024 |
| Fully integer-based quantization for mobile convolutional neural network inference | P Peng, M You, W Xu, J Li | Neurocomputing 432, 194-205, 2021 | 25 | 2021 |
| Streaming voice conversion via intermediate bottleneck features and non-streaming teacher guidance | Y Chen, M Tu, T Li, X Li, Q Kong, J Li, Z Wang, Q Tian, Y Wang, Y Wang | ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 7 | 2023 |
| A two-domain coordinated sentence similarity scheme for question-answering robots regarding unpredictable outliers and non-orthogonal categories | B Li, W Xu, Z Xu, J Li, P Peng | Applied Intelligence 51, 8928-8944, 2021 | 5 | 2021 |
| VoiceShop: A Unified Speech-to-Speech Framework for Identity-Preserving Zero-Shot Voice Editing | P Anastassiou, Z Tang, K Peng, D Jia, J Li, M Tu, Y Wang, Y Wang, M Ma | arXiv preprint arXiv:2404.06674, 2024 | 4 | 2024 |
| Zero-shot accent conversion using pseudo siamese disentanglement network | D Jia, Q Tian, K Peng, J Li, Y Chen, M Ma, Y Wang, Y Wang | arXiv preprint arXiv:2212.05751, 2022 | 3 | 2022 |