Spremljaj
Zhichao Wang
Zhichao Wang
Preverjeni e-poštni naslov na mail.nwpu.edu.cn
Naslov
Navedeno
Navedeno
Leto
Cross-speaker emotion disentangling and transfer for end-to-end speech synthesis
T Li, X Wang, Q Xie, Z Wang, L Xie
IEEE/ACM Transactions on Audio, Speech, and Language Processing 30, 1448-1460, 2022
442022
Lm-vc: Zero-shot voice conversion via speech generation based on language models
Z Wang, Y Chen, L Xie, Q Tian, Y Wang
IEEE Signal Processing Letters 30, 1157-1161, 2023
422023
Accent and speaker disentanglement in many-to-many voice conversion
Z Wang, W Ge, X Wang, S Yang, W Gan, H Chen, H Li, L Xie, X Li
2021 12th International Symposium on Chinese Spoken Language Processing …, 2021
372021
Enriching source style transfer in recognition-synthesis based non-parallel voice conversion
Z Wang, X Zhou, F Yang, T Li, H Du, L Xie, W Gan, H Chen, H Li
arXiv preprint arXiv:2106.08741, 2021
212021
Expressive-vc: Highly expressive voice conversion with attention fusion of bottleneck and perturbation features
Z Ning, Q Xie, P Zhu, Z Wang, L Xue, J Yao, L Xie, M Bi
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
202023
One-shot voice conversion for style transfer based on speaker adaptation
Z Wang, Q Xie, T Li, H Du, L Xie, P Zhu, M Bi
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
162022
Cross-speaker emotion transfer based on prosody compensation for end-to-end speech synthesis
T Li, X Wang, Q Xie, Z Wang, M Jiang, L Xie
arXiv preprint arXiv:2207.01198, 2022
152022
Multi-speaker multi-style text-to-speech synthesis with single-speaker single-style training data scenarios
Q Xie, T Li, X Wang, Z Wang, L Xie, G Yu, G Wan
2022 13th International Symposium on Chinese Spoken Language Processing …, 2022
122022
The NUS & NWPU system for voice conversion challenge 2020
X Tian, Z Wang, S Yang, X Zhou, H Du, Y Zhou, M Zhang, K Zhou, ...
Proc. Joint Workshop for the Blizzard Challenge and Voice Conversion …, 2020
112020
Vits-based singing voice conversion leveraging whisper and multi-scale f0 modeling
Z Ning, Y Jiang, Z Wang, B Zhang, L Xie
2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023
92023
IQDUBBING: Prosody modeling based on discrete self-supervised speech representation for expressive voice conversion
W Gan, B Wen, Y Yan, H Chen, Z Wang, H Du, L Xie, K Guo, H Li
arXiv preprint arXiv:2201.00269, 2022
92022
Streamvoice: Streamable context-aware language modeling for real-time zero-shot voice conversion
Z Wang, Y Chen, X Wang, L Xie, Y Wang
arXiv preprint arXiv:2401.11053, 2024
82024
Streaming voice conversion via intermediate bottleneck features and non-streaming teacher guidance
Y Chen, M Tu, T Li, X Li, Q Kong, J Li, Z Wang, Q Tian, Y Wang, Y Wang
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
72023
AccentSpeech: Learning accent from crowd-sourced data for target speaker TTS with accents
Y Zhang, Z Wang, P Yang, H Sun, Z Wang, L Xie
2022 13th International Symposium on Chinese Spoken Language Processing …, 2022
62022
Msm-vc: High-fidelity source style transfer for non-parallel voice conversion by multi-scale style modeling
Z Wang, X Wang, Q Xie, T Li, L Xie, Q Tian, Y Wang
IEEE/ACM Transactions on Audio, Speech, and Language Processing 31, 3883-3895, 2023
42023
Vec-Tok-VC+: Residual-enhanced Robust Zero-shot Voice Conversion with Progressive Constraints in a Dual-mode Training Strategy
L Ma, X Zhu, Y Lv, Z Wang, Z Wang, W He, H Zhou, L Xie
arXiv preprint arXiv:2406.09844, 2024
32024
Dualvc 3: Leveraging language model generated pseudo context for end-to-end low latency streaming voice conversion
Z Ning, S Wang, P Zhu, Z Wang, J Yao, L Xie, M Bi
arXiv preprint arXiv:2406.07846, 2024
32024
Delivering speaking style in low-resource voice conversion with multi-factor constraints
Z Wang, X Wang, L Xie, Y Chen, Q Tian, Y Wang
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
32023
StreamVoice+: Evolving into End-to-end Streaming Zero-shot Voice Conversion
Z Wang, Y Chen, X Wang, L Xie, Y Wang
IEEE Signal Processing Letters, 2024
22024
U-Style: Cascading U-Nets With Multi-Level Speaker and Style Modeling for Zero-Shot Voice Cloning
T Li, Z Wang, X Zhu, J Cong, Q Tian, Y Wang, L Xie
IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024
22024
Sistem trenutno ne more izvesti postopka. Poskusite znova pozneje.
Članki 1–20