Spremljaj
Pengcheng Zhu
Pengcheng Zhu
Fuxi AI Lab, NetEase Inc.
Preverjeni e-poštni naslov na corp.netease.com
Naslov
Navedeno
Navedeno
Leto
Opencpop: A high-quality open source chinese popular song corpus for singing voice synthesis
Y Wang, X Wang, P Zhu, J Wu, H Li, H Xue, Y Zhang, L Xie, M Bi
arXiv preprint arXiv:2201.07429, 2022
1112022
Visinger: Variational inference with adversarial learning for end-to-end singing voice synthesis
Y Zhang, J Cong, H Xue, L Xie, P Zhu, M Bi
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
872022
Head motion synthesis from speech using deep neural networks
C Ding, L Xie, P Zhu
Multimedia Tools and Applications 74, 9871-9888, 2015
672015
Articulatory movement prediction using deep bidirectional long short-term memory based recurrent neural networks and word/phone embeddings.
P Zhu, L Xie, Y Chen
Interspeech, 2192-2196, 2015
422015
BLSTM neural networks for speech driven head motion synthesis.
C Ding, P Zhu, L Xie
Interspeech, 3345-3349, 2015
292015
Expressive-vc: Highly expressive voice conversion with attention fusion of bottleneck and perturbation features
Z Ning, Q Xie, P Zhu, Z Wang, L Xue, J Yao, L Xie, M Bi
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
202023
Improving mandarin end-to-end speech synthesis by self-attention and learnable Gaussian bias
F Yang, S Yang, P Zhu, P Yan, L Xie
2019 IEEE automatic speech recognition and understanding workshop (ASRU …, 2019
192019
One-shot voice conversion for style transfer based on speaker adaptation
Z Wang, Q Xie, T Li, H Du, L Xie, P Zhu, M Bi
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
162022
Speech-driven head motion synthesis using neural networks.
C Ding, P Zhu, L Xie, D Jiang, ZH Fu
INTERSPEECH, 2303-2307, 2014
152014
Dualvc: Dual-mode voice conversion using intra-model knowledge distillation and hybrid predictive coding
Z Ning, Y Jiang, P Zhu, J Yao, S Wang, L Xie, M Bi
arXiv preprint arXiv:2305.12425, 2023
142023
Learn2sing 2.0: Diffusion and mutual information-based target speaker svs by learning from singing teacher
H Xue, X Wang, Y Zhang, L Xie, P Zhu, M Bi
arXiv preprint arXiv:2203.16408, 2022
112022
Dualvc 2: Dynamic masked convolution for unified streaming and non-streaming voice conversion
Z Ning, Y Jiang, P Zhu, S Wang, J Yao, L Xie, M Bi
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
72024
Accent-VITS: Accent transfer for end-to-end TTS
L Ma, Y Zhang, X Zhu, Y Lei, Z Ning, P Zhu, L Xie
National Conference on Man-Machine Speech Communication, 203-214, 2023
62023
Speech recognition by selecting and refining hot words
F Jin, W Liu, LJ Ma, PCPP Zhu, Y Qin, Q Shi, SL Zhang
US Patent 10,607,601, 2020
42020
Dualvc 3: Leveraging language model generated pseudo context for end-to-end low latency streaming voice conversion
Z Ning, S Wang, P Zhu, Z Wang, J Yao, L Xie, M Bi
arXiv preprint arXiv:2406.07846, 2024
32024
Head motion generation for speech driven talking avatar
B Li, L Xie, P Zhu, B Fan
J Tsinghua Univ (Sci & Tech) 53 (6), 898-902, 2013
32013
E1 tts: Simple and fast non-autoregressive tts
Z Liu, S Wang, P Zhu, M Bi, H Li
ICASSP 2025-2025 IEEE International Conference on Acoustics, Speech and …, 2025
22025
Spark-TTS: An Efficient LLM-Based Text-to-Speech Model with Single-Stream Decoupled Speech Tokens
X Wang, M Jiang, Z Ma, Z Zhang, S Liu, L Li, Z Liang, Q Zheng, R Wang, ...
arXiv preprint arXiv:2503.01710, 2025
12025
M-Vec: Matryoshka Speaker Embeddings with Flexible Dimensions
S Wang, P Zhu, H Li
International Conference on Social Robotics, 303-311, 2024
12024
DualVC 2: Dynamic Masked Convolution for Unified Streaming and Non-Streaming Voice Conversion
Z Ning, Y Jiang, P Zhu, S Wang, J Yao, L Xie, M Bi
arXiv preprint arXiv:2309.15496, 2023
12023
Sistem trenutno ne more izvesti postopka. Poskusite znova pozneje.
Članki 1–20