متابعة
Zhiyuan Zhao
Zhiyuan Zhao
Microsoft Research Lab Asia
بريد إلكتروني تم التحقق منه على microsoft.com
عنوان
عدد مرات الاقتباسات
عدد مرات الاقتباسات
السنة
Joint time-frequency and time domain learning for speech enhancement
C Tang, C Luo, Z Zhao, W Xie, W Zeng
Proceedings of the twenty-ninth international conference on international …, 2021
812021
Pinnsformer: A transformer-based framework for physics-informed neural networks
Z Zhao, X Ding, BA Prakash
arXiv preprint arXiv:2307.11833, 2023
442023
Art-v: Auto-regressive text-to-video generation with diffusion models
W Weng, R Feng, Y Wang, Q Dai, C Wang, D Yin, Z Zhao, K Qiu, J Bao, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
292024
Lstprompt: Large language models as zero-shot time series forecasters by long-short-term prompting
H Liu, Z Zhao, J Wang, H Kamarthi, BA Prakash
arXiv preprint arXiv:2402.16132, 2024
242024
Tsi-bench: Benchmarking time series imputation
W Du, J Wang, L Qian, Y Yang, Z Ibrahim, F Liu, Z Wang, H Liu, Z Zhao, ...
arXiv preprint arXiv:2406.12747, 2024
172024
TridentSE: Guiding speech enhancement with 32 global tokens
D Yin, Z Zhao, C Tang, Z Xiong, C Luo
arXiv preprint arXiv:2210.12995, 2022
152022
Microcinema: A divide-and-conquer approach for text-to-video generation
Y Wang, J Bao, W Weng, R Feng, D Yin, T Yang, J Zhang, Q Dai, Z Zhao, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
142024
RetrieverTTS: Modeling decomposed factors for text-based speech insertion
D Yin, C Tang, Y Liu, X Wang, Z Zhao, Y Zhao, Z Xiong, S Zhao, C Luo
arXiv preprint arXiv:2206.13865, 2022
122022
Zero-shot text-to-speech for text-based insertion in audio narration
C Tang, C Luo, Z Zhao, D Yin, Y Zhao, W Zeng
arXiv preprint arXiv:2109.05426, 2021
92021
General-purpose speech representation learning through a self-supervised multi-granularity framework
Y Zhao, D Yin, C Luo, Z Zhao, C Tang, W Zeng, ZJ Zha
arXiv preprint arXiv:2102.01930, 2021
82021
Performative time-series forecasting
Z Zhao, A Rodriguez, BA Prakash
arXiv preprint arXiv:2310.06077, 2023
42023
Time-mmd: A new multi-domain multimodal dataset for time series analysis
H Liu, S Xu, Z Zhao, L Kong, H Kamarthi, AB Sasanur, M Sharma, J Cui, ...
arXiv preprint arXiv:2406.08627, 2024
32024
Time-series forecasting for out-of-distribution generalization using invariant learning
H Liu, H Kamarthi, L Kong, Z Zhao, C Zhang, BA Prakash
arXiv preprint arXiv:2406.09130, 2024
22024
An anchor-free detector for continuous speech keyword spotting
Z Zhao, C Tang, C Yao, C Luo
arXiv preprint arXiv:2208.04622, 2022
22022
Time-mmd: Multi-domain multimodal dataset for time series analysis
H Liu, S Xu, Z Zhao, L Kong, H Prabhakar Kamarthi, A Sasanur, ...
Advances in Neural Information Processing Systems 37, 77888-77933, 2024
12024
Speech enhancement
T Chuanxin, Z Zhao, C Luo, W Zeng
US Patent App. 17/927,861, 2023
2023
Filler Word Detection with Hard Category Mining and Inter-Category Focal Loss
Z Zhao, L Wu, C Tang, D Yin, Y Zhao, C Luo
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
2023
ART• V: Auto-Regressive Text-to-Video Generation with Diffusion Models Supplementary Material
W Weng, R Feng, Y Wang, Q Dai, C Wang, D Yin, Z Zhao, K Qiu, J Bao, ...
context 1024, 1024, 0
يتعذر على النظام إجراء العملية في الوقت الحالي. عاود المحاولة لاحقًا.
مقالات 1–18